ID A0A922CKQ5_MANSE Unreviewed; 790 AA.
AC A0A922CKQ5;
DT 22-FEB-2023, integrated into UniProtKB/TrEMBL.
DT 22-FEB-2023, sequence version 1.
DT 28-JAN-2026, entry version 12.
DE RecName: Full=Collagen alpha-1(XV) chain {ECO:0008006|Google:ProtNLM};
GN ORFNames=O3G_MSEX006655 {ECO:0000313|EMBL:KAG6450575.1};
OS Manduca sexta (Tobacco hawkmoth) (Tobacco hornworm).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Bombycoidea;
OC Sphingidae; Sphinginae; Sphingini; Manduca.
OX NCBI_TaxID=7130 {ECO:0000313|EMBL:KAG6450575.1, ECO:0000313|Proteomes:UP000791440};
RN [1] {ECO:0000313|EMBL:KAG6450575.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=27522922;
RA Kanost M.R., Arrese E.L., Cao X., Chen Y.R., Chellapilla S.,
RA Goldsmith M.R., Grosse-Wilde E., Heckel D.G., Herndon N., Jiang H.,
RA Papanicolaou A., Qu J., Soulages J.L., Vogel H., Walters J.,
RA Waterhouse R.M., Ahn S.J., Almeida F.C., An C., Aqrawi P.,
RA Bretschneider A., Bryant W.B., Bucks S., Chao H., Chevignon G.,
RA Christen J.M., Clarke D.F., Dittmer N.T., Ferguson L.C.F., Garavelou S.,
RA Gordon K.H.J., Gunaratna R.T., Han Y., Hauser F., He Y., Heidel-Fischer H.,
RA Hirsh A., Hu Y., Jiang H., Kalra D., Klinner C., Konig C., Kovar C.,
RA Kroll A.R., Kuwar S.S., Lee S.L., Lehman R., Li K., Li Z., Liang H.,
RA Lovelace S., Lu Z., Mansfield J.H., McCulloch K.J., Mathew T., Morton B.,
RA Muzny D.M., Neunemann D., Ongeri F., Pauchet Y., Pu L.L., Pyrousis I.,
RA Rao X.J., Redding A., Roesel C., Sanchez-Gracia A., Schaack S., Shukla A.,
RA Tetreau G., Wang Y., Xiong G.H., Traut W., Walsh T.K., Worley K.C., Wu D.,
RA Wu W., Wu Y.Q., Zhang X., Zou Z., Zucker H., Briscoe A.D., Burmester T.,
RA Clem R.J., Feyereisen R., Grimmelikhuijzen C.J.P., Hamodrakas S.J.,
RA Hansson B.S., Huguet E., Jermiin L.S., Lan Q., Lehman H.K., Lorenzen M.,
RA Merzendorfer H., Michalopoulos I., Morton D.B., Muthukrishnan S.,
RA Oakeshott J.G., Palmer W., Park Y., Passarelli A.L., Rozas J.,
RA Schwartz L.M., Smith W., Southgate A., Vilcinskas A., Vogt R., Wang P.,
RA Werren J., Yu X.Q., Zhou J.J., Brown S.J., Scherer S.E., Richards S.,
RA Blissard G.W.;
RT "Multifaceted biological insights from a draft genome sequence of the
RT tobacco hornworm moth, Manduca sexta.";
RL Insect Biochem. Mol. Biol. 76:118-147(2016).
RN [2] {ECO:0000313|EMBL:KAG6450575.1}
RP NUCLEOTIDE SEQUENCE.
RA Kanost M.;
RL Submitted (DEC-2020) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KAG6450575.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH668389; KAG6450575.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A922CKQ5; -.
DR Proteomes; UP000791440; Unassembled WGS sequence.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF906; COLLAGEN ALPHA-2(IX) CHAIN; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000791440}.
FT DOMAIN 508..553
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 590..755
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 1..435
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 440..459
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 13..31
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 74..85
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 176..185
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 200..216
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 220..235
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 258..273
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 289..306
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 313..330
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 331..360
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 361..386
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 390..405
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 418..430
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 449..459
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 790 AA; 81631 MW; 0A276E6DD72E642D CRC64;
MPELRGPPGS PGPTGADGTT GAPGKTGQIG EPGPPGPPGS KGDRGERGEV GSPGTEGQPG
PKGDPGLDGS TGPQGPPGPP GPPGPLTSAL MEATGLYGPS NPGIPGPQGE RGPMGLPGPQ
GERGYPGNKG ERGLQGSKGD KGERGYVGPR GPHGAKGERG QTGRDGSPGL PGAHGRPAEK
GEKGARGLPG PPGLPMPVVS ENSVSGDITR TESSLLQLPK IERGERGEKG EKGSRGMEGP
QGFPGNDGKP GERGDIGPSG LPGTQGPPGL TGSKGDRGEP GPPGPVAISR DEALVMTKGE
KGESGPRGKR GHPGPPGPRG PPGLPGPPGT PGTNGPSGDI GLPGWTGPPG AAGQQGPPGQ
KGEKGDSGIN PHDLEKIKGD KGERGYDGAP GPPGKEGPRG PPGTPGAPAT NIQYISVPGP
PGPPGPPGPP AIFANEVPVD SLTDTPGLNR REPGAGKQRD PLQILRSLNH LVHYRQDPYG
YRDPLDPLGE NSDFEDDEDG RAIVGTILFK STDSLIRLGT NSPLGTLAYV IQEQALLVRV
NNGWQYVAMG SLLAIHTPPA NGPTRTPLQN ILETSSLVHH KNPSVEGPVL RLAALNEPHT
GDMHGVSSTN YECRRQAQRA NLDGTFRAFI SSRVQTIDSI VSWVDREIPV VNIRGDVLFN
SWGEMFDGSG ALFAHAPRIF SFSGQNVLTD PGWPTKAVWH GASPNGEPAM DAYCDAWHSS
SPDKYGLASS LRSNKLLDQE TYSCSSKLIV LCVEATPVDT VRRKKRSRYR ATEKLQFLKD
HEGRNDTKQL
//