GenomeNet

Database: UniProt
Entry: A0A922CKQ5_MANSE
LinkDB: A0A922CKQ5_MANSE
Original site: A0A922CKQ5_MANSE 
ID   A0A922CKQ5_MANSE        Unreviewed;       790 AA.
AC   A0A922CKQ5;
DT   22-FEB-2023, integrated into UniProtKB/TrEMBL.
DT   22-FEB-2023, sequence version 1.
DT   28-JAN-2026, entry version 12.
DE   RecName: Full=Collagen alpha-1(XV) chain {ECO:0008006|Google:ProtNLM};
GN   ORFNames=O3G_MSEX006655 {ECO:0000313|EMBL:KAG6450575.1};
OS   Manduca sexta (Tobacco hawkmoth) (Tobacco hornworm).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Bombycoidea;
OC   Sphingidae; Sphinginae; Sphingini; Manduca.
OX   NCBI_TaxID=7130 {ECO:0000313|EMBL:KAG6450575.1, ECO:0000313|Proteomes:UP000791440};
RN   [1] {ECO:0000313|EMBL:KAG6450575.1}
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=27522922;
RA   Kanost M.R., Arrese E.L., Cao X., Chen Y.R., Chellapilla S.,
RA   Goldsmith M.R., Grosse-Wilde E., Heckel D.G., Herndon N., Jiang H.,
RA   Papanicolaou A., Qu J., Soulages J.L., Vogel H., Walters J.,
RA   Waterhouse R.M., Ahn S.J., Almeida F.C., An C., Aqrawi P.,
RA   Bretschneider A., Bryant W.B., Bucks S., Chao H., Chevignon G.,
RA   Christen J.M., Clarke D.F., Dittmer N.T., Ferguson L.C.F., Garavelou S.,
RA   Gordon K.H.J., Gunaratna R.T., Han Y., Hauser F., He Y., Heidel-Fischer H.,
RA   Hirsh A., Hu Y., Jiang H., Kalra D., Klinner C., Konig C., Kovar C.,
RA   Kroll A.R., Kuwar S.S., Lee S.L., Lehman R., Li K., Li Z., Liang H.,
RA   Lovelace S., Lu Z., Mansfield J.H., McCulloch K.J., Mathew T., Morton B.,
RA   Muzny D.M., Neunemann D., Ongeri F., Pauchet Y., Pu L.L., Pyrousis I.,
RA   Rao X.J., Redding A., Roesel C., Sanchez-Gracia A., Schaack S., Shukla A.,
RA   Tetreau G., Wang Y., Xiong G.H., Traut W., Walsh T.K., Worley K.C., Wu D.,
RA   Wu W., Wu Y.Q., Zhang X., Zou Z., Zucker H., Briscoe A.D., Burmester T.,
RA   Clem R.J., Feyereisen R., Grimmelikhuijzen C.J.P., Hamodrakas S.J.,
RA   Hansson B.S., Huguet E., Jermiin L.S., Lan Q., Lehman H.K., Lorenzen M.,
RA   Merzendorfer H., Michalopoulos I., Morton D.B., Muthukrishnan S.,
RA   Oakeshott J.G., Palmer W., Park Y., Passarelli A.L., Rozas J.,
RA   Schwartz L.M., Smith W., Southgate A., Vilcinskas A., Vogt R., Wang P.,
RA   Werren J., Yu X.Q., Zhou J.J., Brown S.J., Scherer S.E., Richards S.,
RA   Blissard G.W.;
RT   "Multifaceted biological insights from a draft genome sequence of the
RT   tobacco hornworm moth, Manduca sexta.";
RL   Insect Biochem. Mol. Biol. 76:118-147(2016).
RN   [2] {ECO:0000313|EMBL:KAG6450575.1}
RP   NUCLEOTIDE SEQUENCE.
RA   Kanost M.;
RL   Submitted (DEC-2020) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KAG6450575.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JH668389; KAG6450575.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A922CKQ5; -.
DR   Proteomes; UP000791440; Unassembled WGS sequence.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF906; COLLAGEN ALPHA-2(IX) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000791440}.
FT   DOMAIN          508..553
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          590..755
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          1..435
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          440..459
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        13..31
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        74..85
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        176..185
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        200..216
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        220..235
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        258..273
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        289..306
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        313..330
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        331..360
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        361..386
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        390..405
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        418..430
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        449..459
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   790 AA;  81631 MW;  0A276E6DD72E642D CRC64;
     MPELRGPPGS PGPTGADGTT GAPGKTGQIG EPGPPGPPGS KGDRGERGEV GSPGTEGQPG
     PKGDPGLDGS TGPQGPPGPP GPPGPLTSAL MEATGLYGPS NPGIPGPQGE RGPMGLPGPQ
     GERGYPGNKG ERGLQGSKGD KGERGYVGPR GPHGAKGERG QTGRDGSPGL PGAHGRPAEK
     GEKGARGLPG PPGLPMPVVS ENSVSGDITR TESSLLQLPK IERGERGEKG EKGSRGMEGP
     QGFPGNDGKP GERGDIGPSG LPGTQGPPGL TGSKGDRGEP GPPGPVAISR DEALVMTKGE
     KGESGPRGKR GHPGPPGPRG PPGLPGPPGT PGTNGPSGDI GLPGWTGPPG AAGQQGPPGQ
     KGEKGDSGIN PHDLEKIKGD KGERGYDGAP GPPGKEGPRG PPGTPGAPAT NIQYISVPGP
     PGPPGPPGPP AIFANEVPVD SLTDTPGLNR REPGAGKQRD PLQILRSLNH LVHYRQDPYG
     YRDPLDPLGE NSDFEDDEDG RAIVGTILFK STDSLIRLGT NSPLGTLAYV IQEQALLVRV
     NNGWQYVAMG SLLAIHTPPA NGPTRTPLQN ILETSSLVHH KNPSVEGPVL RLAALNEPHT
     GDMHGVSSTN YECRRQAQRA NLDGTFRAFI SSRVQTIDSI VSWVDREIPV VNIRGDVLFN
     SWGEMFDGSG ALFAHAPRIF SFSGQNVLTD PGWPTKAVWH GASPNGEPAM DAYCDAWHSS
     SPDKYGLASS LRSNKLLDQE TYSCSSKLIV LCVEATPVDT VRRKKRSRYR ATEKLQFLKD
     HEGRNDTKQL
//
DBGET integrated database retrieval system