GenomeNet

Database: UniProt
Entry: A0A2I0LLZ5_COLLI
ID   A0A2I0LLZ5_COLLI        Unreviewed;       846 AA.
AC   A0A2I0LLZ5;
DT   28-FEB-2018, integrated into UniProtKB/TrEMBL.
DT   28-FEB-2018, sequence version 1.
DT   28-JAN-2026, entry version 26.
DE   SubName: Full=Collagen alpha-1(XVIII) chain {ECO:0000313|EMBL:PKK18447.1};
GN   ORFNames=A306_00012815 {ECO:0000313|EMBL:PKK18447.1};
OS   Columba livia (Rock dove).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Neoaves; Columbimorphae; Columbiformes;
OC   Columbidae; Columba.
OX   NCBI_TaxID=8932 {ECO:0000313|EMBL:PKK18447.1, ECO:0000313|Proteomes:UP000053872};
RN   [1] {ECO:0000313|EMBL:PKK18447.1, ECO:0000313|Proteomes:UP000053872}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   TISSUE=Blood {ECO:0000313|EMBL:PKK18447.1};
RX   PubMed=23371554; DOI=10.1126/science.1230422;
RA   Shapiro M.D., Kronenberg Z., Li C., Domyan E.T., Pan H., Campbell M.,
RA   Tan H., Huff C.D., Hu H., Vickrey A.I., Nielsen S.C., Stringham S.A.,
RA   Hu H., Willerslev E., Gilbert M.T., Yandell M., Zhang G., Wang J.;
RT   "Genomic diversity and evolution of the head crest in the rock pigeon.";
RL   Science 339:1063-1067(2013).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:PKK18447.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AKCR02000201; PKK18447.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A2I0LLZ5; -.
DR   InParanoid; A0A2I0LLZ5; -.
DR   Proteomes; UP000053872; Unassembled WGS sequence.
DR   GO; GO:0005594; C:collagen type IX trimer; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1114; COLLAGEN ALPHA-1(XVIII) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:PKK18447.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000053872};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..28
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           29..846
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5014183448"
FT   DOMAIN          567..610
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          675..840
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          39..106
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          199..445
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        41..54
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        219..230
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        231..246
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        276..294
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        312..324
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        325..339
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        340..367
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        403..412
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        414..424
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        427..439
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   846 AA;  87462 MW;  F5BA3A26DFEC4904 CRC64;
     MGSGGPQRLL RALCILSVLA EHLPSATAQW FYPLGAEDTT PDPSVSPAAP TAPALDGEED
     VGSVEPPRKV LLSKPPLAMA PRGRQSLARG AQHRPTAAPE MFEGSAEEEE FLQIQTTAKG
     LTQRVLLAPE TDPALQMHNR SSCVCPVRPG PPGPKVQCVC VPGKRHRDHL GQIPWLFWPF
     KIFSALDLDS GFGVWGEKGD RGFPGERGQP GFSGEKGKTG SPGQPGHQGP RGPPGPPGPP
     GPPGPPGTWG GRSPPMAAAL PRGSENELGA SSPAGNPGPP GPPGLPGQPG PPGYPGHEGP
     PGVPGRDGKL GPPGPPGPVG PPGFPGAEGA PGSPGSAGPD GPPGAPGLPG PQGPPGVPGH
     EGPPGPTGPA ALPGKPGLRG EPGFPGLKGE KGEYGLPGMP GSPGRTGETG APGAPGPMGP
     PGPPGDYRCD SRHAGHRETA GPPGPKGCCY GEHGCKPGHL PFPGTGSQPS SWAPISGYQT
     GGKEEPEIYG AIIPHGLRGL PGNPGPPGPP GPPGAPGLLY FNRLYPSRAQ QPCKQPAATD
     TGWAADADIP RTELPDSRAD LQRQTWVFRS KELMLKSGSA VPEGSLVYVR EGSSAFLRTP
     TGWSRLLLED SESLFAGDDP SASTPQYQAT KRAQMKGDNM GTPVLAQTHS LVQKQEGQGL
     PQILPTTIAP RIPSLRLAAL NVPLAGDMSG VRGADLQCYR QSQEAQLYGT FRAFLSAPTQ
     DLVSIVKRTD RTLPVVNLKG QLLAKSWSSL FNGQAGAVPR GPIYSFNGRN VLTDPLWPQR
     LAWHGSTPRG GHAHRRDCQG WRSSGPGEGL AAPLGEGRLL AGQRHNCSQV LAVLCVEVAF
     PYRHMW
//