ID A0A2I0LLZ5_COLLI Unreviewed; 846 AA.
AC A0A2I0LLZ5;
DT 28-FEB-2018, integrated into UniProtKB/TrEMBL.
DT 28-FEB-2018, sequence version 1.
DT 28-JAN-2026, entry version 26.
DE SubName: Full=Collagen alpha-1(XVIII) chain {ECO:0000313|EMBL:PKK18447.1};
GN ORFNames=A306_00012815 {ECO:0000313|EMBL:PKK18447.1};
OS Columba livia (Rock dove).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Neoaves; Columbimorphae; Columbiformes;
OC Columbidae; Columba.
OX NCBI_TaxID=8932 {ECO:0000313|EMBL:PKK18447.1, ECO:0000313|Proteomes:UP000053872};
RN [1] {ECO:0000313|EMBL:PKK18447.1, ECO:0000313|Proteomes:UP000053872}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Blood {ECO:0000313|EMBL:PKK18447.1};
RX PubMed=23371554; DOI=10.1126/science.1230422;
RA Shapiro M.D., Kronenberg Z., Li C., Domyan E.T., Pan H., Campbell M.,
RA Tan H., Huff C.D., Hu H., Vickrey A.I., Nielsen S.C., Stringham S.A.,
RA Hu H., Willerslev E., Gilbert M.T., Yandell M., Zhang G., Wang J.;
RT "Genomic diversity and evolution of the head crest in the rock pigeon.";
RL Science 339:1063-1067(2013).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PKK18447.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKCR02000201; PKK18447.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2I0LLZ5; -.
DR InParanoid; A0A2I0LLZ5; -.
DR Proteomes; UP000053872; Unassembled WGS sequence.
DR GO; GO:0005594; C:collagen type IX trimer; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1114; COLLAGEN ALPHA-1(XVIII) CHAIN; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:PKK18447.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000053872};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..28
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 29..846
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5014183448"
FT DOMAIN 567..610
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 675..840
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 39..106
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 199..445
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 41..54
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 219..230
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 231..246
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 276..294
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 312..324
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 325..339
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 340..367
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 403..412
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 414..424
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 427..439
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 846 AA; 87462 MW; F5BA3A26DFEC4904 CRC64;
MGSGGPQRLL RALCILSVLA EHLPSATAQW FYPLGAEDTT PDPSVSPAAP TAPALDGEED
VGSVEPPRKV LLSKPPLAMA PRGRQSLARG AQHRPTAAPE MFEGSAEEEE FLQIQTTAKG
LTQRVLLAPE TDPALQMHNR SSCVCPVRPG PPGPKVQCVC VPGKRHRDHL GQIPWLFWPF
KIFSALDLDS GFGVWGEKGD RGFPGERGQP GFSGEKGKTG SPGQPGHQGP RGPPGPPGPP
GPPGPPGTWG GRSPPMAAAL PRGSENELGA SSPAGNPGPP GPPGLPGQPG PPGYPGHEGP
PGVPGRDGKL GPPGPPGPVG PPGFPGAEGA PGSPGSAGPD GPPGAPGLPG PQGPPGVPGH
EGPPGPTGPA ALPGKPGLRG EPGFPGLKGE KGEYGLPGMP GSPGRTGETG APGAPGPMGP
PGPPGDYRCD SRHAGHRETA GPPGPKGCCY GEHGCKPGHL PFPGTGSQPS SWAPISGYQT
GGKEEPEIYG AIIPHGLRGL PGNPGPPGPP GPPGAPGLLY FNRLYPSRAQ QPCKQPAATD
TGWAADADIP RTELPDSRAD LQRQTWVFRS KELMLKSGSA VPEGSLVYVR EGSSAFLRTP
TGWSRLLLED SESLFAGDDP SASTPQYQAT KRAQMKGDNM GTPVLAQTHS LVQKQEGQGL
PQILPTTIAP RIPSLRLAAL NVPLAGDMSG VRGADLQCYR QSQEAQLYGT FRAFLSAPTQ
DLVSIVKRTD RTLPVVNLKG QLLAKSWSSL FNGQAGAVPR GPIYSFNGRN VLTDPLWPQR
LAWHGSTPRG GHAHRRDCQG WRSSGPGEGL AAPLGEGRLL AGQRHNCSQV LAVLCVEVAF
PYRHMW
//