GenomeNet

Database: UniProt
Entry: A0A8C3B7G1_CAIMO
LinkDB: A0A8C3B7G1_CAIMO
Original site: A0A8C3B7G1_CAIMO 
ID   A0A8C3B7G1_CAIMO        Unreviewed;      1482 AA.
AC   A0A8C3B7G1;
DT   19-JAN-2022, integrated into UniProtKB/TrEMBL.
DT   19-JAN-2022, sequence version 1.
DT   28-JAN-2026, entry version 19.
DE   SubName: Full=Collagen type XVIII alpha 1 chain {ECO:0000313|Ensembl:ENSCMMP00000001097.1};
OS   Cairina moschata (Muscovy duck).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Galloanserae; Anseriformes; Anatidae;
OC   Anatinae; Cairina.
OX   NCBI_TaxID=8855 {ECO:0000313|Ensembl:ENSCMMP00000001097.1, ECO:0000313|Proteomes:UP000694556};
RN   [1] {ECO:0000313|Ensembl:ENSCMMP00000001097.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Vignal A., Thebault N., Warren W.C.;
RT   "Common duck and Muscovy duck high density SNP chip.";
RL   Submitted (SEP-2018) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSCMMP00000001097.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (AUG-2025) to UniProtKB.
RN   [3] {ECO:0000313|Ensembl:ENSCMMP00000001097.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2025) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   Ensembl; ENSCMMT00000001233.1; ENSCMMP00000001097.1; ENSCMMG00000000683.1.
DR   Proteomes; UP000694556; Chromosome 7.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   CDD; cd00247; Endostatin-like; 1.
DR   CDD; cd00110; LamG; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR001791; Laminin_G.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1060; COLLAGEN ALPHA-1(IV) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 5.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00282; LamG; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000694556};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..22
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           23..1482
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5034133484"
FT   DOMAIN          165..353
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   DOMAIN          214..352
FT                   /note="Laminin G"
FT                   /evidence="ECO:0000259|SMART:SM00282"
FT   REGION          34..160
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          354..904
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          921..1162
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1223..1259
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        34..48
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        72..87
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        107..120
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        360..369
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        377..389
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        421..430
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        437..449
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        450..459
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        512..523
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        574..588
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        616..628
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        659..668
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        722..740
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        818..832
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        838..847
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        979..992
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1051..1065
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1099..1108
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1119..1135
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1142..1154
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1482 AA;  150579 MW;  F7BF31C6D8B7CCC5 CRC64;
     MAPHRLGLLL LLASCCFSSS EAQLLDWIWG SSKTTGPPAA PSTAATASRL AEPSPTPGTA
     AQRRDPAAGD PTATTQQHQE QDPSTAAPTG RESAEGTEQW DGSSAGTAAP TSVAWTAAPP
     SSTPPAQLPA GTGLPDATQP PRQPEGTGVS PQKPAAARPE NLSAEVSLLE LIGDPPPEEI
     LKIYGPDNNP GYVFGPNANT GQVARYHLPS PFYRDFSLLF HIQPTTPRAG VLFAVTDSSQ
     SIIYVGVKLS DLQAGKQRII FYYTEPGSPS SYAAATFTVP TLLNQWTRFA ISVEEDEVVL
     YLDCEEYERV RFERSPDEME LEQGSGLFVA QAGGADPDKY QGVIADLKLR GDPRAAEHQC
     EEEEDDAEAS GDFGSGAEDR RQPSGKDRGV PGLVDAVPVT SPPVAADGGS GSSSGSPQQA
     ERTRAEERLR VPAGGPKGEK GEKGERGLKG DSGTGGVLGT GSTKGEKGQK GELGIKGSAG
     FGYPGSKGQK GEPGEPGPPS RHSDGAVLEP VTGPPGPSGP PGLPGKEGAP GRDGEPGDPG
     EDGKPGVMGP QGFPGTPGEP GMKGEKGDPG VGPRGPPGLP GPPGPPGPSS KQDRLTFIDM
     EGSGFGGDLE TVRGPRGPPG PPGPPGVPGL PGEPGRFGMN STDLPGPPGL PGRDGVPGTP
     GPEGPQGPPG KDGMPGPPGP KGERGDVGDL GLPGAPGPKG SKGDTGPAGA PGEMGLAGLP
     GPIGPRGPPG PPGPPGPPGP AYEAGFADME GSGLPLAAGS PGPRGPDGPQ GVPGLPGIKG
     EVGSPGQPGV PGPKGDAGVP GVDGRPGLEG FPGPQGPKGD RGSPGEKGER GQDGAGLPGP
     PGPPGPPGQV IYISGEDRSL GAVPGPEGRP GHAGFPGPVG PKGDQGSPGL QGSPGLKGEK
     GEPGVIISPD GTIIAANVKG EKGEPGLRGP MGPSGPHGRA GLKGEIGFPG RPGRPGMNGL
     KGEKGDPADI SGMLGLRGPP GPPGPPGPPG PPGSIVYDSN NAFSDASRPA LPAFPGFHQF
     PGQKGEKGDV GAPGPPGQFP YDLSHFSASL RGDKGEAGPK GEKGEPGGTA LYGPSVMGPP
     GPQGYPGLPG PKGDSIVGPP GPPGPQGPPG IGYEGRQGPP GPPGPPGPPS FPGPHRPAIS
     IPGPPGPPGP PGPPGSSGTS LGLRALPTYQ AMLSAAHELP EGGLVFLADR QELYVRLRGG
     FRRVLLEEHT LVPSSALDNE VYDKPPSVHY GGSQHPLQPR GPLHPLRNHS PPPTARPWRG
     DEVVANQHRL PQPPLLQQHE LLNSYYVQRR PEPAPVAAHV HQDFQPALHL VALNAPLSGG
     MRGIRGADFQ CFQQARQVGL AGTFRAFLSS RLQDLYSIVR RADRATVPIV NLRDEVLFNN
     WEALFTGSEA PLRADARILS FDGRDVLRDS AWPQKSVWHG SDAKGRRLPE SYCEAWRTEE
     RGTSGQASSL SSGKLLEQAA SSCQHAFIVL CIENSFMTAA KK
//
DBGET integrated database retrieval system