GenomeNet

Database: UniProt
Entry: A0A3P9CGW2_9CICH
LinkDB: A0A3P9CGW2_9CICH
Original site: A0A3P9CGW2_9CICH 
ID   A0A3P9CGW2_9CICH        Unreviewed;       975 AA.
AC   A0A3P9CGW2;
DT   13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT   13-FEB-2019, sequence version 1.
DT   28-JAN-2026, entry version 28.
DE   SubName: Full=Collagen type XVIII alpha 1 chain {ECO:0000313|Ensembl:ENSMZEP00005021264.1};
OS   Maylandia zebra (zebra mbuna).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC   Pseudocrenilabrinae; Haplochromini; Maylandia; Maylandia zebra complex.
OX   NCBI_TaxID=106582 {ECO:0000313|Ensembl:ENSMZEP00005021264.1, ECO:0000313|Proteomes:UP000265160};
RN   [1] {ECO:0000313|Ensembl:ENSMZEP00005021264.1, ECO:0000313|Proteomes:UP000265160}
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=25186727; DOI=10.1038/nature13726;
RA   Brawand D., Wagner C.E., Li Y.I., Malinsky M., Keller I., Fan S.,
RA   Simakov O., Ng A.Y., Lim Z.W., Bezault E., Turner-Maier J., Johnson J.,
RA   Alcazar R., Noh H.J., Russell P., Aken B., Alfoldi J., Amemiya C.,
RA   Azzouzi N., Baroiller J.F., Barloy-Hubler F., Berlin A., Bloomquist R.,
RA   Carleton K.L., Conte M.A., D'Cotta H., Eshel O., Gaffney L., Galibert F.,
RA   Gante H.F., Gnerre S., Greuter L., Guyon R., Haddad N.S., Haerty W.,
RA   Harris R.M., Hofmann H.A., Hourlier T., Hulata G., Jaffe D.B., Lara M.,
RA   Lee A.P., MacCallum I., Mwaiko S., Nikaido M., Nishihara H.,
RA   Ozouf-Costaz C., Penman D.J., Przybylski D., Rakotomanga M., Renn S.C.P.,
RA   Ribeiro F.J., Ron M., Salzburger W., Sanchez-Pulido L., Santos M.E.,
RA   Searle S., Sharpe T., Swofford R., Tan F.J., Williams L., Young S., Yin S.,
RA   Okada N., Kocher T.D., Miska E.A., Lander E.S., Venkatesh B., Fernald R.D.,
RA   Meyer A., Ponting C.P., Streelman J.T., Lindblad-Toh K., Seehausen O.,
RA   Di Palma F.;
RT   "The genomic substrate for adaptive radiation in African cichlid fish.";
RL   Nature 513:375-381(2014).
RN   [2] {ECO:0000313|Ensembl:ENSMZEP00005021264.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (AUG-2025) to UniProtKB.
RN   [3] {ECO:0000313|Ensembl:ENSMZEP00005021264.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2025) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; A0A3P9CGW2; -.
DR   Ensembl; ENSMZET00005021966.1; ENSMZEP00005021264.1; ENSMZEG00005015478.1.
DR   GeneTree; ENSGT00940000165423; -.
DR   Proteomes; UP000265160; LG16.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Membrane {ECO:0000256|SAM:Phobius};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000265160};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729};
KW   Transmembrane {ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT   TRANSMEM        369..399
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        619..640
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          41..230
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          274..330
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          530..609
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          755..781
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        281..296
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        530..542
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        575..584
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        593..602
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        755..764
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   975 AA;  105661 MW;  3342380565F0BEEF CRC64;
     FLTHGNIFFF FGGGDQRAEV WFDVSTSKSV STVHVHEYTF GVSLLQLIGD PPPSEITQIY
     GPDNTPSFVF GPDANTGQLA RAHLPNPFYR EFSLMFNLKP TTNQGGVIFS ITDARQSIMH
     VGVKLSAVQD NNQNVILYYT KPKSAQSFEA ARFLVPSMTN TWTRFSLAVM DDKVLFYFNC
     DTDPQVVHIE RSAENMELES GAGVFVGQAG GADPDKFLGV IGELRVMGDP FAASRHCEED
     EDDSDVVSNN LSHTYWMTHY TELTGHMGSA GFGYKGVKGE PGPPGPPGPP GSPGPAPEYT
     VGSDGSVVSR VPGPRGPPGA PGPQGTPGAD GEPVSGLKII CLLENTKHNK CAVTVEPTLL
     YLSRFACPLY WLICFMVTKC WVLSNLTDLM FSCILLYFFN ILQGPMGPKG DRGDPGPPGY
     GEKGEKGEPG LVIGPDGNFL NLAQLAGPKG PYGPPGLKGE IGMPGRPGRP GINGYKGEKG
     EPGVGSGFGY PVSINTIINT LAWEAYIFFF SPGASPNVDI YTLRNELKGE RGESGVKGEK
     GEPGGGYYDP RFGGVQGPPG PPGLPGPKGD SIIGPPGPSG PTGPPGIGYD GRPGPPGPPG
     PPGTLSGSYR PNYPRTVPFV IFLKYICLMY GGLFCVYVQV TALRSYDTMI ATARRQEEGS
     LIYIIDRADL YLRVRDGVRQ VMLGEYNPFF RDLENEVAEV QPPPVILYPQ SQDQSQNNGA
     GHYSEGFTLI KPIEPPAPTP ADPRYFPKYD PRFPDQTHTG HTDGRFANQQ TESRFPVTPQ
     RQPVPPVLEP AGHFDIQGSG LHLIALNSPH TGNMRGIRGA DFLCFQQARA IGLKGTFRAF
     LSSKLQDLYT IVRRSDRDSV PIVNLKNQVL FSSWDSLFGD NASKMRENVP IYSFDGRDIL
     RDSAWPEKMV WHGSSKKGHR QTDQYCETWR TGDHAVTGLA SSLHSGHLLQ QSPSSCSGSY
     IVLCIENAFT SPSKK
//
DBGET integrated database retrieval system