ID A0A673G2X2_9TELE Unreviewed; 1275 AA.
AC A0A673G2X2;
DT 17-JUN-2020, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 1.
DT 28-JAN-2026, entry version 23.
DE RecName: Full=Collagen alpha-1(XVIII) chain {ECO:0000256|ARBA:ARBA00069367};
OS Sinocyclocheilus rhinocerous.
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC Cyprinidae; Cyprininae; Sinocyclocheilus.
OX NCBI_TaxID=307959 {ECO:0000313|Ensembl:ENSSRHP00000006745.1, ECO:0000313|Proteomes:UP000472270};
RN [1] {ECO:0000313|Ensembl:ENSSRHP00000006745.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (AUG-2025) to UniProtKB.
RN [2] {ECO:0000313|Ensembl:ENSSRHP00000006745.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2025) to UniProtKB.
CC -!- FUNCTION: May regulate extracellular matrix-dependent motility and
CC morphogenesis of endothelial and non-endothelial cells; the function
CC requires homotrimerization and implicates MAPK signaling.
CC {ECO:0000256|ARBA:ARBA00053766}.
CC -!- FUNCTION: Probably plays a major role in determining the retinal
CC structure as well as in the closure of the neural tube.
CC {ECO:0000256|ARBA:ARBA00054383}.
CC -!- SUBUNIT: Forms homotrimers. Recombinant non-collagenous domain 1 has
CC stronger affinity to NID1, HSPG2 and laminin-1:NID1 complex and lower
CC affinity to FBLN1 and FBLN2 than endostatin.
CC {ECO:0000256|ARBA:ARBA00065165}.
CC -!- SUBUNIT: Monomeric. Interacts with KDR/VEGFR2. Interacts with the
CC ITGA5:ITGB1 complex. Interacts with NID1, HSPG2, laminin-1:NID1
CC complex, FBLN1 and FBLN2. {ECO:0000256|ARBA:ARBA00064471}.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix, basement membrane {ECO:0000256|ARBA:ARBA00004302}.
CC -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC {ECO:0000256|ARBA:ARBA00061275}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00090}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A673G2X2; -.
DR Ensembl; ENSSRHT00000006973.1; ENSSRHP00000006745.1; ENSSRHG00000003921.1.
DR Proteomes; UP000472270; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0009653; P:anatomical structure morphogenesis; IEA:UniProtKB-ARBA.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR GO; GO:0042127; P:regulation of cell population proliferation; IEA:UniProtKB-ARBA.
DR CDD; cd07455; CRD_Collagen_XVIII; 1.
DR CDD; cd00247; Endostatin-like; 1.
DR FunFam; 1.10.2000.10:FF:000017; Alpha 1 type XVIII collagen; 1.
DR FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 1.10.2000.10; Frizzled cysteine-rich domain; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR035523; Collagen_XVIII_Fz.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR020067; Frizzled_dom.
DR InterPro; IPR036790; Frizzled_dom_sf.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR Pfam; PF01392; Fz; 1.
DR SMART; SM00063; FRI; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF63501; Frizzled cysteine-rich domain; 1.
DR PROSITE; PS50038; FZ; 1.
PE 3: Inferred from homology;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00090}; Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW Reference proteome {ECO:0000313|Proteomes:UP000472270};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW Zinc {ECO:0000256|ARBA:ARBA00022833}.
FT SIGNAL 1..24
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 25..1275
FT /note="Collagen alpha-1(XVIII) chain"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5025673866"
FT DOMAIN 208..327
FT /note="FZ"
FT /evidence="ECO:0000259|PROSITE:PS50038"
FT REGION 525..937
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1008..1096
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 543..552
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 587..602
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 609..621
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 645..655
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 677..688
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 721..731
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 733..742
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 817..832
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 851..868
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 901..911
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 922..934
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 223..269
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00090"
FT DISULFID 260..298
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00090"
SQ SEQUENCE 1275 AA; 137440 MW; F1355994E3FF3CEE CRC64;
VMSKLRFWLC LFILVCLRIR HTHGWFWFDD LKENAKGAQT PAYLTTVRPT SPPRTEPHRT
TGTSASVTEV LKEVSSDGGH FKRKSEFRSG LGVESESISG SWSEFDGYVT STNFLGIAGE
DARSHVGNFK EEQMDRLQRI SNPTDSRFNE SMNVSHKNVS TYNDSNGRNS TEHDDYDNTT
LYSSEYSVSD NLLTIESEFL VASMPTAVES PRCLPVDSDL PFCTRMGVES FTVPNFLNQS
SVEEVQVVLT EWAWLLRSNC HYSLEWFFCL LLTPRCGPPG LPSRLPCRSF CEVLRDSCWT
LLDEGRLPVE CHSLPEEKHD GYRCLSVSNQ KEESGVSLLQ LIGDPPPDGV AKVFDDANNP
GYVFDQRSNV GQSAAAHLPN PFFRDFSLIF NIKPTSSKPG VIFSITDPIQ NIMYVGVKLS
AVEKGKQYII FYYTEPDSQS SYEAARFSVP SMVNTWTRFS ISVLNERVSL YFNCDSDPQV
ISFEQSPDDM DLDAGAGVFV GHASGADPDK FLGVIGDIRM LKDPGAAERH CEEDEDDFDA
GSGDYGASGD GEGPPSVQPT PPSSRPIQQP PVTSRRPLVE KQQTVAKGEK GDRGEQGAKG
DRGLVGPKGD AGSGSVSGGG AKEVKVCSDG NPGFGYPGSK GDRGPPGPPG PPGPAGPSAE
VEVRGDGSVV QRVAGPRGPP GPSGPPGPAG ADGEPVSRSQ SKEWLNYSLF SIKRGESQPG
PRGPPGPPGP PSRSDRPGEK GEPGLILGPD GNPHYLGGLT GQKGERGLSG PVGPPGPAGP
FGLKGEFGMP GRPGRPGVNG YKGEKGETGS GSGYDYPGPP GPPGPPGPPG PAVPLDRLGF
SSNIDIYALK NEMKGEHGEP GLKGEKGEPG GGFYDPRFGA VQGPPGNPGL PGPKGDSIRG
PPGPQGPPGP PGVGYDGRPG NPGSPGPPGP PGSPSLPGVA FLRSYDIMMA TARRQTEGAL
IYILDRNDLY LRVRDGVRQV MLGDYKPFYG GLDNEVAAVQ PPPVVHYSQD HTANNGAEQI
SPPHPPIEFP RREPENRNPN APDSRYPDQR YPPYTDNKYT DPVQPHRYPV QPERNPITPA
RHPSPPVNQP EGHVHTSGPG LHVIALNSPQ VGNMRGIRGA DFLCFQQARA VGLKGTFRAF
LSSKLQDLYS IVRKSDRETL PIVNLKDQVL FRSWESLFSD SESRMKDNAP IYSFDGRDVL
RDSAWPEKMI WHGSSGRGHR QTDNYCETWR AGDRAVTGLA SSLQAGQLLQ QTSSSCSSSY
IVLCIENSYM TQSKK
//