ID H2P3G5_PONAB Unreviewed; 1749 AA.
AC H2P3G5;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 25-MAY-2022, sequence version 2.
DT 28-JAN-2026, entry version 82.
DE RecName: Full=Collagen alpha-1(XVIII) chain {ECO:0000256|ARBA:ARBA00069367};
OS Pongo abelii (Sumatran orangutan) (Pongo pygmaeus abelii).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Pongo.
OX NCBI_TaxID=9601 {ECO:0000313|Ensembl:ENSPPYP00000012859.2, ECO:0000313|Proteomes:UP000001595};
RN [1] {ECO:0000313|Ensembl:ENSPPYP00000012859.2, ECO:0000313|Proteomes:UP000001595}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Wilson R.K., Mardis E.;
RT "A 6x draft sequence assembly of the Pongo pygmaeus abelii genome.";
RL Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPPYP00000012859.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (AUG-2025) to UniProtKB.
RN [3] {ECO:0000313|Ensembl:ENSPPYP00000012859.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2025) to UniProtKB.
CC -!- FUNCTION: May regulate extracellular matrix-dependent motility and
CC morphogenesis of endothelial and non-endothelial cells; the function
CC requires homotrimerization and implicates MAPK signaling.
CC {ECO:0000256|ARBA:ARBA00053766}.
CC -!- FUNCTION: Probably plays a major role in determining the retinal
CC structure as well as in the closure of the neural tube.
CC {ECO:0000256|ARBA:ARBA00054383}.
CC -!- SUBUNIT: Forms homotrimers. Recombinant non-collagenous domain 1 has
CC stronger affinity to NID1, HSPG2 and laminin-1:NID1 complex and lower
CC affinity to FBLN1 and FBLN2 than endostatin.
CC {ECO:0000256|ARBA:ARBA00065165}.
CC -!- SUBUNIT: Monomeric. Interacts with KDR/VEGFR2. Interacts with the
CC ITGA5:ITGB1 complex. Interacts with NID1, HSPG2, laminin-1:NID1
CC complex, FBLN1 and FBLN2. {ECO:0000256|ARBA:ARBA00064471}.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix, basement membrane {ECO:0000256|ARBA:ARBA00004302}.
CC -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC {ECO:0000256|ARBA:ARBA00061275}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00090}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR FunCoup; H2P3G5; 423.
DR Ensembl; ENSPPYT00000013379.2; ENSPPYP00000012859.2; ENSPPYG00000011521.3.
DR eggNOG; KOG3546; Eukaryota.
DR eggNOG; KOG3577; Eukaryota.
DR GeneTree; ENSGT00940000158212; -.
DR HOGENOM; CLU_004003_1_0_1; -.
DR InParanoid; H2P3G5; -.
DR OMA; VQDQHQN; -.
DR Proteomes; UP000001595; Chromosome 21.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProtKB-ARBA.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0009653; P:anatomical structure morphogenesis; IEA:UniProtKB-ARBA.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR GO; GO:0042127; P:regulation of cell population proliferation; IEA:UniProtKB-ARBA.
DR CDD; cd07455; CRD_Collagen_XVIII; 1.
DR CDD; cd00247; Endostatin-like; 1.
DR FunFam; 1.10.2000.10:FF:000017; Alpha 1 type XVIII collagen; 1.
DR FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 1.10.2000.10; Frizzled cysteine-rich domain; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR035523; Collagen_XVIII_Fz.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR010363; DUF959_COL18_N.
DR InterPro; IPR020067; Frizzled_dom.
DR InterPro; IPR036790; Frizzled_dom_sf.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06121; DUF959; 1.
DR Pfam; PF06482; Endostatin; 1.
DR Pfam; PF01392; Fz; 1.
DR SMART; SM00063; FRI; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF63501; Frizzled cysteine-rich domain; 1.
DR PROSITE; PS50038; FZ; 1.
PE 3: Inferred from homology;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00090}; Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW Reference proteome {ECO:0000313|Proteomes:UP000001595};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW Zinc {ECO:0000256|ARBA:ARBA00022833}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..1749
FT /note="Collagen alpha-1(XVIII) chain"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5035317234"
FT DOMAIN 324..441
FT /note="FZ"
FT /evidence="ECO:0000259|PROSITE:PS50038"
FT REGION 42..101
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 145..248
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 640..666
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 678..1437
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1506..1550
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 152..175
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 199..228
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 712..733
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 757..784
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 793..805
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 810..826
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 828..841
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 857..869
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 899..909
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 914..937
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 941..956
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1000..1016
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1048..1060
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1090..1104
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1112..1121
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1136..1148
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1157..1174
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1249..1263
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1277..1291
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1316..1336
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1348..1363
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1393..1409
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1419..1431
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1514..1530
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1536..1546
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 339..385
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00090"
FT DISULFID 376..414
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00090"
SQ SEQUENCE 1749 AA; 178144 MW; 1A8A03C68DBB3BFA CRC64;
MAPDPRGCRI LLLLFCSLAA ARADLLNLNW LWFNNEDTSH AATTIPEPQG PLPAQPTHMA
PRNGSTEPAT VPGSPEPPSE LLEDGQDTPT SAESPDTPEE NIAGVGAKIL NVAQGIRSFV
QLWNDTVPTE SLTRAETLVL ETPVGPFALP GPSSTPQENG TTLWPSRGTP SSPGAHTTEA
GTLPAPTPSP PSLGRPWAPL TGPSVPPSSS GRASLSSLLG GAPPWGSLQD PDSQGLSPAA
AAASQQLQHP DVRLRTPLLH PLVTGSLGKH AAPSAFSSGL PGALSQVAIT TLIGDSGAWV
SHAANSAGPG LANNSALLRA DPEAPAGRCL PLPPSLPVCG RLGISRSWLP NHLHHESGEQ
VRAGAQAWGG LLRTRCHPFL AWFFCLLLAP PCGSVPPPTP PPCRQFCEAL QDACWSRLGG
GRLPVACASL PTQEDGYCVF IGPAAERVSE EVGLLQLLGD PPPQQVTQTD DPDVGLAYVF
GPDANSGQVA RYHFPSLFFR DFSLLFHIRP ATEGPGVLFA ITDSAQAVVL LGVKLSGVQD
GHQDISLLYT EPGAGQTHTA ASFRLPAFVG QWTHLALSVE GGYVTLYVDC EEFQRMLLAR
SSRGLELEPG AGLFVAQAGG ADPDKFQGMI AELKVRGDPQ VSPMHCLDEE GDDSDGASGD
FGSGLGDARE LLREETGAAL KPRLPMPPPV TTPPLAGGSS TEDSRSKEIE EQTTVASLGG
QTLPGSDSVS TWDGSVRTPG DRVREGGLKG QKGEPGVPGP PGRAGPPGSP CLPGPPGLPC
PVSPLGPAGP ALQPVPGPQG PPGPPGRDGT PGRDGEPGDP GEDGKPGDTG PQGFPGTPGD
VGPKGDKGDP GVGERGPPGP QGPPGPPGPS FRHDKLTFID MEGSGFGGDL EALRGPRGFP
GPPGPPGVPG LPGEPGLFGV NSSDVPGPAG LPGVPGREGP PGFPGLPGPP GPPGREGPPG
RTGQKGSLGE AGAPGHKGSK GDPGPAGARG ESGLAGSPGP AGPPGPPGPP GPPGPGLPAG
FDDMEGSGGP FWSTARSADG PQGPPGLPGL KGDPGMPGLP GAKGEVGADG APGFPGLPGR
EGIAGPQGPK GDRGSQGEKG DPGKDGVGQP GLPGPPGPPG PVVYVSEQDG SVLSVPGPEG
RPGFAGFPGP AGPKGDLGSK GERGSPGPKG EKGEPGSIFR PDGGALGPVQ KGAKGEPGFR
GPPGPYGRPG YKGEIGFPGR PGRPGMNGLK GEKGEPGDAS LGFGMRGMPG PPGPPGPPGP
PGTPVYDSNV FAESSRPGPP GLPGNQGPPG PKGAKGEVGP PGPPGQFPFD FLQLEAEMKG
EKGDRGDAGQ KGERGEPGGG GFFGSRLPGP PGPPGPPGYP GIPGPKGESI RGQPGPPGPQ
GPPGIGYEGR QGPPGPPGPP GPPGPPSFPG PHRQTISVPG PPGPPGPPGP PGTVGASSGV
RLWATRQAML GQVHEVPEGW LIFVAEQEEL YVRVRNGFRK VQLEARTPLP RGTDNEVAAL
QPPVVQLHDS NPYPRREHPH PTARPWRADD ILASPPRLPE PQPYPGAPHH SSYVHLRPAR
PTSPPAHTHR DFQPVLHLVA LNSPLSGSMR GIRGADFQCF QQARAVGLAG TFRAFLSSRL
QDLYSIVRRA DRAAVPIVNL KDELLFPSWE ALFSGSEGPL KPGARIFSFD GKDVLRHPTW
PQKSVWHGSD PNGRRLTESY CETWRTEAPS ATGQASSLLG GRLLGQNAAS CHHAYIVLCI
ENSFMTASK
//