GenomeNet

Database: UniProt
Entry: H2P3G5_PONAB
LinkDB: H2P3G5_PONAB
Original site: H2P3G5_PONAB 
ID   H2P3G5_PONAB            Unreviewed;      1749 AA.
AC   H2P3G5;
DT   21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT   25-MAY-2022, sequence version 2.
DT   28-JAN-2026, entry version 82.
DE   RecName: Full=Collagen alpha-1(XVIII) chain {ECO:0000256|ARBA:ARBA00069367};
OS   Pongo abelii (Sumatran orangutan) (Pongo pygmaeus abelii).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Pongo.
OX   NCBI_TaxID=9601 {ECO:0000313|Ensembl:ENSPPYP00000012859.2, ECO:0000313|Proteomes:UP000001595};
RN   [1] {ECO:0000313|Ensembl:ENSPPYP00000012859.2, ECO:0000313|Proteomes:UP000001595}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Wilson R.K., Mardis E.;
RT   "A 6x draft sequence assembly of the Pongo pygmaeus abelii genome.";
RL   Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSPPYP00000012859.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (AUG-2025) to UniProtKB.
RN   [3] {ECO:0000313|Ensembl:ENSPPYP00000012859.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2025) to UniProtKB.
CC   -!- FUNCTION: May regulate extracellular matrix-dependent motility and
CC       morphogenesis of endothelial and non-endothelial cells; the function
CC       requires homotrimerization and implicates MAPK signaling.
CC       {ECO:0000256|ARBA:ARBA00053766}.
CC   -!- FUNCTION: Probably plays a major role in determining the retinal
CC       structure as well as in the closure of the neural tube.
CC       {ECO:0000256|ARBA:ARBA00054383}.
CC   -!- SUBUNIT: Forms homotrimers. Recombinant non-collagenous domain 1 has
CC       stronger affinity to NID1, HSPG2 and laminin-1:NID1 complex and lower
CC       affinity to FBLN1 and FBLN2 than endostatin.
CC       {ECO:0000256|ARBA:ARBA00065165}.
CC   -!- SUBUNIT: Monomeric. Interacts with KDR/VEGFR2. Interacts with the
CC       ITGA5:ITGB1 complex. Interacts with NID1, HSPG2, laminin-1:NID1
CC       complex, FBLN1 and FBLN2. {ECO:0000256|ARBA:ARBA00064471}.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix, basement membrane {ECO:0000256|ARBA:ARBA00004302}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00090}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   FunCoup; H2P3G5; 423.
DR   Ensembl; ENSPPYT00000013379.2; ENSPPYP00000012859.2; ENSPPYG00000011521.3.
DR   eggNOG; KOG3546; Eukaryota.
DR   eggNOG; KOG3577; Eukaryota.
DR   GeneTree; ENSGT00940000158212; -.
DR   HOGENOM; CLU_004003_1_0_1; -.
DR   InParanoid; H2P3G5; -.
DR   OMA; VQDQHQN; -.
DR   Proteomes; UP000001595; Chromosome 21.
DR   GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProtKB-ARBA.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR   GO; GO:0009653; P:anatomical structure morphogenesis; IEA:UniProtKB-ARBA.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   GO; GO:0042127; P:regulation of cell population proliferation; IEA:UniProtKB-ARBA.
DR   CDD; cd07455; CRD_Collagen_XVIII; 1.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 1.10.2000.10:FF:000017; Alpha 1 type XVIII collagen; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 1.10.2000.10; Frizzled cysteine-rich domain; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR035523; Collagen_XVIII_Fz.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR010363; DUF959_COL18_N.
DR   InterPro; IPR020067; Frizzled_dom.
DR   InterPro; IPR036790; Frizzled_dom_sf.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06121; DUF959; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   Pfam; PF01392; Fz; 1.
DR   SMART; SM00063; FRI; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   SUPFAM; SSF63501; Frizzled cysteine-rich domain; 1.
DR   PROSITE; PS50038; FZ; 1.
PE   3: Inferred from homology;
KW   Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00090}; Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000001595};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW   Zinc {ECO:0000256|ARBA:ARBA00022833}.
FT   SIGNAL          1..23
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           24..1749
FT                   /note="Collagen alpha-1(XVIII) chain"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5035317234"
FT   DOMAIN          324..441
FT                   /note="FZ"
FT                   /evidence="ECO:0000259|PROSITE:PS50038"
FT   REGION          42..101
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          145..248
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          640..666
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          678..1437
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1506..1550
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        152..175
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        199..228
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        712..733
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        757..784
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        793..805
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        810..826
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        828..841
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        857..869
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        899..909
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        914..937
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        941..956
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1000..1016
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1048..1060
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1090..1104
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1112..1121
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1136..1148
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1157..1174
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1249..1263
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1277..1291
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1316..1336
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1348..1363
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1393..1409
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1419..1431
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1514..1530
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1536..1546
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   DISULFID        339..385
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00090"
FT   DISULFID        376..414
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00090"
SQ   SEQUENCE   1749 AA;  178144 MW;  1A8A03C68DBB3BFA CRC64;
     MAPDPRGCRI LLLLFCSLAA ARADLLNLNW LWFNNEDTSH AATTIPEPQG PLPAQPTHMA
     PRNGSTEPAT VPGSPEPPSE LLEDGQDTPT SAESPDTPEE NIAGVGAKIL NVAQGIRSFV
     QLWNDTVPTE SLTRAETLVL ETPVGPFALP GPSSTPQENG TTLWPSRGTP SSPGAHTTEA
     GTLPAPTPSP PSLGRPWAPL TGPSVPPSSS GRASLSSLLG GAPPWGSLQD PDSQGLSPAA
     AAASQQLQHP DVRLRTPLLH PLVTGSLGKH AAPSAFSSGL PGALSQVAIT TLIGDSGAWV
     SHAANSAGPG LANNSALLRA DPEAPAGRCL PLPPSLPVCG RLGISRSWLP NHLHHESGEQ
     VRAGAQAWGG LLRTRCHPFL AWFFCLLLAP PCGSVPPPTP PPCRQFCEAL QDACWSRLGG
     GRLPVACASL PTQEDGYCVF IGPAAERVSE EVGLLQLLGD PPPQQVTQTD DPDVGLAYVF
     GPDANSGQVA RYHFPSLFFR DFSLLFHIRP ATEGPGVLFA ITDSAQAVVL LGVKLSGVQD
     GHQDISLLYT EPGAGQTHTA ASFRLPAFVG QWTHLALSVE GGYVTLYVDC EEFQRMLLAR
     SSRGLELEPG AGLFVAQAGG ADPDKFQGMI AELKVRGDPQ VSPMHCLDEE GDDSDGASGD
     FGSGLGDARE LLREETGAAL KPRLPMPPPV TTPPLAGGSS TEDSRSKEIE EQTTVASLGG
     QTLPGSDSVS TWDGSVRTPG DRVREGGLKG QKGEPGVPGP PGRAGPPGSP CLPGPPGLPC
     PVSPLGPAGP ALQPVPGPQG PPGPPGRDGT PGRDGEPGDP GEDGKPGDTG PQGFPGTPGD
     VGPKGDKGDP GVGERGPPGP QGPPGPPGPS FRHDKLTFID MEGSGFGGDL EALRGPRGFP
     GPPGPPGVPG LPGEPGLFGV NSSDVPGPAG LPGVPGREGP PGFPGLPGPP GPPGREGPPG
     RTGQKGSLGE AGAPGHKGSK GDPGPAGARG ESGLAGSPGP AGPPGPPGPP GPPGPGLPAG
     FDDMEGSGGP FWSTARSADG PQGPPGLPGL KGDPGMPGLP GAKGEVGADG APGFPGLPGR
     EGIAGPQGPK GDRGSQGEKG DPGKDGVGQP GLPGPPGPPG PVVYVSEQDG SVLSVPGPEG
     RPGFAGFPGP AGPKGDLGSK GERGSPGPKG EKGEPGSIFR PDGGALGPVQ KGAKGEPGFR
     GPPGPYGRPG YKGEIGFPGR PGRPGMNGLK GEKGEPGDAS LGFGMRGMPG PPGPPGPPGP
     PGTPVYDSNV FAESSRPGPP GLPGNQGPPG PKGAKGEVGP PGPPGQFPFD FLQLEAEMKG
     EKGDRGDAGQ KGERGEPGGG GFFGSRLPGP PGPPGPPGYP GIPGPKGESI RGQPGPPGPQ
     GPPGIGYEGR QGPPGPPGPP GPPGPPSFPG PHRQTISVPG PPGPPGPPGP PGTVGASSGV
     RLWATRQAML GQVHEVPEGW LIFVAEQEEL YVRVRNGFRK VQLEARTPLP RGTDNEVAAL
     QPPVVQLHDS NPYPRREHPH PTARPWRADD ILASPPRLPE PQPYPGAPHH SSYVHLRPAR
     PTSPPAHTHR DFQPVLHLVA LNSPLSGSMR GIRGADFQCF QQARAVGLAG TFRAFLSSRL
     QDLYSIVRRA DRAAVPIVNL KDELLFPSWE ALFSGSEGPL KPGARIFSFD GKDVLRHPTW
     PQKSVWHGSD PNGRRLTESY CETWRTEAPS ATGQASSLLG GRLLGQNAAS CHHAYIVLCI
     ENSFMTASK
//
DBGET integrated database retrieval system