ID G1R465_NOMLE Unreviewed; 1499 AA.
AC G1R465;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 28-FEB-2018, sequence version 2.
DT 24-JAN-2024, entry version 66.
DE SubName: Full=Collagen type V alpha 2 chain {ECO:0000313|Ensembl:ENSNLEP00000007987.2};
GN Name=COL5A2 {ECO:0000313|Ensembl:ENSNLEP00000007987.2};
OS Nomascus leucogenys (Northern white-cheeked gibbon) (Hylobates leucogenys).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hylobatidae;
OC Nomascus.
OX NCBI_TaxID=61853 {ECO:0000313|Ensembl:ENSNLEP00000007987.2, ECO:0000313|Proteomes:UP000001073};
RN [1] {ECO:0000313|Ensembl:ENSNLEP00000007987.2, ECO:0000313|Proteomes:UP000001073}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Gibbon Genome Sequencing Consortium;
RL Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSNLEP00000007987.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (JUL-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADFV01059261; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01059262; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01059263; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01059264; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01059265; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01059266; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 61853.ENSNLEP00000007987; -.
DR Ensembl; ENSNLET00000008371.2; ENSNLEP00000007987.2; ENSNLEG00000006541.2.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000155675; -.
DR HOGENOM; CLU_001074_2_3_1; -.
DR InParanoid; G1R465; -.
DR OMA; CESPQVP; -.
DR TreeFam; TF344135; -.
DR Proteomes; UP000001073; Chromosome 22a.
DR GO; GO:0005588; C:collagen type V trimer; IEA:Ensembl.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0046332; F:SMAD binding; IEA:Ensembl.
DR GO; GO:0071230; P:cellular response to amino acid stimulus; IEA:Ensembl.
DR GO; GO:0030199; P:collagen fibril organization; IEA:Ensembl.
DR GO; GO:0048592; P:eye morphogenesis; IEA:Ensembl.
DR GO; GO:1903225; P:negative regulation of endodermal cell differentiation; IEA:Ensembl.
DR GO; GO:0001501; P:skeletal system development; IEA:Ensembl.
DR GO; GO:0043588; P:skin development; IEA:Ensembl.
DR Gene3D; 2.60.120.1000; -; 1.
DR Gene3D; 6.20.200.20; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR001007; VWF_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1079; COLLAGEN ALPHA-1(IV) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 7.
DR Pfam; PF00093; VWC; 1.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00214; VWC; 1.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000001073};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT DOMAIN 39..97
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 1266..1499
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 104..1268
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 168..183
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 675..689
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 921..935
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1030..1044
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1128..1143
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1171..1185
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1212..1227
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1499 AA; 144958 MW; 7D127A1E2575023D CRC64;
MMANWAETRP LLILIVLLGQ FVSIKAQEED EDEGYGEEIA CTQNGQMYLN RDIWKPAPCQ
ICVCDNGAIL CDKIECQDVL DCADPVTPPG ECCPVCSQTP GGGNTNFGRG RKGQKGEPGL
VPVVTGIRGR PGPAGPPGSQ GPRGERGPKG RPGPRGPQGI DGEPGVPGQP GAPGPPGHPS
HPGPDGMSRP FSAQMAGLDE KSGLGSQVGL MPGSVGPVGP RGPQGLQGQQ GGAGPTGPPG
EPGDPGPMGP IGSRGPEGPP GKPGEDGEPG RNGNPGEVGF AGSPGARGFP GAPGLPGLKG
HRGHKGLEGP KGEVGAPGSK GEAGPTGPMG AMGPLGPRGM PGERGRLGPQ GAPGQRGAHG
MPGKPGPMGP LGIPGSSGFP GNPGMKGEAG PTGARGPEGP QGQRGETGPP GPVGSPGLPG
AVGTDGTPGA KGPTGSPGTS GPPGSAGPPG SPGPQGSTGP QGIRGQPGDP GVPGFKGEAG
PKGEPGPHGI QGPIGPPGEE GKRGPRGDPG TVGPPGPVGE RGAPGNRGFP GSDGLPGPKG
AQGERGPVGS SGPKGSQGDP GRPGEPGLPG ARGLTGNPGV QGPEGKLGPL GAPGEDGRPG
PPGSIGIRGQ PGSMGLPGPK GSSGDPGKPG EAGNAGVPGQ RGAPGKDGEV GPSGPVGPPG
LAGERGEQGP PGPTGFQGLP GPPGPPGEGG KPGDQGVPGD PGAVGPLGPR GERGNPGERG
EPGITGLPGE KGMAGGHGPD GPKGSPGPSG TPGDTGPPGL QGMPGERGIA GTPGPKGDRG
GIGEKGAEGT AGNDGARGLP GPLGPPGPAG PTGEKGEPGP RGLVGPPGSR GNPGSRGENG
PTGAVGFAGP QGPDGQPGVK GEPGEPGQKG DAGSPGPQGL AGSPGPHGPN GVPGLKGGRG
TQGPPGATGF PGSAGRVGPP GPAGAPGPAG PLGEPGKEGP PGLRGDPGSH GRVGDRGPAG
PPGGPGDKGD PGEDGQPGPD GPPGPAGTTG QRGIVGMPGQ RGERGMPGLP GPAGTPGKVG
PTGATGDKGP PGPVGPPGSN GPVGEPGPEG PAGNDGTPGR DGAVGERGDR GDPGPAGLPG
SQGAPGTPGP VGAPGDAGQR GEPGSRGPIG PPGRAGKRGL PGPQGPRGDK GDHGDRGDRG
QKGHRGFTGL QGLPGPPGPN GEQGSAGIPG PFGPRGPPGP VGPSGKEGNP GPLGPIGPPG
VRGSVGEAGP EGPPGEPGPP GPPGPPGHLT AALGDIMGHY DESMPDPLPE FTEDQAAPDD
KNKTDPGVHA TLKSLSSQIE TMRSPDGSKK HPARTCDDLK LCHSAKQSGE YWIDPNQGSV
EDAIKVYCNM ETGETCISAN PSSVPRKTWW ASKSPDNKPV WYGLDMNRGS QFAYGDHQSP
NTAITQMTFL RLLSKEASQN ITYICKNSVG YMDDQAKNLK KAVVLKGAND LDIKAEGNIR
FRYIVLQDTC SKRNGNVGKT VFEYRTQNVA RLPIIDLAPV DVGGTDQEFG VEIGPVCFV
//