GenomeNet

Database: UniProt
Entry: G1R465_NOMLE
LinkDB: G1R465_NOMLE
Original site: G1R465_NOMLE 
ID   G1R465_NOMLE            Unreviewed;      1499 AA.
AC   G1R465;
DT   19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT   28-FEB-2018, sequence version 2.
DT   24-JAN-2024, entry version 66.
DE   SubName: Full=Collagen type V alpha 2 chain {ECO:0000313|Ensembl:ENSNLEP00000007987.2};
GN   Name=COL5A2 {ECO:0000313|Ensembl:ENSNLEP00000007987.2};
OS   Nomascus leucogenys (Northern white-cheeked gibbon) (Hylobates leucogenys).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hylobatidae;
OC   Nomascus.
OX   NCBI_TaxID=61853 {ECO:0000313|Ensembl:ENSNLEP00000007987.2, ECO:0000313|Proteomes:UP000001073};
RN   [1] {ECO:0000313|Ensembl:ENSNLEP00000007987.2, ECO:0000313|Proteomes:UP000001073}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG   Gibbon Genome Sequencing Consortium;
RL   Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSNLEP00000007987.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (JUL-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ADFV01059261; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; ADFV01059262; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; ADFV01059263; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; ADFV01059264; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; ADFV01059265; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; ADFV01059266; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   STRING; 61853.ENSNLEP00000007987; -.
DR   Ensembl; ENSNLET00000008371.2; ENSNLEP00000007987.2; ENSNLEG00000006541.2.
DR   eggNOG; KOG3544; Eukaryota.
DR   GeneTree; ENSGT00940000155675; -.
DR   HOGENOM; CLU_001074_2_3_1; -.
DR   InParanoid; G1R465; -.
DR   OMA; CESPQVP; -.
DR   TreeFam; TF344135; -.
DR   Proteomes; UP000001073; Chromosome 22a.
DR   GO; GO:0005588; C:collagen type V trimer; IEA:Ensembl.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR   GO; GO:0046332; F:SMAD binding; IEA:Ensembl.
DR   GO; GO:0071230; P:cellular response to amino acid stimulus; IEA:Ensembl.
DR   GO; GO:0030199; P:collagen fibril organization; IEA:Ensembl.
DR   GO; GO:0048592; P:eye morphogenesis; IEA:Ensembl.
DR   GO; GO:1903225; P:negative regulation of endodermal cell differentiation; IEA:Ensembl.
DR   GO; GO:0001501; P:skeletal system development; IEA:Ensembl.
DR   GO; GO:0043588; P:skin development; IEA:Ensembl.
DR   Gene3D; 2.60.120.1000; -; 1.
DR   Gene3D; 6.20.200.20; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   InterPro; IPR001007; VWF_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1079; COLLAGEN ALPHA-1(IV) CHAIN; 1.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 7.
DR   Pfam; PF00093; VWC; 1.
DR   SMART; SM00038; COLFI; 1.
DR   SMART; SM00214; VWC; 1.
DR   SUPFAM; SSF57603; FnI-like domain; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
DR   PROSITE; PS01208; VWFC_1; 1.
DR   PROSITE; PS50184; VWFC_2; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Reference proteome {ECO:0000313|Proteomes:UP000001073};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT   DOMAIN          39..97
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   DOMAIN          1266..1499
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          104..1268
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        168..183
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        675..689
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        921..935
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1030..1044
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1128..1143
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1171..1185
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1212..1227
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1499 AA;  144958 MW;  7D127A1E2575023D CRC64;
     MMANWAETRP LLILIVLLGQ FVSIKAQEED EDEGYGEEIA CTQNGQMYLN RDIWKPAPCQ
     ICVCDNGAIL CDKIECQDVL DCADPVTPPG ECCPVCSQTP GGGNTNFGRG RKGQKGEPGL
     VPVVTGIRGR PGPAGPPGSQ GPRGERGPKG RPGPRGPQGI DGEPGVPGQP GAPGPPGHPS
     HPGPDGMSRP FSAQMAGLDE KSGLGSQVGL MPGSVGPVGP RGPQGLQGQQ GGAGPTGPPG
     EPGDPGPMGP IGSRGPEGPP GKPGEDGEPG RNGNPGEVGF AGSPGARGFP GAPGLPGLKG
     HRGHKGLEGP KGEVGAPGSK GEAGPTGPMG AMGPLGPRGM PGERGRLGPQ GAPGQRGAHG
     MPGKPGPMGP LGIPGSSGFP GNPGMKGEAG PTGARGPEGP QGQRGETGPP GPVGSPGLPG
     AVGTDGTPGA KGPTGSPGTS GPPGSAGPPG SPGPQGSTGP QGIRGQPGDP GVPGFKGEAG
     PKGEPGPHGI QGPIGPPGEE GKRGPRGDPG TVGPPGPVGE RGAPGNRGFP GSDGLPGPKG
     AQGERGPVGS SGPKGSQGDP GRPGEPGLPG ARGLTGNPGV QGPEGKLGPL GAPGEDGRPG
     PPGSIGIRGQ PGSMGLPGPK GSSGDPGKPG EAGNAGVPGQ RGAPGKDGEV GPSGPVGPPG
     LAGERGEQGP PGPTGFQGLP GPPGPPGEGG KPGDQGVPGD PGAVGPLGPR GERGNPGERG
     EPGITGLPGE KGMAGGHGPD GPKGSPGPSG TPGDTGPPGL QGMPGERGIA GTPGPKGDRG
     GIGEKGAEGT AGNDGARGLP GPLGPPGPAG PTGEKGEPGP RGLVGPPGSR GNPGSRGENG
     PTGAVGFAGP QGPDGQPGVK GEPGEPGQKG DAGSPGPQGL AGSPGPHGPN GVPGLKGGRG
     TQGPPGATGF PGSAGRVGPP GPAGAPGPAG PLGEPGKEGP PGLRGDPGSH GRVGDRGPAG
     PPGGPGDKGD PGEDGQPGPD GPPGPAGTTG QRGIVGMPGQ RGERGMPGLP GPAGTPGKVG
     PTGATGDKGP PGPVGPPGSN GPVGEPGPEG PAGNDGTPGR DGAVGERGDR GDPGPAGLPG
     SQGAPGTPGP VGAPGDAGQR GEPGSRGPIG PPGRAGKRGL PGPQGPRGDK GDHGDRGDRG
     QKGHRGFTGL QGLPGPPGPN GEQGSAGIPG PFGPRGPPGP VGPSGKEGNP GPLGPIGPPG
     VRGSVGEAGP EGPPGEPGPP GPPGPPGHLT AALGDIMGHY DESMPDPLPE FTEDQAAPDD
     KNKTDPGVHA TLKSLSSQIE TMRSPDGSKK HPARTCDDLK LCHSAKQSGE YWIDPNQGSV
     EDAIKVYCNM ETGETCISAN PSSVPRKTWW ASKSPDNKPV WYGLDMNRGS QFAYGDHQSP
     NTAITQMTFL RLLSKEASQN ITYICKNSVG YMDDQAKNLK KAVVLKGAND LDIKAEGNIR
     FRYIVLQDTC SKRNGNVGKT VFEYRTQNVA RLPIIDLAPV DVGGTDQEFG VEIGPVCFV
//
DBGET integrated database retrieval system