Database: UniProt
Entry: H0WT22_OTOGA
ID   H0WT22_OTOGA            Unreviewed;      1501 AA.
AC   H0WT22;
DT   22-FEB-2012, integrated into UniProtKB/TrEMBL.
DT   22-FEB-2012, sequence version 1.
DT   27-MAR-2024, entry version 63.
DE   SubName: Full=Collagen type V alpha 2 chain {ECO:0000313|Ensembl:ENSOGAP00000005270.2};
GN   Name=COL5A2 {ECO:0000313|Ensembl:ENSOGAP00000005270.2};
OS   Otolemur garnettii (Small-eared galago) (Garnett's greater bushbaby).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Strepsirrhini; Lorisiformes;
OC   Galagidae; Otolemur.
OX   NCBI_TaxID=30611 {ECO:0000313|Ensembl:ENSOGAP00000005270.2, ECO:0000313|Proteomes:UP000005225};
RN   [1] {ECO:0000313|Proteomes:UP000005225}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG   The Broad Institute Genome Sequencing Platform;
RA   Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K., Jaffe D.B.,
RA   Gnerre S., MacCallum I., Przybylski D., Ribeiro F.J., Burton J.N.,
RA   Walker B.J., Sharpe T., Hall G.;
RT   "Version 3 of the genome sequence of Otolemur garnettii (Bushbaby).";
RL   Submitted (MAR-2011) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSOGAP00000005270.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAQR03168858; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAQR03168859; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAQR03168860; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAQR03168861; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAQR03168862; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAQR03168863; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AAQR03168864; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   STRING; 30611.ENSOGAP00000005270; -.
DR   Ensembl; ENSOGAT00000005898.2; ENSOGAP00000005270.2; ENSOGAG00000005889.2.
DR   eggNOG; KOG3544; Eukaryota.
DR   GeneTree; ENSGT00940000155675; -.
DR   HOGENOM; CLU_001074_2_3_1; -.
DR   InParanoid; H0WT22; -.
DR   OMA; CESPQVP; -.
DR   TreeFam; TF323987; -.
DR   Proteomes; UP000005225; Unassembled WGS sequence.
DR   GO; GO:0005588; C:collagen type V trimer; IEA:Ensembl.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR   GO; GO:0046332; F:SMAD binding; IEA:Ensembl.
DR   GO; GO:0071230; P:cellular response to amino acid stimulus; IEA:Ensembl.
DR   GO; GO:0030199; P:collagen fibril organization; IEA:Ensembl.
DR   GO; GO:0048592; P:eye morphogenesis; IEA:Ensembl.
DR   GO; GO:1903225; P:negative regulation of endodermal cell differentiation; IEA:Ensembl.
DR   GO; GO:0001501; P:skeletal system development; IEA:Ensembl.
DR   GO; GO:0043588; P:skin development; IEA:Ensembl.
DR   Gene3D; 2.60.120.1000; -; 1.
DR   Gene3D; 6.20.200.20; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   InterPro; IPR001007; VWF_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 7.
DR   Pfam; PF00093; VWC; 1.
DR   SMART; SM00038; COLFI; 1.
DR   SMART; SM00214; VWC; 1.
DR   SUPFAM; SSF57603; FnI-like domain; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
DR   PROSITE; PS01208; VWFC_1; 1.
DR   PROSITE; PS50184; VWFC_2; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Reference proteome {ECO:0000313|Proteomes:UP000005225};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..25
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           26..1501
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5003544969"
FT   DOMAIN          38..96
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   DOMAIN          1262..1501
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          106..1232
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        167..182
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        674..688
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1029..1043
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1127..1142
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1170..1184
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1211..1226
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1501 AA;  145932 MW;  2F8CFD0819CF548A CRC64;
     MANWVEARPL LILTVLLGQF VSIKAQEEGE DEGYGEEIAC TQNGQMYLNR DIWKPAPCQI
     CVCDNGAILC DKIECQEVLD CADPITPPGE CCPVCSQTAG GGNNNFGRGR KGQKGEPGLV
     PVVTGIRGRP GPAGPPGSQG PRGERGPKGR PGPRGPQGID GEPGVPGQPG SPGPPGHPSH
     PGPDGMSRPF SAQMAGLDEK SGLGSQVGLM PGSVGPVGPR GPQGLQGQQG GAGPAGPPGE
     PGEPGPMGPI GARGPEGPPG KPGEDGEAGR NGNPGEVGFT GSPGARGFPG APGLPGLKGH
     RGHKGLDGPK GEVGAPGSKG EAGPTGPMGA MGPLGPRGMP GERGRLGPQG APGQRGAHGM
     PGKPGPMGPL GIAGSAGFPG NPGMKGEAGP TGARGPEGPQ GQRGETGPPG PVGSSGLPGA
     MGTDGTPGAK GPMGSPGTSG PPGLAGPPGS PGPQGSTGPQ GIRGQPGDPG VPGFKGEAGP
     KGEPGPHGIQ GPIGPPGEEG KRGPRGDPGT VGPPGPVGER GPPGNRGFPG SDGLPGPKGA
     QGERGPVGSS GPKGGQGDPG RLGEPGLPGA RGLTGNPGVQ GPEGKLGPLG APGEDGRPGP
     PGSIGIRGQP GSMGLPGPKG SSGDPGKPGE AGNAGVPGQR GAPGKDGEVG PSGPVGPPGL
     AGERGEQGPP GPTGFQGLPG PPGPPGEGGK PGDQGVPGDP GAVGPLGPRG ERGNPGERGE
     PGITGLPGEK GMAGGHGPDG PKGSPGPSGT PGDTGPPGLQ GMPGERGIAG TPGPKGDRGG
     IGEKGAEGTA GNDGARGLPG PLGPPGPAGP TGEKGEPGPR GLVGPPGSRG NPGSRGENGP
     TGAVGFAGPQ GPDGQPGVKG EPGEPGQKGD AGSPGPQGLA GSPGPHGPNG VPGLKGGRGT
     QGPPGATGFP GSAGRVGPPG PTGAVGPAGP LGEPGKEGPP GLRGDPGSHG RVGDRGPAGP
     PGGPGDKGDP GEDGQSGPDG PPGPAGTTGQ RGIVGMPGQR GERGMPGLPG PAGTPGKVGP
     TGATGDKGPP GPVGPPGSNG PVGEPGPEGP AGNDGTPGRD GAVGERGDRG DPGPAGLPGS
     QGAPGTPGPV GAPGDAGQRG DPGSRGPIGP PGRAGKRGLP GPQGPRGDKG DHGDRGDRGQ
     KGHRGFTGLQ GLPGPPGPNG EQGSAGIPGP FGPRGPPGPV GPSGKEGNPG PLGPIGPPGV
     RGSIGEAGPE GPPGEPGPPG PPGPPGHLTA ALGDIMGHYD ESMPDPLPEF TEDQAVYAHI
     FTNQLLPGYT LFLKAQSLLR ESLISEFIFV SHLFFWRLIS CLSFSLFRKT GEYWIDPNQG
     SVEDAIKVYC NMDTGETCIS ANPSSIPRKM WWASKSPDHK PIWYGLDMNR GSQFTYGDHH
     SPNTAITQMT FLRLLSKEAS QNITYICKNT VGYMDDQAKN LKKAVILKGA NDLEIKAEGN
     IRFRYIVLQD TCSKRNGNVG KTVFEYRTQK VARLPIIDLA PVDIGSTDQE FGVELGPVCF
     V
//