ID H0WT22_OTOGA Unreviewed; 1501 AA.
AC H0WT22;
DT 22-FEB-2012, integrated into UniProtKB/TrEMBL.
DT 22-FEB-2012, sequence version 1.
DT 27-MAR-2024, entry version 63.
DE SubName: Full=Collagen type V alpha 2 chain {ECO:0000313|Ensembl:ENSOGAP00000005270.2};
GN Name=COL5A2 {ECO:0000313|Ensembl:ENSOGAP00000005270.2};
OS Otolemur garnettii (Small-eared galago) (Garnett's greater bushbaby).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Strepsirrhini; Lorisiformes;
OC Galagidae; Otolemur.
OX NCBI_TaxID=30611 {ECO:0000313|Ensembl:ENSOGAP00000005270.2, ECO:0000313|Proteomes:UP000005225};
RN [1] {ECO:0000313|Proteomes:UP000005225}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG The Broad Institute Genome Sequencing Platform;
RA Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K., Jaffe D.B.,
RA Gnerre S., MacCallum I., Przybylski D., Ribeiro F.J., Burton J.N.,
RA Walker B.J., Sharpe T., Hall G.;
RT "Version 3 of the genome sequence of Otolemur garnettii (Bushbaby).";
RL Submitted (MAR-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSOGAP00000005270.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAQR03168858; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAQR03168859; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAQR03168860; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAQR03168861; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAQR03168862; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAQR03168863; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAQR03168864; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 30611.ENSOGAP00000005270; -.
DR Ensembl; ENSOGAT00000005898.2; ENSOGAP00000005270.2; ENSOGAG00000005889.2.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000155675; -.
DR HOGENOM; CLU_001074_2_3_1; -.
DR InParanoid; H0WT22; -.
DR OMA; CESPQVP; -.
DR TreeFam; TF323987; -.
DR Proteomes; UP000005225; Unassembled WGS sequence.
DR GO; GO:0005588; C:collagen type V trimer; IEA:Ensembl.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0046332; F:SMAD binding; IEA:Ensembl.
DR GO; GO:0071230; P:cellular response to amino acid stimulus; IEA:Ensembl.
DR GO; GO:0030199; P:collagen fibril organization; IEA:Ensembl.
DR GO; GO:0048592; P:eye morphogenesis; IEA:Ensembl.
DR GO; GO:1903225; P:negative regulation of endodermal cell differentiation; IEA:Ensembl.
DR GO; GO:0001501; P:skeletal system development; IEA:Ensembl.
DR GO; GO:0043588; P:skin development; IEA:Ensembl.
DR Gene3D; 2.60.120.1000; -; 1.
DR Gene3D; 6.20.200.20; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR001007; VWF_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 7.
DR Pfam; PF00093; VWC; 1.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00214; VWC; 1.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000005225};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..1501
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003544969"
FT DOMAIN 38..96
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 1262..1501
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 106..1232
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 167..182
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 674..688
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1029..1043
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1127..1142
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1170..1184
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1211..1226
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1501 AA; 145932 MW; 2F8CFD0819CF548A CRC64;
MANWVEARPL LILTVLLGQF VSIKAQEEGE DEGYGEEIAC TQNGQMYLNR DIWKPAPCQI
CVCDNGAILC DKIECQEVLD CADPITPPGE CCPVCSQTAG GGNNNFGRGR KGQKGEPGLV
PVVTGIRGRP GPAGPPGSQG PRGERGPKGR PGPRGPQGID GEPGVPGQPG SPGPPGHPSH
PGPDGMSRPF SAQMAGLDEK SGLGSQVGLM PGSVGPVGPR GPQGLQGQQG GAGPAGPPGE
PGEPGPMGPI GARGPEGPPG KPGEDGEAGR NGNPGEVGFT GSPGARGFPG APGLPGLKGH
RGHKGLDGPK GEVGAPGSKG EAGPTGPMGA MGPLGPRGMP GERGRLGPQG APGQRGAHGM
PGKPGPMGPL GIAGSAGFPG NPGMKGEAGP TGARGPEGPQ GQRGETGPPG PVGSSGLPGA
MGTDGTPGAK GPMGSPGTSG PPGLAGPPGS PGPQGSTGPQ GIRGQPGDPG VPGFKGEAGP
KGEPGPHGIQ GPIGPPGEEG KRGPRGDPGT VGPPGPVGER GPPGNRGFPG SDGLPGPKGA
QGERGPVGSS GPKGGQGDPG RLGEPGLPGA RGLTGNPGVQ GPEGKLGPLG APGEDGRPGP
PGSIGIRGQP GSMGLPGPKG SSGDPGKPGE AGNAGVPGQR GAPGKDGEVG PSGPVGPPGL
AGERGEQGPP GPTGFQGLPG PPGPPGEGGK PGDQGVPGDP GAVGPLGPRG ERGNPGERGE
PGITGLPGEK GMAGGHGPDG PKGSPGPSGT PGDTGPPGLQ GMPGERGIAG TPGPKGDRGG
IGEKGAEGTA GNDGARGLPG PLGPPGPAGP TGEKGEPGPR GLVGPPGSRG NPGSRGENGP
TGAVGFAGPQ GPDGQPGVKG EPGEPGQKGD AGSPGPQGLA GSPGPHGPNG VPGLKGGRGT
QGPPGATGFP GSAGRVGPPG PTGAVGPAGP LGEPGKEGPP GLRGDPGSHG RVGDRGPAGP
PGGPGDKGDP GEDGQSGPDG PPGPAGTTGQ RGIVGMPGQR GERGMPGLPG PAGTPGKVGP
TGATGDKGPP GPVGPPGSNG PVGEPGPEGP AGNDGTPGRD GAVGERGDRG DPGPAGLPGS
QGAPGTPGPV GAPGDAGQRG DPGSRGPIGP PGRAGKRGLP GPQGPRGDKG DHGDRGDRGQ
KGHRGFTGLQ GLPGPPGPNG EQGSAGIPGP FGPRGPPGPV GPSGKEGNPG PLGPIGPPGV
RGSIGEAGPE GPPGEPGPPG PPGPPGHLTA ALGDIMGHYD ESMPDPLPEF TEDQAVYAHI
FTNQLLPGYT LFLKAQSLLR ESLISEFIFV SHLFFWRLIS CLSFSLFRKT GEYWIDPNQG
SVEDAIKVYC NMDTGETCIS ANPSSIPRKM WWASKSPDHK PIWYGLDMNR GSQFTYGDHH
SPNTAITQMT FLRLLSKEAS QNITYICKNT VGYMDDQAKN LKKAVILKGA NDLEIKAEGN
IRFRYIVLQD TCSKRNGNVG KTVFEYRTQK VARLPIIDLA PVDIGSTDQE FGVELGPVCF
V
//