ID S7PI71_MYOBR Unreviewed; 1410 AA.
AC S7PI71;
DT 16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT 16-OCT-2013, sequence version 1.
DT 27-MAR-2024, entry version 35.
DE SubName: Full=Collagen alpha-2(V) chain {ECO:0000313|EMBL:EPQ07817.1};
GN ORFNames=D623_10008784 {ECO:0000313|EMBL:EPQ07817.1};
OS Myotis brandtii (Brandt's bat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Chiroptera; Microchiroptera; Vespertilionidae;
OC Myotis.
OX NCBI_TaxID=109478 {ECO:0000313|EMBL:EPQ07817.1, ECO:0000313|Proteomes:UP000052978};
RN [1] {ECO:0000313|Proteomes:UP000052978}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23962925; DOI=10.1038/ncomms3212;
RA Seim I., Fang X., Xiong Z., Lobanov A.V., Huang Z., Ma S., Feng Y.,
RA Turanov A.A., Zhu Y., Lenz T.L., Gerashchenko M.V., Fan D., Hee Yim S.,
RA Yao X., Jordan D., Xiong Y., Ma Y., Lyapunov A.N., Chen G., Kulakova O.I.,
RA Sun Y., Lee S.G., Bronson R.T., Moskalev A.A., Sunyaev S.R., Zhang G.,
RA Krogh A., Wang J., Gladyshev V.N.;
RT "Genome analysis reveals insights into physiology and longevity of the
RT Brandt's bat Myotis brandtii.";
RL Nat. Commun. 4:2212-2212(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KE162330; EPQ07817.1; -; Genomic_DNA.
DR eggNOG; KOG3544; Eukaryota.
DR Proteomes; UP000052978; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.60.120.1000; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR000885; Fib_collagen_C.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 7.
DR SMART; SM00038; COLFI; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 4: Predicted;
KW Collagen {ECO:0000313|EMBL:EPQ07817.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Reference proteome {ECO:0000313|Proteomes:UP000052978};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 1177..1410
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 57..1179
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 113..130
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 622..636
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 868..882
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 941..955
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1039..1054
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1082..1096
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1123..1138
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1410 AA; 136438 MW; 0DD93CAD1A9402A7 CRC64;
MRMSSAGTLG PPHNMLFYMR VNVLEQAPHD EQLQTYMWWM RPELPTKVLD FRGREIQARG
EKGEPGSVPV VTGIRGRPGP SGPPGSQGPR GDRGPKGKPG PRGPQGIDGE PGVPGQPGPP
GPPGHPSHPG PDGISRPFSA QMAGLDDKAG LGSQMGLMPG SVGPVGPRGP QGLQGQQGGV
GPAGPPGEPG EPGPMGPIGA RGPEGPPGKP GEDGEPGRNG SPGEVGFSGS PGARGFPGAP
GLPGLKGHRG HKGLEGPKGE VGATGSKGEA GPTGPMGAMG PMGPRGMPGE RGRLGPQGAP
GQRGAHGMPG KPGPMGPLGI PGSSGFPGNP GMKGEAGPTG ARGPEGPQGQ RGETGPPGPV
GSQGLPGAVG TDGTPGAKGP TGSAGPSGPP GLAGPPGSQG PQGSTGLPGI RGQPGDPGVP
GFKGEAGPKG EPGPHGIQGP IGPPGEEGKR GPRGDPGAVG PQGPVGERGA PGNRGFPGSD
GLPGPKGAQG ERGPVGSSGP KGGQGDPGRP GEPGLPGARG LTGNPGVQGP EGKLGPLGAP
GEDGRPGPPG SIGIRGQPGT MGLPGPKGSS GDPGKPGEAG NAGVPGQRGA PGKDGEVGPS
GPVGPPGLAG ERGEQGPPGP TGFQGLPGPP GPPGEGGKPG DQGVPGDPGA VGPLGPRGER
GNPGERGEPG ITGLPGEKGM AGGHGPDGPK GSPGPAGTPG DTGPPGLQGM PGERGIAGTP
GPKGDRGGLG EKGAEGTAGN DGARGLPGPL GPPGPAGPTG EKGEPGPRGL VGPPGSRGNP
GSRGENGPIG AVGFAGPQGP DGQPGVKGEP GEPGQKGDAG SPGPQGLAGS PGPHGPNGVP
GLKGGRGTQG PPGATGFPGS AGRVGPPGPV GAPGPAGPLG EPGKEGPPGL RGDPGSHGRV
GDRGPAGPPG GPGDKGDPGE DGQPGTPGKV GATGATGDKG PPGPVGPPGS NGPVGEPGPE
GPAGNDGTPG RDGAVGERGD RGDPGPAGLP GSQGAPGTPG PVGAPGDAGQ RGDPGSRGPI
GPPGRAGKRG LPGPQGPRGD KGDHGDRGDR GQKGHRGFTG LQGLPGPPGP NGEQGSAGIP
GPFGPRGPPG PVGPSGKEGN PGPLGPIGPP GVRGSVGEAG PEGPPGEPGP PGPPGPPGHL
TAALGDILGH YDENMPDPLP EFTEDEAAPD DKNKTDPGVH ATLKSLSSQI ETMRSPDGSK
KHPARTCEDL KLCHSAKHSG EYWIDPNQGS VEDAIKVFCN METGETCISA NPSSIPRKTW
WTSKSPDHKP VWYGLDMNRG AQFAYGDHQS PSAAITQMTF LRLLSKEASQ NITYVCKNSV
GYMDDQAQNL KKAVVLKGSN DLDIRAEGNV RFRYIVLQDT CSKRNGNVGK TVFEYRTQNV
ARLPIIDLAP VDVGGTDQEF GVEIGPVCFV
//