GenomeNet

Database: UniProt
Entry: S7PI71_MYOBR
LinkDB: S7PI71_MYOBR
Original site: S7PI71_MYOBR 
ID   S7PI71_MYOBR            Unreviewed;      1410 AA.
AC   S7PI71;
DT   16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT   16-OCT-2013, sequence version 1.
DT   27-MAR-2024, entry version 35.
DE   SubName: Full=Collagen alpha-2(V) chain {ECO:0000313|EMBL:EPQ07817.1};
GN   ORFNames=D623_10008784 {ECO:0000313|EMBL:EPQ07817.1};
OS   Myotis brandtii (Brandt's bat).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Chiroptera; Microchiroptera; Vespertilionidae;
OC   Myotis.
OX   NCBI_TaxID=109478 {ECO:0000313|EMBL:EPQ07817.1, ECO:0000313|Proteomes:UP000052978};
RN   [1] {ECO:0000313|Proteomes:UP000052978}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=23962925; DOI=10.1038/ncomms3212;
RA   Seim I., Fang X., Xiong Z., Lobanov A.V., Huang Z., Ma S., Feng Y.,
RA   Turanov A.A., Zhu Y., Lenz T.L., Gerashchenko M.V., Fan D., Hee Yim S.,
RA   Yao X., Jordan D., Xiong Y., Ma Y., Lyapunov A.N., Chen G., Kulakova O.I.,
RA   Sun Y., Lee S.G., Bronson R.T., Moskalev A.A., Sunyaev S.R., Zhang G.,
RA   Krogh A., Wang J., Gladyshev V.N.;
RT   "Genome analysis reveals insights into physiology and longevity of the
RT   Brandt's bat Myotis brandtii.";
RL   Nat. Commun. 4:2212-2212(2013).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KE162330; EPQ07817.1; -; Genomic_DNA.
DR   eggNOG; KOG3544; Eukaryota.
DR   Proteomes; UP000052978; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   Gene3D; 2.60.120.1000; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 7.
DR   SMART; SM00038; COLFI; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000313|EMBL:EPQ07817.1};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Reference proteome {ECO:0000313|Proteomes:UP000052978};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT   DOMAIN          1177..1410
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          57..1179
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        113..130
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        622..636
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        868..882
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        941..955
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1039..1054
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1082..1096
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1123..1138
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1410 AA;  136438 MW;  0DD93CAD1A9402A7 CRC64;
     MRMSSAGTLG PPHNMLFYMR VNVLEQAPHD EQLQTYMWWM RPELPTKVLD FRGREIQARG
     EKGEPGSVPV VTGIRGRPGP SGPPGSQGPR GDRGPKGKPG PRGPQGIDGE PGVPGQPGPP
     GPPGHPSHPG PDGISRPFSA QMAGLDDKAG LGSQMGLMPG SVGPVGPRGP QGLQGQQGGV
     GPAGPPGEPG EPGPMGPIGA RGPEGPPGKP GEDGEPGRNG SPGEVGFSGS PGARGFPGAP
     GLPGLKGHRG HKGLEGPKGE VGATGSKGEA GPTGPMGAMG PMGPRGMPGE RGRLGPQGAP
     GQRGAHGMPG KPGPMGPLGI PGSSGFPGNP GMKGEAGPTG ARGPEGPQGQ RGETGPPGPV
     GSQGLPGAVG TDGTPGAKGP TGSAGPSGPP GLAGPPGSQG PQGSTGLPGI RGQPGDPGVP
     GFKGEAGPKG EPGPHGIQGP IGPPGEEGKR GPRGDPGAVG PQGPVGERGA PGNRGFPGSD
     GLPGPKGAQG ERGPVGSSGP KGGQGDPGRP GEPGLPGARG LTGNPGVQGP EGKLGPLGAP
     GEDGRPGPPG SIGIRGQPGT MGLPGPKGSS GDPGKPGEAG NAGVPGQRGA PGKDGEVGPS
     GPVGPPGLAG ERGEQGPPGP TGFQGLPGPP GPPGEGGKPG DQGVPGDPGA VGPLGPRGER
     GNPGERGEPG ITGLPGEKGM AGGHGPDGPK GSPGPAGTPG DTGPPGLQGM PGERGIAGTP
     GPKGDRGGLG EKGAEGTAGN DGARGLPGPL GPPGPAGPTG EKGEPGPRGL VGPPGSRGNP
     GSRGENGPIG AVGFAGPQGP DGQPGVKGEP GEPGQKGDAG SPGPQGLAGS PGPHGPNGVP
     GLKGGRGTQG PPGATGFPGS AGRVGPPGPV GAPGPAGPLG EPGKEGPPGL RGDPGSHGRV
     GDRGPAGPPG GPGDKGDPGE DGQPGTPGKV GATGATGDKG PPGPVGPPGS NGPVGEPGPE
     GPAGNDGTPG RDGAVGERGD RGDPGPAGLP GSQGAPGTPG PVGAPGDAGQ RGDPGSRGPI
     GPPGRAGKRG LPGPQGPRGD KGDHGDRGDR GQKGHRGFTG LQGLPGPPGP NGEQGSAGIP
     GPFGPRGPPG PVGPSGKEGN PGPLGPIGPP GVRGSVGEAG PEGPPGEPGP PGPPGPPGHL
     TAALGDILGH YDENMPDPLP EFTEDEAAPD DKNKTDPGVH ATLKSLSSQI ETMRSPDGSK
     KHPARTCEDL KLCHSAKHSG EYWIDPNQGS VEDAIKVFCN METGETCISA NPSSIPRKTW
     WTSKSPDHKP VWYGLDMNRG AQFAYGDHQS PSAAITQMTF LRLLSKEASQ NITYVCKNSV
     GYMDDQAQNL KKAVVLKGSN DLDIRAEGNV RFRYIVLQDT CSKRNGNVGK TVFEYRTQNV
     ARLPIIDLAP VDVGGTDQEF GVEIGPVCFV
//
DBGET integrated database retrieval system