ID A0A5E4D023_MARMO Unreviewed; 346 AA.
AC A0A5E4D023;
DT 13-NOV-2019, integrated into UniProtKB/TrEMBL.
DT 13-NOV-2019, sequence version 1.
DT 27-MAR-2024, entry version 15.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:VTJ87427.1};
GN ORFNames=MONAX_5E019942 {ECO:0000313|EMBL:VTJ87427.1};
OS Marmota monax (Woodchuck).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Sciuromorpha; Sciuridae;
OC Xerinae; Marmotini; Marmota.
OX NCBI_TaxID=9995 {ECO:0000313|EMBL:VTJ87427.1, ECO:0000313|Proteomes:UP000335636};
RN [1] {ECO:0000313|EMBL:VTJ87427.1, ECO:0000313|Proteomes:UP000335636}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Alioto T., Alioto T.;
RL Submitted (APR-2019) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CABDUW010002597; VTJ87427.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A5E4D023; -.
DR Proteomes; UP000335636; Unassembled WGS sequence.
DR GO; GO:0005829; C:cytosol; IEA:UniProt.
DR GO; GO:0045095; C:keratin filament; IEA:InterPro.
DR InterPro; IPR002494; KAP.
DR Pfam; PF13885; Keratin_B2_2; 3.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
PE 4: Predicted;
KW Keratin {ECO:0000256|ARBA:ARBA00022744};
KW Reference proteome {ECO:0000313|Proteomes:UP000335636};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REGION 23..124
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 34..53
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 106..124
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 346 AA; 36055 MW; D7E80CBFC6382623 CRC64;
MRPRTGHQAP WTLGAQYRCQ ASCSVAGGCP SSPGWPGSSH TQHPSTQCSR NPLPLQGPRR
WQDINELDSL PGKQASPMTA RSRPSNPEGE AVGWGPEGPG RYKSQSPELP THTHTLTHTP
SSSTPTMAAS TMSVCSSSCP ESSWQVDDCP ESCCQPPCCT PSCCQPSCCA PAPRLTLLCT
PVSCVSRPCC QSVCTSSCTP SSCQQSSCQS DCSSCSPCQP SCCVSLCCKP VCCKPVCCVP
VCSEASSSCC QQSSCQSDCC SSSPCQPSCC VPVCCKPVCC YRPSSCVSLL CRPVCRPACC
VPASSCCASS CQPSCCRPAS CVSLLCRPTC SRPACCGVSL GQRSCC
//