ID I3LZE4_ICTTR Unreviewed; 623 AA.
AC I3LZE4;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 2.
DT 27-MAR-2024, entry version 62.
DE RecName: Full=TOX high mobility group box family member 4 {ECO:0000256|ARBA:ARBA00013298};
GN Name=TOX4 {ECO:0000313|Ensembl:ENSSTOP00000001689.3};
OS Ictidomys tridecemlineatus (Thirteen-lined ground squirrel) (Spermophilus
OS tridecemlineatus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Sciuromorpha; Sciuridae;
OC Xerinae; Marmotini; Ictidomys.
OX NCBI_TaxID=43179 {ECO:0000313|Ensembl:ENSSTOP00000001689.3, ECO:0000313|Proteomes:UP000005215};
RN [1] {ECO:0000313|Proteomes:UP000005215}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG The Broad Institute Genome Assembly & Analysis Group;
RG Computational R&D Group;
RG and Sequencing Platform;
RA Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA MacCallum I., Young S., Walker B.J., Lindblad-Toh K.;
RT "The Draft Genome of Spermophilus tridecemlineatus.";
RL Submitted (NOV-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSSTOP00000001689.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGTP01114835; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; I3LZE4; -.
DR STRING; 43179.ENSSTOP00000001689; -.
DR Ensembl; ENSSTOT00000001895.3; ENSSTOP00000001689.3; ENSSTOG00000001894.3.
DR GeneTree; ENSGT00940000154888; -.
DR HOGENOM; CLU_030650_0_0_1; -.
DR InParanoid; I3LZE4; -.
DR TreeFam; TF106481; -.
DR Proteomes; UP000005215; Unassembled WGS sequence.
DR GO; GO:0000785; C:chromatin; IEA:Ensembl.
DR GO; GO:0000781; C:chromosome, telomeric region; IEA:Ensembl.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0072357; C:PTW/PP1 phosphatase complex; IEA:Ensembl.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd21995; HMG-box_TOX-like; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR PANTHER; PTHR45781; AGAP000281-PA; 1.
DR PANTHER; PTHR45781:SF2; TOX HIGH MOBILITY GROUP BOX FAMILY MEMBER 4; 1.
DR Pfam; PF00505; HMG_box; 1.
DR PRINTS; PR00886; HIGHMOBLTY12.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000005215}.
FT DOMAIN 227..295
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 227..295
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 159..231
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 308..339
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 159..187
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 313..328
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 623 AA; 66338 MW; AFC28AE905E3B212 CRC64;
MSGSESFPGG NDNYLTITGP SHPFLSGAET FHTPSLGDEE FEIPPISLDS DPSLAVSDVV
GHFDDLADPS SSQDGSFSAQ YGVQTLDMPV GMTHGLMEQG GGLLSGGLTM DLDHSIGTQY
SANPPVTIDV PMTDMTSGLM GHSQLTTIDQ SELSSQLGLS LGGSTILPPA QSPEDRLSTT
PSPTSSLHED GVEDFRRQLP SQKTVVVEAG KKQKAPKKRK KKDPNEPQKP VSAYALFFRD
TQAAIKGQNP NATFGEVSKI VASMWDSLGE EQKQVYKRKT EAAKKEYLKA LAAYKDNQEC
QATVETVELD PVPPSQTPSP PPVATVDPAS PAPASTESPA LSPCIVVNST LSSYVANQAS
SGAGGQPNIT KLIITKQMLP SSITMSQGGM VTVIPATVVT SRGLQLGQTS TATIQPSQQA
QIVTRSVLQA AAAAAASMQL PPPRLQPPPL QQMPQPPTQQ QVTILQQPPP LQAMQQPPPQ
KVRINLQQQP PPLQSKIVPP PTLKMQTTLV PPAVESSPER PMNSSPEAHT VEATSPETIC
EMITDVVPEV ESPSQMDVEL VSGSPVTLSP QPRCVRSGCE NPPVVSKDWD NEYCSNECVV
KHCRDVFLAW VASRNSNTVV FVK
//