ID I3IUR8_ORENI Unreviewed; 667 AA.
AC I3IUR8;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 11-JUL-2012, sequence version 1.
DT 27-MAR-2024, entry version 62.
DE SubName: Full=TOX high mobility group box family member 4 {ECO:0000313|Ensembl:ENSONIP00000000358.1};
GN Name=tox4 {ECO:0000313|Ensembl:ENSONIP00000000358.1};
OS Oreochromis niloticus (Nile tilapia) (Tilapia nilotica).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Oreochromini; Oreochromis.
OX NCBI_TaxID=8128 {ECO:0000313|Ensembl:ENSONIP00000000358.1, ECO:0000313|Proteomes:UP000005207};
RN [1] {ECO:0000313|Proteomes:UP000005207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Broad Institute Genome Assembly Team;
RG Broad Institute Sequencing Platform;
RA Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Oreochromis niloticus (Nile Tilapia).";
RL Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSONIP00000000358.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_003458812.1; XM_003458764.4.
DR AlphaFoldDB; I3IUR8; -.
DR STRING; 8128.ENSONIP00000000358; -.
DR Ensembl; ENSONIT00000000357.2; ENSONIP00000000358.1; ENSONIG00000000284.2.
DR GeneID; 100692575; -.
DR KEGG; onl:100692575; -.
DR CTD; 100003353; -.
DR eggNOG; KOG0381; Eukaryota.
DR GeneTree; ENSGT00940000154888; -.
DR HOGENOM; CLU_030650_0_0_1; -.
DR InParanoid; I3IUR8; -.
DR OMA; DGTGPHD; -.
DR OrthoDB; 4252846at2759; -.
DR TreeFam; TF106481; -.
DR Proteomes; UP000005207; Linkage group LG3.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd21995; HMG-box_TOX-like; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR PANTHER; PTHR45781; AGAP000281-PA; 1.
DR PANTHER; PTHR45781:SF6; TOX HIGH MOBILITY GROUP BOX FAMILY MEMBER 4; 1.
DR Pfam; PF00505; HMG_box; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Reference proteome {ECO:0000313|Proteomes:UP000005207}.
FT DOMAIN 301..369
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 301..369
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 1..41
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 209..236
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 256..303
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 474..550
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 213..233
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 260..282
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 476..495
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 667 AA; 69091 MW; DF8AE10E1E4D1E8F CRC64;
MDLNFYSDLT DGTGQHDGDP EFLDPQSFNG FDSDNKFPGG SDNYLTIPGS GHPFLSSSET
FHTPSLGDEE FEIPPISLDP DSALSVSDVV SHFGELSETG PSDSVVVPGN AVVEGDDPSF
ASTFVSAASQ GLEHLTLGVM TQPGGNTLLG SSLGMDLGHP IGSQFSSSSP VTIDVPLGDM
GQGLLGSNQL TTIDQSELSA QLGLGLGGGN ILQRPQSPEN PLSATASPTS SLQDDDMDDF
RRSVLVESPV SLAVSPGVIS LDPSPSQSPL SAPTSSVSSA TGRKGAGGGK KGKKKKDPNE
PQKPVSAYAL FFRDTQAAIK GQNPNATFGE VSKIVASMWD SLGEEQKQVY KRKNEAAKKD
YLKALAEYRA SLISQAPIEV METTPSPPPP APAPVVTATP TPIPAPTPPT RPTRSQHYNP
EENTITNICT SNIILDLPQV TTRSRTGAIK PQPPPASTAL NTAPVAKIII KQTPLPSGGM
SATVTTASSP RQPPPLQQMQ STPPPPRLQQ MVHAQAPPPL QAKPRGGGGG AAAATTAPPP
LKVVPSARQS DSSASIIVTS SGETSTTVSA ATSALAVEVG HTGEVTGGEE VVEGEEGMEV
EVNVAPGPSV TPAASPNICV RAGCTNPAVE SKDWDKEYCS NECVATHCRD VFMAWCAIRG
QNSTTVT
//