ID A0A151MNG9_ALLMI Unreviewed; 526 AA.
AC A0A151MNG9;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 30.
DE SubName: Full=Thymocyte selection-associated high mobility group box protein TOX {ECO:0000313|EMBL:KYO26056.1};
GN Name=TOX {ECO:0000313|EMBL:KYO26056.1};
GN ORFNames=Y1Q_0003816 {ECO:0000313|EMBL:KYO26056.1};
OS Alligator mississippiensis (American alligator).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC Alligator.
OX NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO26056.1};
RN [1] {ECO:0000313|EMBL:KYO26056.1, ECO:0000313|Proteomes:UP000050525}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO26056.1};
RX PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA Ray D.A.;
RT "Sequencing three crocodilian genomes to illuminate the evolution of
RT archosaurs and amniotes.";
RL Genome Biol. 13:415-415(2012).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYO26056.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKHW03005657; KYO26056.1; -; Genomic_DNA.
DR RefSeq; XP_006269595.1; XM_006269533.3.
DR AlphaFoldDB; A0A151MNG9; -.
DR STRING; 8496.A0A151MNG9; -.
DR GeneID; 102572407; -.
DR KEGG; amj:102572407; -.
DR CTD; 9760; -.
DR eggNOG; KOG0381; Eukaryota.
DR OrthoDB; 4252846at2759; -.
DR PhylomeDB; A0A151MNG9; -.
DR Proteomes; UP000050525; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd21995; HMG-box_TOX-like; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR PANTHER; PTHR45781; AGAP000281-PA; 1.
DR PANTHER; PTHR45781:SF4; THYMOCYTE SELECTION-ASSOCIATED HIGH MOBILITY GROUP BOX PROTEIN TOX; 1.
DR Pfam; PF00505; HMG_box; 1.
DR PRINTS; PR00886; HIGHMOBLTY12.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Reference proteome {ECO:0000313|Proteomes:UP000050525}.
FT DOMAIN 261..329
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 261..329
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 1..20
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 192..265
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 192..220
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 221..246
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 526 AA; 57534 MW; E99D7918D651EA35 CRC64;
MDVRFYSPPP PQPAAAPDTP CLGPSPCLDP YYCNKFDGEN MYMSMTEPSQ DYVPASQSFP
GPSLESEDFN IPPITPPSLP DHSLVHLNEV ESGYHSLCHP MNHNGLLPFH PQNMDLPEIT
VSNMLSQDGT LLSNSLSVMQ DIRNTEGAQY SSHPQMAAMR PRVQPADIRQ PGMIPHGQLT
TINQSQLSAQ LGLNMGGNNV PHNSPSPPGS KSATPSPSSS VHEDEGEDTS KVNGGEKRPA
SDLGKKPKTP KKKKKKDPNE PQKPVSAYAL FFRDTQAAIK GQNPNATFGE VSKIVASMWD
GLGEEQKQVY KKKTEAAKKE YLKQLAAYRA SLVSKSYSEP VDVKASQPPQ MINSKQSVFH
GPAQAPSALY LSSHYHQQPG MNPHITAMHA NIPRNIAPKP NNQMPVTVSI ANMAVSPPPP
LQISPPLHQH LNIQQHQPIT MQQSIGNQLP MQVQSALHSP TMQQGFTLQP DYQNIINPTS
TAAQVVTQAM EYVRSGCRNP PAQTVDWNND YCSNGGMQRD KALYLT
//