ID H3B2W1_LATCH Unreviewed; 626 AA.
AC H3B2W1;
DT 18-APR-2012, integrated into UniProtKB/TrEMBL.
DT 18-APR-2012, sequence version 1.
DT 27-MAR-2024, entry version 59.
DE SubName: Full=TOX high mobility group box family member 4 {ECO:0000313|Ensembl:ENSLACP00000016232.1};
GN Name=TOX4 {ECO:0000313|Ensembl:ENSLACP00000016232.1};
OS Latimeria chalumnae (Coelacanth).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Coelacanthiformes; Coelacanthidae; Latimeria.
OX NCBI_TaxID=7897 {ECO:0000313|Ensembl:ENSLACP00000016232.1, ECO:0000313|Proteomes:UP000008672};
RN [1] {ECO:0000313|Proteomes:UP000008672}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Wild caught {ECO:0000313|Proteomes:UP000008672};
RA Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA MacCallum I., Young S., Walker B.J., Lander E., Lindblad-Toh K.;
RT "The draft genome of Latimeria chalumnae.";
RL Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSLACP00000016232.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AFYH01118979; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AFYH01118980; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; H3B2W1; -.
DR STRING; 7897.ENSLACP00000016232; -.
DR Ensembl; ENSLACT00000016346.1; ENSLACP00000016232.1; ENSLACG00000014300.1.
DR eggNOG; KOG0381; Eukaryota.
DR GeneTree; ENSGT00940000154888; -.
DR HOGENOM; CLU_030650_0_0_1; -.
DR InParanoid; H3B2W1; -.
DR OMA; MEQCLIM; -.
DR TreeFam; TF106481; -.
DR Proteomes; UP000008672; Unassembled WGS sequence.
DR Bgee; ENSLACG00000014300; Expressed in pectoral fin and 6 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd21995; HMG-box_TOX-like; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR PANTHER; PTHR45781; AGAP000281-PA; 1.
DR PANTHER; PTHR45781:SF2; TOX HIGH MOBILITY GROUP BOX FAMILY MEMBER 4; 1.
DR Pfam; PF00505; HMG_box; 1.
DR PRINTS; PR00886; HIGHMOBLTY12.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000008672}.
FT DOMAIN 220..288
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 220..288
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 157..224
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 434..465
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 502..531
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 551..584
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 387..417
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 165..181
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 446..462
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 565..579
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 626 AA; 67748 MW; 74BDEABDA931AECD CRC64;
FPGGSDNYLT IAGSGHHFLS GAETFHTPSL GDEEFEIPPI SLDADPSLAV SDVVAHFDDL
GDPNSTQDSS FSSQYGVQTL DLPVAMSQGV IEQSQGILGT GLSMDLTHSI GAQYANPPVT
IDVAMTDMNH SLLAHNQLTT IDQSELSSQL GLSLGGGTIL PRAQSPEDRL STTPSPTSSL
QEDEAEEFRR DITSQKTIVV ETGKKQKASK KKKKKDPNEP QKPVSAYALF FRDTQAAIKG
QNPNATFGEV SKIVASMWDS LGEEQKQVYK RKTEAAKKDY LKALAAYRAN QLSQTTVETI
ELEPLHAVPA PMETPTVSQP IVVNPNMAQY APSQSPRPPS ITKIIIPKQM LASGQLSSSI
AVSQGGMVTV IPATMVASRG LQLNQSVHQI QTSVQRLQQQ QQQQQQQQQQ QQQQQQAVAV
QGSLQALRLQ PPPLQQMQQP PRLQLMPQPG ANPQPPPLQP MQAQPKVRLS LQVQQPPPPL
QIKIVPPTQP QVQIQPVPPL QMQPQQAAPT ALVQPAPASP DMPSPAASPS PVAVALASPE
QVDEIVTEIV PESESPPQMD VELVSASPPA SSPQSRCVRV GCTNPPVESR DWDREYCSNE
CVARHCREVF MTWLSSRNQS SITSVK
//