ID A0A061F8K1_THECC Unreviewed; 614 AA.
AC A0A061F8K1;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE SubName: Full=Global transcription factor group E4, putative isoform 1 {ECO:0000313|EMBL:EOY13188.1};
GN ORFNames=TCM_031714 {ECO:0000313|EMBL:EOY13188.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY13188.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY13188.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001885; EOY13187.1; -; Genomic_DNA.
DR EMBL; CM001885; EOY13188.1; -; Genomic_DNA.
DR EMBL; CM001885; EOY13189.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061F8K1; -.
DR EnsemblPlants; EOY13187; EOY13187; TCM_031714.
DR EnsemblPlants; EOY13188; EOY13188; TCM_031714.
DR EnsemblPlants; EOY13189; EOY13189; TCM_031714.
DR Gramene; EOY13187; EOY13187; TCM_031714.
DR Gramene; EOY13188; EOY13188; TCM_031714.
DR Gramene; EOY13189; EOY13189; TCM_031714.
DR HOGENOM; CLU_009580_2_0_1; -.
DR Proteomes; UP000026915; Chromosome 7.
DR CDD; cd05506; Bromo_plant1; 1.
DR Gene3D; 1.20.1270.220; -; 1.
DR Gene3D; 1.20.920.10; Bromodomain-like; 1.
DR InterPro; IPR001487; Bromodomain.
DR InterPro; IPR036427; Bromodomain-like_sf.
DR InterPro; IPR037377; GTE_bromo.
DR InterPro; IPR027353; NET_dom.
DR InterPro; IPR038336; NET_sf.
DR PANTHER; PTHR45926; OSJNBA0053K19.4 PROTEIN; 1.
DR PANTHER; PTHR45926:SF15; TRANSCRIPTION FACTOR GTE4-LIKE; 1.
DR Pfam; PF17035; BET; 1.
DR Pfam; PF00439; Bromodomain; 1.
DR PRINTS; PR00503; BROMODOMAIN.
DR SMART; SM00297; BROMO; 1.
DR SUPFAM; SSF47370; Bromodomain; 1.
DR PROSITE; PS50014; BROMODOMAIN_2; 1.
DR PROSITE; PS51525; NET; 1.
PE 4: Predicted;
KW Bromodomain {ECO:0000256|PROSITE-ProRule:PRU00035};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT DOMAIN 273..345
FT /note="Bromo"
FT /evidence="ECO:0000259|PROSITE:PS50014"
FT DOMAIN 451..532
FT /note="NET"
FT /evidence="ECO:0000259|PROSITE:PS51525"
FT REGION 1..53
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 188..208
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 567..614
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..16
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 28..53
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 578..614
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 614 AA; 67799 MW; 5DF927FDFB5A0BC5 CRC64;
MATGKIEGEE SKHNKVYTRK NHKKHKNPAF VPQQSSQQTL ATTTTTTDDN NSSQQLPIQT
LDVVVSDDSS SHNRVQKGLQ NATTGVATSG YVKYDNLVKI SLNVLSKNEV RVLKRKLASE
LEQIRDLVKR FEAKESRFSA GYANSRVSGN ENVDRGGGSL VRVNSDVGSV GLPSSMPFHG
LSVSVAEQDH SNHGGGGGSE FVEKEKRTPK ANQYYKNSEF VLGKEKLKPA ESNKKMKPSV
GKSNGGQMGG GIAMEKFSNQ MFKSCSNLLG KLMKHKFGWV FNRPVDVKGL GLHDYYSIIK
HPMDLGTVKT RLNKNWYKSP REFAEDVRLT FRNAMLYNPK GQDVHFMADT LSGIFEEKWA
AIESDYNLNR RFERSHDYSL PTPTSRRVPA SVPALAPVQA HGPPTPVPAP SPLPLEARTL
ERSESMTMPI DPKSRAVNLT PSGRIAVPKK PKAKDSDKRD MTYEEKQRLS VNLQNLPSEK
LDSLVQIIKK RNPALFVQDD EIEVDIDSVD PETLWELDRF VTNYKKGLSK NKKKAELTLQ
ASAENDHDIQ EINLEPSAEE VAKVNEAVER IVPTSPPIHG ERQQNNESGS GSSSSSSTDS
GSSSSDSDSD SSSG
//