GenomeNet

Database: UniProt
Entry: A0A061EIE6_THECC
ID   A0A061EIE6_THECC        Unreviewed;       285 AA.
AC   A0A061EIE6;
DT   03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT   03-SEP-2014, sequence version 1.
DT   27-MAR-2024, entry version 43.
DE   SubName: Full=WRKY DNA-binding protein 70, putative isoform 1 {ECO:0000313|EMBL:EOY04438.1};
GN   ORFNames=TCM_019689 {ECO:0000313|EMBL:EOY04438.1};
OS   Theobroma cacao (Cacao) (Cocoa).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX   NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY04438.1, ECO:0000313|Proteomes:UP000026915};
RN   [1] {ECO:0000313|EMBL:EOY04438.1, ECO:0000313|Proteomes:UP000026915}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX   PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA   Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA   Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA   Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA   Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA   Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA   Kuhn D.N.;
RT   "The genome sequence of the most widely cultivated cacao type and its use
RT   to identify candidate genes regulating pod color.";
RL   Genome Biol. 14:R53.1-R53.24(2013).
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM001882; EOY04438.1; -; Genomic_DNA.
DR   RefSeq; XP_017975966.1; XM_018120477.1.
DR   AlphaFoldDB; A0A061EIE6; -.
DR   SMR; A0A061EIE6; -.
DR   STRING; 3641.A0A061EIE6; -.
DR   EnsemblPlants; EOY04438; EOY04438; TCM_019689.
DR   EnsemblPlants; Tc04v2_t012100.1; Tc04v2_p012100.1; Tc04v2_g012100.
DR   GeneID; 18602206; -.
DR   Gramene; EOY04438; EOY04438; TCM_019689.
DR   Gramene; Tc04v2_t012100.1; Tc04v2_p012100.1; Tc04v2_g012100.
DR   KEGG; tcc:18602206; -.
DR   eggNOG; ENOG502RYCZ; Eukaryota.
DR   InParanoid; A0A061EIE6; -.
DR   OMA; MINDHTC; -.
DR   OrthoDB; 5478560at2759; -.
DR   Proteomes; UP000026915; Chromosome 4.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR   GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR   Gene3D; 2.20.25.80; WRKY domain; 1.
DR   InterPro; IPR003657; WRKY_dom.
DR   InterPro; IPR036576; WRKY_dom_sf.
DR   InterPro; IPR044810; WRKY_plant.
DR   PANTHER; PTHR31282:SF85; WRKY DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR31282; WRKY TRANSCRIPTION FACTOR 21-RELATED; 1.
DR   Pfam; PF03106; WRKY; 1.
DR   SMART; SM00774; WRKY; 1.
DR   SUPFAM; SSF118290; WRKY DNA-binding domain; 1.
DR   PROSITE; PS50811; WRKY; 1.
PE   4: Predicted;
KW   DNA-binding {ECO:0000313|EMBL:EOY04438.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT   DOMAIN          113..176
FT                   /note="WRKY"
FT                   /evidence="ECO:0000259|PROSITE:PS50811"
FT   REGION          60..115
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        60..76
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        77..96
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   285 AA;  32013 MW;  7DD9443925795E70 CRC64;
     MSSGRRKAIE ELARGRDLTN QLRDLLTKSF GDDGLLGSED LVTKILNSFA NTLSILRSSS
     GDYDEVSQNP RNSNMSWDGR KSEESGESIK SSTQKDRRGC YKRRKSEHSW TRDSPTLIDD
     GHAWRKYGQK VILNAKHPRN YYRCTHKHDQ GCQATKQVQQ IEDDPPKYGT TYYGHHTCKN
     LLKASQLILD STSKDSSILL SFANTNKQDN SMFSAFPPVK QESKEDMPSD ITYNLSTSSP
     DYLLSPDHLT TFESSAQMTV LSAADHADVI SGVVDSVDLD DLLEF
//
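
The entry above uses the two-letter line codes of the UniProtKB flat-file format, so its internal consistency can be checked mechanically. The following is a minimal sketch, not an official parser: the function name `parse_entry` and the `RECORD` excerpt (ID line, the WRKY DOMAIN feature, and the SQ block copied from the entry) are illustrative assumptions, and only the simple `start..end` feature location form seen in this entry is handled.

```python
import re

# Excerpt copied from the entry above: ID line, WRKY DOMAIN feature, SQ block.
RECORD = """\
ID   A0A061EIE6_THECC        Unreviewed;       285 AA.
FT   DOMAIN          113..176
FT                   /note="WRKY"
SQ   SEQUENCE   285 AA;  32013 MW;  7DD9443925795E70 CRC64;
     MSSGRRKAIE ELARGRDLTN QLRDLLTKSF GDDGLLGSED LVTKILNSFA NTLSILRSSS
     GDYDEVSQNP RNSNMSWDGR KSEESGESIK SSTQKDRRGC YKRRKSEHSW TRDSPTLIDD
     GHAWRKYGQK VILNAKHPRN YYRCTHKHDQ GCQATKQVQQ IEDDPPKYGT TYYGHHTCKN
     LLKASQLILD STSKDSSILL SFANTNKQDN SMFSAFPPVK QESKEDMPSD ITYNLSTSSP
     DYLLSPDHLT TFESSAQMTV LSAADHADVI SGVVDSVDLD DLLEF
//
"""


def parse_entry(text):
    """Return (declared_length, features, sequence) for one flat-file entry.

    Hypothetical helper for illustration; features are (key, start, end)
    tuples with 1-based inclusive coordinates, as written on the FT lines.
    """
    declared_length = None
    features = []
    seq_parts = []
    in_sequence = False

    for line in text.splitlines():
        if line.startswith("//"):  # entry terminator
            break
        if in_sequence:
            # Sequence lines are indented blocks of 10 residues; drop spaces.
            seq_parts.append(line.replace(" ", ""))
            continue

        code, body = line[:2], line[5:]
        if code == "ID":
            match = re.search(r"(\d+) AA\.", body)
            if match:
                declared_length = int(match.group(1))
        elif code == "FT":
            # Qualifier lines (/note=..., /evidence=...) start with spaces
            # after the line code and therefore do not match here.
            match = re.match(r"(\S+)\s+(\d+)\.\.(\d+)\s*$", body)
            if match:
                key, start, end = match.groups()
                features.append((key, int(start), int(end)))
        elif code == "SQ":
            in_sequence = True

    return declared_length, features, "".join(seq_parts)


declared_length, features, sequence = parse_entry(RECORD)

# Convert the 1-based inclusive FT coordinates to a Python slice.
_, dom_start, dom_end = features[0]
wrky_domain = sequence[dom_start - 1:dom_end]

print(declared_length, len(sequence))
print(wrky_domain)
```

Under these assumptions, the parsed sequence length agrees with the 285 AA declared on the ID and SQ lines, and the slice for DOMAIN 113..176 contains the WRKYGQK heptapeptide that is characteristic of WRKY domains. The MW and CRC64 values on the SQ line are not verified by this sketch.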