ID A0A068VJ04_COFCA Unreviewed; 534 AA.
AC A0A068VJ04;
DT 01-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 01-OCT-2014, sequence version 1.
DT 27-MAR-2024, entry version 29.
DE SubName: Full=DH200=94 genomic scaffold, scaffold_1747 {ECO:0000313|EMBL:CDP20785.1};
GN ORFNames=GSCOC_T00000840001 {ECO:0000313|EMBL:CDP20785.1};
OS Coffea canephora (Robusta coffee).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; lamiids; Gentianales; Rubiaceae; Ixoroideae; Gardenieae complex;
OC Bertiereae - Coffeeae clade; Coffeeae; Coffea.
OX NCBI_TaxID=49390 {ECO:0000313|EMBL:CDP20785.1, ECO:0000313|Proteomes:UP000295252};
RN [1] {ECO:0000313|Proteomes:UP000295252}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. DH200-94 {ECO:0000313|Proteomes:UP000295252};
RX PubMed=25190796; DOI=10.1126/science.1255274;
RA Denoeud F., Carretero-Paulet L., Dereeper A., Droc G., Guyot R.,
RA Pietrella M., Zheng C., Alberti A., Anthony F., Aprea G., Aury J.M.,
RA Bento P., Bernard M., Bocs S., Campa C., Cenci A., Combes M.C.,
RA Crouzillat D., Da Silva C., Daddiego L., De Bellis F., Dussert S.,
RA Garsmeur O., Gayraud T., Guignon V., Jahn K., Jamilloux V., Joet T.,
RA Labadie K., Lan T., Leclercq J., Lepelley M., Leroy T., Li L.T.,
RA Librado P., Lopez L., Munoz A., Noel B., Pallavicini A., Perrotta G.,
RA Poncet V., Pot D., Priyono X., Rigoreau M., Rouard M., Rozas J.,
RA Tranchant-Dubreuil C., VanBuren R., Zhang Q., Andrade A.C., Argout X.,
RA Bertrand B., de Kochko A., Graziosi G., Henry R.J., Jayarama X., Ming R.,
RA Nagai C., Rounsley S., Sankoff D., Giuliano G., Albert V.A., Wincker P.,
RA Lashermes P.;
RT "The coffee genome provides insight into the convergent evolution of
RT caffeine biosynthesis.";
RL Science 345:1181-1184(2014).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; HG740831; CDP20785.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A068VJ04; -.
DR STRING; 49390.A0A068VJ04; -.
DR EnsemblPlants; CDP20785; CDP20785; GSCOC_T00000840001.
DR Gramene; CDP20785; CDP20785; GSCOC_T00000840001.
DR InParanoid; A0A068VJ04; -.
DR OMA; IREDQRQ; -.
DR PhylomeDB; A0A068VJ04; -.
DR Proteomes; UP000295252; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR Gene3D; 2.20.25.80; WRKY domain; 1.
DR InterPro; IPR003657; WRKY_dom.
DR InterPro; IPR036576; WRKY_dom_sf.
DR InterPro; IPR044810; WRKY_plant.
DR PANTHER; PTHR31429; WRKY TRANSCRIPTION FACTOR 36-RELATED; 1.
DR PANTHER; PTHR31429:SF86; WRKY TRANSCRIPTION FACTOR 61-RELATED; 1.
DR Pfam; PF03106; WRKY; 1.
DR SMART; SM00774; WRKY; 1.
DR SUPFAM; SSF118290; WRKY DNA-binding domain; 1.
DR PROSITE; PS50811; WRKY; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000295252}.
FT DOMAIN 174..240
FT /note="WRKY"
FT /evidence="ECO:0000259|PROSITE:PS50811"
FT REGION 29..95
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 109..169
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 493..534
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 29..70
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 71..95
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 493..528
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 534 AA; 57516 MW; 2D0A2C8E0AFCADA3 CRC64;
MGEVMEENQR LRMCLDRAMK DYRALQMQFN DMVQQEPNKS AEPNKSSSTI TTHQETEEPE
LVSLSLGMSS SDGKKDDHFS GKNQGKEKVD KDGDHDKKVL ALGLDCKFEL PKDQENEPSP
NPSLEASSGE VKEEEGGEKW PPQKSLKNVV RSEEDEVSQQ NPAKRARVSV RVRCDTPTMN
DGCQWRKYGQ KIAKGNPCPR AYYRCTVAPN CPVRKQVQRC AEDMSILITT YEGTHNHPLP
VAATAMASTT SAAASMLMSG STTSTSGLLP PSNFASANLN GLNFFLSDNS KPKPFYLPNS
SLSSSPSFPT ITLDLTTSSN ASSNLTKLGS FPPRSYSATR DLNFSSLESN ALPLSWSNGV
LSYGPQAQPY NKNQSISSLN FGRQPQETLY QSYSQKNHPI LNSSAQQSLP AETIAAATKA
ITSDPSFQSA LAAALTSIIG SGTGGTTGAP VSQTSADKSG QNLKLNDQSF PILSSFPSST
STVNKCAPTF LNSSKPPSSA NSQPGSLMFL SPSLSFPTPS NKSTSPGDNR DRIS
//