ID A0A1R3FW35_COCAP Unreviewed; 546 AA.
AC A0A1R3FW35;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE SubName: Full=DNA-binding WRKY {ECO:0000313|EMBL:OMO49950.1};
GN ORFNames=CCACVL1_30743 {ECO:0000313|EMBL:OMO49950.1};
OS Corchorus capsularis (Jute).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus.
OX NCBI_TaxID=210143 {ECO:0000313|EMBL:OMO49950.1, ECO:0000313|Proteomes:UP000188268};
RN [1] {ECO:0000313|EMBL:OMO49950.1, ECO:0000313|Proteomes:UP000188268}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. CVL-1 {ECO:0000313|Proteomes:UP000188268};
RC TISSUE=Whole seedling {ECO:0000313|EMBL:OMO49950.1};
RA Alam M., Haque M.S., Islam M.S., Emdad E.M., Islam M.M., Ahmed B.,
RA Halim A., Hossen Q.M.M., Hossain M.Z., Ahmed R., Khan M.M., Islam R.,
RA Rashid M.M., Khan S.A., Rahman M.S., Alam M.;
RT "Corchorus capsularis genome sequencing.";
RL Submitted (SEP-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OMO49950.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AWWV01016330; OMO49950.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1R3FW35; -.
DR STRING; 210143.A0A1R3FW35; -.
DR EnsemblPlants; OMO49950; OMO49950; CCACVL1_30743.
DR Gramene; OMO49950; OMO49950; CCACVL1_30743.
DR OMA; IREDQRQ; -.
DR OrthoDB; 5478560at2759; -.
DR Proteomes; UP000188268; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR Gene3D; 2.20.25.80; WRKY domain; 1.
DR InterPro; IPR003657; WRKY_dom.
DR InterPro; IPR036576; WRKY_dom_sf.
DR InterPro; IPR044810; WRKY_plant.
DR PANTHER; PTHR31429; WRKY TRANSCRIPTION FACTOR 36-RELATED; 1.
DR PANTHER; PTHR31429:SF86; WRKY TRANSCRIPTION FACTOR 61-RELATED; 1.
DR Pfam; PF03106; WRKY; 1.
DR SMART; SM00774; WRKY; 1.
DR SUPFAM; SSF118290; WRKY DNA-binding domain; 1.
DR PROSITE; PS50811; WRKY; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000313|EMBL:OMO49950.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000188268}.
FT DOMAIN 194..260
FT /note="WRKY"
FT /evidence="ECO:0000259|PROSITE:PS50811"
FT REGION 1..26
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 56..75
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 91..120
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 134..168
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 502..546
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 8..26
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 56..74
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 91..114
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 546 AA; 58956 MW; AF134EC257AF726D CRC64;
MASSSSRKDQ EDQLESAKAE MGEVREENQR LKIYLNRIMK DYQNLQMQFY DIVRQDAKKS
AAKTSNDDHQ QDIEPELVSL TLGRFSSDSK KLLDDNKKKT CSQGKKDHEQ RAAGSNNKEG
LSLGLDYKLE AASKSDVDDD EALANPSPTN SSQEPKEEET WPPSKVLKTM TSGDDEILQQ
NPVKKARVCV RARCDTPTMN DGCQWRKYGQ KIAKGNPCPR AYYRCTVAPS CPVRKQVQRC
AEDMSILITT YEGTHNHPLP MSATAMASTT CAAASMLLSG SSSSSSSTAN TPTNLHGLNF
YLSDNSKSKF YLPNSSLSAS SSHPTITLDL TSTPSSSPTQ FPFNRFSSAY PTATSRYPCT
SLSFGSSDSN TLFWGNANGL LSYGNSSQSL MKNQIGTMEN NIYQTLMQKN NLNPNPLPAA
HQHQQPLQDT IAAATKAITA DPNFQSALAA ALTSIIGSGG GGNNGGEGLG QKLKWAEQPF
PVTSSSGYSQ TVKGNVGCAS SFLNKSPSST TSQQQGSMLF LPPSLPFSTP KSASTSPGGD
SRNHSN
//