ID A0A061GJU6_THECC Unreviewed; 746 AA.
AC A0A061GJU6;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 43.
DE SubName: Full=ARID/BRIGHT DNA-binding domain-containing protein isoform 1 {ECO:0000313|EMBL:EOY30140.1};
GN ORFNames=TCM_037452 {ECO:0000313|EMBL:EOY30140.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY30140.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY30140.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001887; EOY30139.1; -; Genomic_DNA.
DR EMBL; CM001887; EOY30140.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061GJU6; -.
DR STRING; 3641.A0A061GJU6; -.
DR EnsemblPlants; EOY30139; EOY30139; TCM_037452.
DR EnsemblPlants; EOY30140; EOY30140; TCM_037452.
DR Gramene; EOY30139; EOY30139; TCM_037452.
DR Gramene; EOY30140; EOY30140; TCM_037452.
DR eggNOG; ENOG502QQP4; Eukaryota.
DR InParanoid; A0A061GJU6; -.
DR OMA; GSCGEWA; -.
DR Proteomes; UP000026915; Chromosome 9.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR CDD; cd16100; ARID; 1.
DR CDD; cd15615; PHD_ARID4_like; 1.
DR Gene3D; 1.10.150.60; ARID DNA-binding domain; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR InterPro; IPR042293; ARID4.
DR InterPro; IPR001606; ARID_dom.
DR InterPro; IPR036431; ARID_dom_sf.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR46694; AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 4; 1.
DR PANTHER; PTHR46694:SF1; AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 4; 1.
DR Pfam; PF01388; ARID; 1.
DR SMART; SM01014; ARID; 1.
DR SUPFAM; SSF46774; ARID-like; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 1.
DR PROSITE; PS51011; ARID; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000313|EMBL:EOY30140.1};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 569..673
FT /note="ARID"
FT /evidence="ECO:0000259|PROSITE:PS51011"
FT REGION 454..481
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 746 AA; 82222 MW; 9EC7940B46D420C5 CRC64;
MMFSAQGSSR NHCSLLAVLS GGNVSDNKQK QPVSDDKPRY PFPELASSGR LEVQLLNSPN
IDELRRVLES TEPNVVYLQG EQNADSEEIG PLIWGDVDLS TPETLCGLFD STLPTTVYLE
TPNGDKLAEA LHSQGVPYVI YWKNTFSRFA ACHFRQALLS VIQSSCSHTW DAFQLAHASF
RLYCVRNNNV VSSNSQKQSV KPGPRLLGEA PKIDVSQPEV DMQGEESSPE NLPAIKIYDD
DVTVRFLVCG SPCILDAFLL GSLEDGLNAL LSIEIRGSKL HNRASAPPPP LQAGTFSRGV
VTMRCDFSTC SSAHISLLVS GSAQTCFNDQ LLENHIKNEI IEKSQLVHAQ SSSEESKLPS
SEPRRSASIA CGASVFEVCM KVPTWASQVL RQLAPDVSYR SLVMLGIASI QGLSVASFEK
DDAERLLFFC MRQDKDPLQD SSVIAISPSW LVPPAPSRKR SEPCKDSKPL NCTGMEGENG
IARPKSNVAA MRPIPHTHRH KIIPFSGFSE AERYDGDQGK VNLPVVPVKQ PAPVTHRKAL
SSSYQAQQII SLNPLPLKKH GCGRAPIQVC SEEEFLRDVM QFLILRGHTR LVPQGGLAEF
PDAILNAKRL DLFNLYREVV SRGGFHVGNG INWKGQVFSK MRNHTMTNRM TGVGNTLKRH
YETYLLEYEL AHDDVDGECC LLCHSSAAGD WVNCGICGEW AHFGCDRRQG LGAFKDYAKT
DGLEYVCPHC SISNFKKKPQ KTVNGY
//