ID A0A061F581_THECC Unreviewed; 698 AA.
AC A0A061F581;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 24-JAN-2024, entry version 38.
DE SubName: Full=Agenet domain-containing protein / bromo-adjacent domain-containing protein, putative {ECO:0000313|EMBL:EOY12068.1};
GN ORFNames=TCM_030669 {ECO:0000313|EMBL:EOY12068.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY12068.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY12068.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001885; EOY12068.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061F581; -.
DR STRING; 3641.A0A061F581; -.
DR EnsemblPlants; EOY12068; EOY12068; TCM_030669.
DR Gramene; EOY12068; EOY12068; TCM_030669.
DR eggNOG; ENOG502QSIN; Eukaryota.
DR HOGENOM; CLU_014967_2_1_1; -.
DR InParanoid; A0A061F581; -.
DR OMA; CENDHHF; -.
DR Proteomes; UP000026915; Chromosome 7.
DR GO; GO:0003682; F:chromatin binding; IEA:InterPro.
DR CDD; cd04721; BAH_plant_1; 1.
DR CDD; cd20405; Tudor_Agenet_AtDUF_rpt1_3; 1.
DR Gene3D; 2.30.30.490; -; 1.
DR InterPro; IPR008395; Agenet-like_dom.
DR InterPro; IPR014002; Agenet_dom_plant.
DR InterPro; IPR001025; BAH_dom.
DR InterPro; IPR043151; BAH_sf.
DR PANTHER; PTHR31917; AGENET DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR31917:SF101; OS07G0607300 PROTEIN; 1.
DR Pfam; PF05641; Agenet; 1.
DR Pfam; PF01426; BAH; 1.
DR SMART; SM00743; Agenet; 2.
DR SMART; SM00439; BAH; 1.
DR PROSITE; PS51038; BAH; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT DOMAIN 170..289
FT /note="BAH"
FT /evidence="ECO:0000259|PROSITE:PS51038"
FT REGION 311..348
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 608..698
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 316..331
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 615..636
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 646..660
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 668..690
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 698 AA; 78742 MW; 0763D9C39D934374 CRC64;
MKMSGNGHCF TEWKEEFVSQ ERGNRVVHYF LKDSAGESIR AVIGTERSVR HMFYVVAEEF
VRVYGAEHSI HAGFKWRSRR EVVDWLTSML SKQHLQGDRS KSPKHEALLA LASPDCAMNE
ISARKAQALD DMSHLSRNWN GPSSDIVWSG TAWTCGKQLK HFPAFGRNGT TIAVQSFVFV
MAKGENHYLA YLEDMYEDKR GQKKVKVRWF HHTKEVKGVV PVRNPHPKEV FITPYSQVIS
AECVDGLASV LTREHYEKCS AVFPDALLAR VHVCSRQFRS NKVKPFDLSK LRGYFDQPIL
SCLNSSMFSE PDSMSHGLNE EGEEELSPSE NVKLGNKRTR TNRKSQRFVT DHSGNRISGN
HLMTYETSYK KIKYALSGKS LLSLKHVECQ PWYGSVFKVD EKIELLCQDS GIRGCWFRCT
VLQVSRKQMK VKYNDVQDED GYGKLEEWIP IFKLAMPDKL GMRYSGRRTI RPAPPSSETA
LALEVGSAVD AWWSDGWWEG VVTGVNSSGD DNLQVYFPGE NLFLSIHKKD LRISRDWDGD
HWIDIEARPD MLSLISTAIS PDMDTKVSMS STVVMDAKFD GSTMPMEVVA AKTTLNVVHG
EKPELAIQDC SGVKELQSSK DEKEGDGSDF KKPPPSENGD NDGNADDANT IHDKLNDVDG
NDKNNNANNS NDDKEGKLET ENDMEQDCKS TELVEVTT
//