ID A0A061FQZ1_THECC Unreviewed; 767 AA.
AC A0A061FQZ1;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE RecName: Full=Enhancer of polycomb-like protein {ECO:0000256|RuleBase:RU361124};
GN ORFNames=TCM_043753 {ECO:0000313|EMBL:EOY19077.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY19077.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY19077.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|RuleBase:RU361124}.
CC -!- SIMILARITY: Belongs to the enhancer of polycomb family.
CC {ECO:0000256|ARBA:ARBA00008035, ECO:0000256|RuleBase:RU361124}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001888; EOY19077.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061FQZ1; -.
DR EnsemblPlants; EOY19077; EOY19077; TCM_043753.
DR Gramene; EOY19077; EOY19077; TCM_043753.
DR Proteomes; UP000026915; Chromosome 10.
DR GO; GO:0035267; C:NuA4 histone acetyltransferase complex; IEA:InterPro.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IEA:InterPro.
DR InterPro; IPR024943; Enhancer_polycomb.
DR InterPro; IPR019542; Enhancer_polycomb-like_N.
DR PANTHER; PTHR14898; ENHANCER OF POLYCOMB; 1.
DR PANTHER; PTHR14898:SF2; ENHANCER OF POLYCOMB-LIKE PROTEIN; 1.
DR Pfam; PF10513; EPL1; 1.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|RuleBase:RU361124};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Transcription {ECO:0000256|ARBA:ARBA00023163,
KW ECO:0000256|RuleBase:RU361124};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015,
KW ECO:0000256|RuleBase:RU361124}.
FT DOMAIN 506..600
FT /note="Enhancer of polycomb-like N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10513"
FT REGION 62..90
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 115..136
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 73..87
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 118..133
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 767 AA; 86305 MW; F400664B43258CD1 CRC64;
MPSVGMRRTT RVFRMVKSSE VARVLRSGRR LWPDSGEAKP KRLANEGDEN YNLMKKAPKS
EVNGVAAEVS GRPKRLGNEE NPRKQSRKMK AGAFNTSGSV DKMFGIVYTR KRKRNGVQNG
HLSGNSGQGN YGKKISRRQA IENRNTNEDV EEPKMFSFVV ENGDCNGCFS NFLILVLGYV
KRAEVRLSEL AAFLMSQPIS SVYSSNGVNF FWGPRNRTGI CKFFGAKDSI PLFSLDFSAV
PRYFLYMHYS KVLRLKRIQI VPVNSDEIVS DSEEDEPCVT SVVDVCKSTS GNAAVEIDNL
GSKVVLHPSV RASKLTGRNA QCRNGLSSRS IQKRRSSLRR RRARNPSIVG IHKANGALMS
DLISSRRNGI PFSSVVSKNK LRSSVRNSSV ANVSDVGSSI SDLMQNVDSS QCSANILVIE
ADRCYREEGA IVTLELSASR EWLLVVKKGS STKFACKADK FMRPSSCNRF THAIIWTGDD
NWKLEFPNRQ DWIIFKDLYK ECSERNVPAS TVKAIPVPGV HEVPGYEDRR SVPFCRPDFY
ISLDGDEVSR ALAKRTANYD MDSEDEEWLK KFNNEFFSGN GHCEHLSEDC FELMVDAFEK
AYFCSPDDYS NENAAAHLCL DLGTRGLVEA VHTYWLRKRK QRRSALLRVF QGHQVKKAPL
VPKPFLRKRR SFKRQASHGR GKQPYLLQAL AAERDSMAEQ NAMLKLEEAR VSASRSVELA
VLKRQRTQLL MENADLATYK AMMALRIAEA ARFTESSDVA VAHFFDL
//