ID A0A061FIB6_THECC Unreviewed; 2265 AA.
AC A0A061FIB6;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 58.
DE SubName: Full=Histone methyltransferases(H3-K4 specific),histone methyltransferases(H3-K36 specific), putative isoform 1 {ECO:0000313|EMBL:EOY16447.1};
GN ORFNames=TCM_035213 {ECO:0000313|EMBL:EOY16447.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY16447.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY16447.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001886; EOY16446.1; -; Genomic_DNA.
DR EMBL; CM001886; EOY16447.1; -; Genomic_DNA.
DR EMBL; CM001886; EOY16448.1; -; Genomic_DNA.
DR STRING; 3641.A0A061FIB6; -.
DR EnsemblPlants; EOY16446; EOY16446; TCM_035213.
DR EnsemblPlants; EOY16447; EOY16447; TCM_035213.
DR EnsemblPlants; EOY16448; EOY16448; TCM_035213.
DR Gramene; EOY16446; EOY16446; TCM_035213.
DR Gramene; EOY16447; EOY16447; TCM_035213.
DR Gramene; EOY16448; EOY16448; TCM_035213.
DR eggNOG; KOG4442; Eukaryota.
DR HOGENOM; CLU_230515_0_0_1; -.
DR InParanoid; A0A061FIB6; -.
DR OMA; KGDSHDL; -.
DR Proteomes; UP000026915; Chromosome 8.
DR GO; GO:0000785; C:chromatin; IBA:GO_Central.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0046975; F:histone H3K36 methyltransferase activity; IBA:GO_Central.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IBA:GO_Central.
DR CDD; cd19172; SET_SETD2; 1.
DR Gene3D; 3.30.40.100; -; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR006560; AWS_dom.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR044437; SETD2/Set2_SET.
DR InterPro; IPR011124; Znf_CW.
DR PANTHER; PTHR22884:SF413; HISTONE-LYSINE N-METHYLTRANSFERASE CG1716-RELATED; 1.
DR PANTHER; PTHR22884; SET DOMAIN PROTEINS; 1.
DR Pfam; PF17907; AWS; 1.
DR Pfam; PF00856; SET; 1.
DR Pfam; PF07496; zf-CW; 1.
DR SMART; SM00570; AWS; 1.
DR SMART; SM00508; PostSET; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS51215; AWS; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50280; SET; 1.
DR PROSITE; PS51050; ZF_CW; 1.
PE 4: Predicted;
KW Chromosome {ECO:0000256|ARBA:ARBA00022454};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603,
KW ECO:0000313|EMBL:EOY16447.1}; Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 1191..1244
FT /note="CW-type"
FT /evidence="ECO:0000259|PROSITE:PS51050"
FT DOMAIN 1305..1355
FT /note="AWS"
FT /evidence="ECO:0000259|PROSITE:PS51215"
FT DOMAIN 1357..1474
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 1482..1498
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT REGION 153..179
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 264..301
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 404..462
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 674..715
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 767..805
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 973..1047
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1064..1143
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1652..1715
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1940..1980
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2120..2177
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 156..176
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 411..429
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 697..715
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 973..987
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1004..1046
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1070..1103
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1104..1143
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1668..1715
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1940..1957
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1960..1978
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2265 AA; 248262 MW; 392FB44E5317657F CRC64;
MGLCENRTLV DEPLREVAAT EQHSCTELMG NLVPQQRDCI VFDPNGDCAG EPSEDENTDC
ERSRDIDCRD GIKGECKNVV GFGLKGLMGD ECCDSTICLK ENECENVYSS CSKELMGDNI
VFSENNQDVD DSDLKELVDN RCNDSVVCLE ENRGESVDGS SSKELMGERN RDSAVFSNEH
LGENVDGSFS DVLMGGRCGD STVSSNEDQG ENVDCAGSKE LISYKDGHSV VCLNENQSEN
VDGSGSKEMM DDRCGDGVVC LNDNQGENVD GSGPEELIGD GDSTGYSNEN QENVDRSGSK
EWMGDSISDN VVCLNENQGN VDVNDSETDL LCLKNRGISG EDGPTAVDGC SQDENTACLS
SGMEISIDQM RGNDENVVGW MLKECIDIQG GVCLIENLGK VDHHNSENDT SQDTEMPSEL
KTVATSPRNC VKQDKEKDDE SVSGSTQQGA MEDGEEKCEE ENDVLKRTGA DVPNQILPSQ
KSEVPFELIS VTGDFVSSSD WHNQKDDLSS SDLSLESFTK PVETKRTDDI CIELLASKGC
LSTLETLHRA ESLGTHQNAQ TDNKNVNGQS ENGVAEVFEK RAAVTAGTKV ETPSEIINAE
ENGCNSKGDS FELGANCLGD RSDSLSCQLF DVVENGLSER LDPVDIFAKD ACAAISSSSS
IDCSRERENE GKDVVKVDCV SDTKHHPATS SSSRRGSRKS KSSRKAPAKR IARYCRKTKL
ANPHESIEFI FRASRKKRSC SSKPARASDW GLLSNITQFL EQYHEPGCNE VPNQERSKAG
GGRASGKRSK NRAGKSRKGS SGISNTSTNC LRLKIKVGKE VASINLNSVV TESVDPSVSV
DTSFNNHGKE TSFQCPKLVN VVEDKVGKLE SERQLQFKED SEKVKTCSDA SIMDLKLAHK
VVESAENLEM SAEDAADNYP VSLSDAVAEA SGEVVENKYI DPGTSPDSEV INLIPDARVG
SIHQEESHNT VLNTSGALAS AGGVKSSKSS KRGKKDNHKS PGAASARKSK SSKNCRGKQK
TTVNGFCSSG ALTSSTGANS SRENGLGVSE EAMKVEIATD AKACCSPDVP DTKNTKNLSS
SKHKRNQPSK SSKSQGVSKG KSRVSDSARS RKGNACKQKG DELKSVSKTK VKKKGSDKDI
VARGGRHPLT VDIAGNHISD NIEISNTSNS IALADMINVD LVSDGTMEQC TQPDNAWVRC
DDCHKWRRIP VALVKSIDEA CRWVCGDNVD KAFADCSIPQ EKSNADINAD LGISDAEEDG
CDGLNYKELE KGFESKHMTV PPTSHFWRID SNWFLHRGRK TQTIDEIMVC HCKRPPDGKL
GCGDECLNRM LNIECVQGTC PCGDLCSNQQ FQKRKYAKMK WDRFGRKGFG LRMLEDISAS
QFLIEYVGEV LDMQAYEARQ KEYASRGQRH FYFMTLNGSE VIDAYVKGNL GRFINHSCDP
NCRTEKWMVN GEICIGLFAL RDIKQGEEVT FDYNYVRVFG AAAKKCHCGS PHCRGYIGGD
LLSAEEIVHD DSDEESPEPM MLEDGETWNG SDNIISRSSS FDGAEMQSVE SVVTDGVIKL
ENRPEAEDSV NRSASVTSQL KSSVETEYLN GNFQLSIKPE EVLPAMAAVQ PDSTTGKKAL
NRTSCSIQKL DTSLNILDNK LPTDVVDANK KSKFDTAEDK QVPPKSRPLM KTSRSSSSIK
KGKISSNSLN GHKVQITSTK SQVPSVKPKR LSENSSNCRF EAVEEKLNEL LDCDGGITKR
KDASKGYLKL LLLTATSGDS GNGETIQSNR DLSMILDALL KTKSRLVLTD IINKNGLQML
HNIMKKYRSD FKKIPILRKL LKVLEYLAMR EILTLDHIIG GPSCAGRQSF RESILSLTEH
DDKQVHQIAR NFRDRWIPKP VRKLSYRDKD EGKMEFHRGL DCNRVPASNN HWREQAIRPT
EAISCVMQSV VATTSVDTAS REGCSSSSTG VCQTNSTKIR KRKSRWDQPA ETEKIGSRSP
KKLQYSPLPV LVESTPDHID KMSQGDKECR DCVCKGEAIN VDNGRHSFQE DVPPGFSSPP
NASLVSSTAP STAIEFPKPY QLKCPDVIIA LPQKRFISRL PVSYGIPLPI LQQFGSPQGE
CVESWIIAPG MPFHPFPPLP PCPRDKKDTR PACTANSIGI DEDAEEGQRD SNRPATSYPD
ENIPCMAGGN QPDPDIPGTN IQQTFKRMRE SYDLGKKYFR QQKRKGPPWH KSECMGNNQI
GGTCCIDVGN VKNELRNSYF SDDITCRVEK GGNDFYQQPQ HPNQQ
//