ID A0A0D2N020_GOSRA Unreviewed; 1056 AA.
AC A0A0D2N020;
DT 29-APR-2015, integrated into UniProtKB/TrEMBL.
DT 29-APR-2015, sequence version 1.
DT 24-JAN-2024, entry version 34.
DE RecName: Full=DNA (cytosine-5-)-methyltransferase {ECO:0000256|ARBA:ARBA00011975};
DE EC=2.1.1.37 {ECO:0000256|ARBA:ARBA00011975};
GN ORFNames=B456_004G180300 {ECO:0000313|EMBL:KJB25192.1};
OS Gossypium raimondii (New World cotton).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium.
OX NCBI_TaxID=29730 {ECO:0000313|EMBL:KJB25192.1, ECO:0000313|Proteomes:UP000032304};
RN [1] {ECO:0000313|EMBL:KJB25192.1, ECO:0000313|Proteomes:UP000032304}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23257886; DOI=10.1038/nature11798;
RA Paterson A.H., Wendel J.F., Gundlach H., Guo H., Jenkins J., Jin D.,
RA Llewellyn D., Showmaker K.C., Shu S., Udall J., Yoo M.J., Byers R.,
RA Chen W., Doron-Faigenboim A., Duke M.V., Gong L., Grimwood J., Grover C.,
RA Grupp K., Hu G., Lee T.H., Li J., Lin L., Liu T., Marler B.S., Page J.T.,
RA Roberts A.W., Romanel E., Sanders W.S., Szadkowski E., Tan X., Tang H.,
RA Xu C., Wang J., Wang Z., Zhang D., Zhang L., Ashrafi H., Bedon F.,
RA Bowers J.E., Brubaker C.L., Chee P.W., Das S., Gingle A.R., Haigler C.H.,
RA Harker D., Hoffmann L.V., Hovav R., Jones D.C., Lemke C., Mansoor S.,
RA ur Rahman M., Rainville L.N., Rambani A., Reddy U.K., Rong J.K.,
RA Saranga Y., Scheffler B.E., Scheffler J.A., Stelly D.M., Triplett B.A.,
RA Van Deynze A., Vaslin M.F., Waghmare V.N., Walford S.A., Wright R.J.,
RA Zaki E.A., Zhang T., Dennis E.S., Mayer K.F., Peterson D.G., Rokhsar D.S.,
RA Wang X., Schmutz J.;
RT "Repeated polyploidization of Gossypium genomes and the evolution of
RT spinnable cotton fibres.";
RL Nature 492:423-427(2012).
CC -!- CATALYTIC ACTIVITY:
CC Reaction=a 2'-deoxycytidine in DNA + S-adenosyl-L-methionine = a 5-
CC methyl-2'-deoxycytidine in DNA + H(+) + S-adenosyl-L-homocysteine;
CC Xref=Rhea:RHEA:13681, Rhea:RHEA-COMP:11369, Rhea:RHEA-COMP:11370,
CC ChEBI:CHEBI:15378, ChEBI:CHEBI:57856, ChEBI:CHEBI:59789,
CC ChEBI:CHEBI:85452, ChEBI:CHEBI:85454; EC=2.1.1.37;
CC Evidence={ECO:0000256|ARBA:ARBA00000743};
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the class I-like SAM-binding methyltransferase
CC superfamily. C5-methyltransferase family. {ECO:0000256|PROSITE-
CC ProRule:PRU01016}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001743; KJB25192.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0D2N020; -.
DR EnsemblPlants; KJB25192; KJB25192; B456_004G180300.
DR Gramene; KJB25192; KJB25192; B456_004G180300.
DR Proteomes; UP000032304; Chromosome 4.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003682; F:chromatin binding; IEA:InterPro.
DR GO; GO:0003886; F:DNA (cytosine-5-)-methyltransferase activity; IEA:UniProtKB-EC.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR CDD; cd18635; CD_CMT3_like; 1.
DR Gene3D; 2.30.30.490; -; 1.
DR Gene3D; 3.90.120.10; DNA Methylase, subunit A, domain 2; 1.
DR Gene3D; 3.40.50.150; Vaccinia Virus protein VP39; 1.
DR InterPro; IPR001025; BAH_dom.
DR InterPro; IPR043151; BAH_sf.
DR InterPro; IPR018117; C5_DNA_meth_AS.
DR InterPro; IPR001525; C5_MeTfrase.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR000953; Chromo/chromo_shadow_dom.
DR InterPro; IPR023780; Chromo_domain.
DR InterPro; IPR023779; Chromodomain_CS.
DR InterPro; IPR029063; SAM-dependent_MTases_sf.
DR PANTHER; PTHR10629; CYTOSINE-SPECIFIC METHYLTRANSFERASE; 1.
DR PANTHER; PTHR10629:SF34; DNA (CYTOSINE-5)-METHYLTRANSFERASE CMT2; 1.
DR Pfam; PF01426; BAH; 1.
DR Pfam; PF00385; Chromo; 1.
DR Pfam; PF00145; DNA_methylase; 1.
DR PRINTS; PR00105; C5METTRFRASE.
DR SMART; SM00439; BAH; 1.
DR SMART; SM00298; CHROMO; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF53335; S-adenosyl-L-methionine-dependent methyltransferases; 1.
DR PROSITE; PS51038; BAH; 1.
DR PROSITE; PS00094; C5_MTASE_1; 1.
DR PROSITE; PS00598; CHROMO_1; 1.
DR PROSITE; PS50013; CHROMO_2; 1.
DR PROSITE; PS51679; SAM_MT_C5; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603, ECO:0000256|PROSITE-
KW ProRule:PRU01016}; Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000032304};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691,
KW ECO:0000256|PROSITE-ProRule:PRU01016};
KW Transferase {ECO:0000256|ARBA:ARBA00022679, ECO:0000256|PROSITE-
KW ProRule:PRU01016}.
FT DOMAIN 458..574
FT /note="BAH"
FT /evidence="ECO:0000259|PROSITE:PS51038"
FT DOMAIN 700..775
FT /note="Chromo"
FT /evidence="ECO:0000259|PROSITE:PS50013"
FT REGION 1..25
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 177..221
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 295..389
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..24
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 204..221
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 306..322
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 323..389
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT ACT_SITE 800
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01016"
SQ SEQUENCE 1056 AA; 118866 MW; CA45DB6B948818E1 CRC64;
MEPTENSDSD SKSQSLTLVT TGDDNVEQPL PLEICVPEEI AAGADDNKLL PRRSASIFQL
LPIRSVFSRK SDIETVLERC LRRSPRNSAL FVRESANVRP KAIVSLETEG CKRKIKESTT
VEGKNLRRSP RFTTISAAAE NKKVLSLSAK VMQSRSSKSQ RFKLKNKGAK IMNIERTNGM
GLRRSPRLTS APPETKGRAS KTIFKSSDKG SYSETRSSGK LNGKRLCLSK TEEADKGTFP
LRHNETNLRL IIERQLRRSL KFSQATKNGS SDISVRRLDM DEVVFSEEKL LGISPSSMHA
TENGDSNASF REHRREMSDE KQLKTPSSLS TLLAEGDSAK VNSSSNRLSN SCDEQPSKKF
KISSAESDMG TSDETFSKKA KGSSLSGKKK SQSKTDVIFI GNPIPDDEAQ ERWRWRYEMK
NTKSNRKLIS SDDDDDDDED KVVWNVECHY AQAEIDGCTI NLGDCVYIKG EEAKHHIGKI
LEFFKTTDGE NYFRVQWFYR AEDTVMKQEA AFHDERRVFY STVMNDNPID CIISKVSVTQ
ISPKLGLKSN SLPRSDFYFD MEYCVDYSTF CNLPPDNSFK SYSSSNCCKE VFPSTPAFSA
NIPSFGTYQA ELTLLDLYSG CGGMSTGLCL GAKASSIDLV TKWAVDSDKS ASKSLKLNHP
EAHVRNEAAD GFLRLLKEWE KLCKRYVVDN LERTYPSRFR ASEAVMKNAS PAKDADTSAD
ELEVSCLVDV CYGDPCNTGN RGLKFKVRWK GYSASDDTWE PIEGLSNCQE CIQEFVIKGF
RSKILPLCGD VDVICGGPPC QGISGYNRFR NVDSPLDDER NRQIVVFMDI VEYLKPKFVL
MENVVDILRL DKGSLGRYAL SRLVHMKYQA RLGIIAAGCY GLPQFRLRVF LWGAHPSEKL
PQFPLPSHDV IIRYWPPPEF ERNTVAYEEG QPRQLEDALL LRDAISDLPP VSNNEVREEM
TYEKPPETDF QRYIRSSKYV MTGSALDGAT RIRNLLYDHR PAPLSEDDYT RVCLIPKRKG
ANFRDLPGVI VGADNVARRD PTREKQFLPS GKPLLF
//