ID A0A232F1I1_9HYME Unreviewed; 1106 AA.
AC A0A232F1I1;
DT 25-OCT-2017, integrated into UniProtKB/TrEMBL.
DT 25-OCT-2017, sequence version 1.
DT 24-JAN-2024, entry version 23.
DE RecName: Full=DNA (cytosine-5-)-methyltransferase {ECO:0000256|ARBA:ARBA00011975};
DE EC=2.1.1.37 {ECO:0000256|ARBA:ARBA00011975};
GN ORFNames=TSAR_010295 {ECO:0000313|EMBL:OXU24389.1};
OS Trichomalopsis sarcophagae.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Proctotrupomorpha;
OC Chalcidoidea; Pteromalidae; Pteromalinae; Trichomalopsis.
OX NCBI_TaxID=543379 {ECO:0000313|EMBL:OXU24389.1, ECO:0000313|Proteomes:UP000215335};
RN [1] {ECO:0000313|EMBL:OXU24389.1, ECO:0000313|Proteomes:UP000215335}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Alberta {ECO:0000313|EMBL:OXU24389.1,
RC ECO:0000313|Proteomes:UP000215335};
RC TISSUE=Whole body {ECO:0000313|EMBL:OXU24389.1};
RX PubMed=28648823; DOI=10.1016/j.cub.2017.05.032;
RA Martinson E.O., Mrinalini, Kelkar Y.D., Chang C.H., Werren J.H.;
RT "The Evolution of Venom by Co-option of Single-Copy Genes.";
RL Curr. Biol. 27:2007-2013(2017).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the class I-like SAM-binding methyltransferase
CC superfamily. C5-methyltransferase family. {ECO:0000256|PROSITE-
CC ProRule:PRU01016}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OXU24389.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NNAY01001314; OXU24389.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A232F1I1; -.
DR STRING; 543379.A0A232F1I1; -.
DR Proteomes; UP000215335; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003886; F:DNA (cytosine-5-)-methyltransferase activity; IEA:UniProtKB-EC.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0010468; P:regulation of gene expression; IEA:InterPro.
DR CDD; cd11725; ADDz_Dnmt3; 1.
DR Gene3D; 2.30.30.140; -; 2.
DR Gene3D; 3.40.50.150; Vaccinia Virus protein VP39; 1.
DR InterPro; IPR025766; ADD.
DR InterPro; IPR018117; C5_DNA_meth_AS.
DR InterPro; IPR001525; C5_MeTfrase.
DR InterPro; IPR040552; DNMT3_ADD.
DR InterPro; IPR000313; PWWP_dom.
DR InterPro; IPR029063; SAM-dependent_MTases_sf.
DR PANTHER; PTHR23068; DNA CYTOSINE-5- -METHYLTRANSFERASE 3-RELATED; 1.
DR PANTHER; PTHR23068:SF52; DUF3444 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF17980; ADD_DNMT3; 1.
DR Pfam; PF21255; ADDz_Dnmt3b; 1.
DR Pfam; PF00145; DNA_methylase; 1.
DR Pfam; PF00855; PWWP; 1.
DR SMART; SM00293; PWWP; 1.
DR SUPFAM; SSF53335; S-adenosyl-L-methionine-dependent methyltransferases; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 2.
DR PROSITE; PS51533; ADD; 1.
DR PROSITE; PS00094; C5_MTASE_1; 1.
DR PROSITE; PS50812; PWWP; 1.
DR PROSITE; PS51679; SAM_MT_C5; 1.
PE 3: Inferred from homology;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603, ECO:0000256|PROSITE-
KW ProRule:PRU01016}; Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000215335};
KW Repressor {ECO:0000256|ARBA:ARBA00022491};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691,
KW ECO:0000256|PROSITE-ProRule:PRU01016};
KW Transferase {ECO:0000256|ARBA:ARBA00022679, ECO:0000256|PROSITE-
KW ProRule:PRU01016}; Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 522..580
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT DOMAIN 683..819
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS51533"
FT ACT_SITE 911
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01016"
SQ SEQUENCE 1106 AA; 128607 MW; 58392D007DA756B4 CRC64;
MLLCRDHEYF LGVFLDFNFM ITAYVNIHQK INHASYWSSS LWREMMAMDL FDDHNYCKLN
KSNVKIDKAE NYDNFNISQE NILSYKINNN ELMEFDVQNC VEIQCNDEMQ NDISEDVINF
NSINTNTESR YFNNDATRLN CNEIAKDVNF INTLECMNFN NISNVINSSI ATENEHAMND
SMANSGSQCL INNLPPNNHN STFTSYSTED SGIESMDSLL FTESEDSFLL NSRDDHCYSF
SKERDGAKKS YFNKRKAENS SAASILTNIE NSDVIKDKSE VPVKRNRTTK RSREVIISEE
SPIIEGKYEE SRRVTRLLVK NSDEKPVKYI DNTPGKLVWG YFRSGWWPAL IIRAEDAGMI
PSSEKIWVSW IGESRISELN AKCVDKFSND LERRIDNLAT NPKTKVTCKK QKDEACFKTI
QLLKKHFTGG ALVKPYIAWI KNNILPYKNK LDELLFYPYP EILSDKLNNL KIVNSEKNEK
YLRQQEKERL SKALEPKNLI KKIEKLEKPI EKGGINIVDQ KYGMIVWAKM QGYSWWPCVI
MDYQHLNRKQ PHVAHQWVMW YGDYKYSQVQ YRQILTFPTG MDRMESKITA TKDELFCKAV
LQAAKDYCDK LGYLTEPWKI KDVIHLFYKS KDIYKLKNAE LTEPNEEDLY SQTIKKQLRK
QINNQPISEG RKKKILECKN LNLLLSGKLP LESLCISCLE SGEELEDHPF FHASMCEKCL
EDFSPKIFAY GNDAKCFYCT LCGGDDLVAV CDSMSCPRVF CTACIKYIIC PEFYEDILLK
HPWYCFLCDP SSISTNNVVI KIRNDWRYKM ISLYRINCDE EAPSNLERLD RNRKIRVLSL
FDGIGTGLVV LKHLNVNIEC YYASEIDPDS MQVSFFNHGD EIIQLGDVRN IDEKKIKEIA
PIDLLIGGSP CNELSLANPK RRGLDDPEGT GILFYDYVRI MKLVKKHNKK RHLFWLFENV
ASMPKKFRNQ ISKNLGREPK FLDSADFSAQ HRPRLYWGNL PWGPYQVNNV VLQDVLRKRC
NRQALVKKIM TVTTRTNSLN QTKENLKPVL MDGKKDMLWV TELEKIFGFP MHYTDANLQK
TRRLQLIGKA WSVQTLTAIL RPVFLF
//