ID A0A1A6HN46_NEOLE Unreviewed; 1396 AA.
AC A0A1A6HN46;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 24-JAN-2024, entry version 29.
DE RecName: Full=DNA (cytosine-5)-methyltransferase 1 {ECO:0000256|ARBA:ARBA00020876};
DE EC=2.1.1.37 {ECO:0000256|ARBA:ARBA00011975};
DE Flags: Fragment;
GN ORFNames=A6R68_18151 {ECO:0000313|EMBL:OBS79400.1};
OS Neotoma lepida (Desert woodrat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Neotominae; Neotoma.
OX NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS79400.1, ECO:0000313|Proteomes:UP000092124};
RN [1] {ECO:0000313|EMBL:OBS79400.1, ECO:0000313|Proteomes:UP000092124}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=417 {ECO:0000313|EMBL:OBS79400.1};
RC TISSUE=Liver {ECO:0000313|EMBL:OBS79400.1};
RA Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT lepida.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the class I-like SAM-binding methyltransferase
CC superfamily. C5-methyltransferase family. {ECO:0000256|PROSITE-
CC ProRule:PRU01016}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OBS79400.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LZPO01020470; OBS79400.1; -; Genomic_DNA.
DR STRING; 56216.A0A1A6HN46; -.
DR Proteomes; UP000092124; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003682; F:chromatin binding; IEA:InterPro.
DR GO; GO:0003886; F:DNA (cytosine-5-)-methyltransferase activity; IEA:UniProtKB-EC.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd04760; BAH_Dnmt1_I; 1.
DR CDD; cd04711; BAH_Dnmt1_II; 1.
DR Gene3D; 1.10.10.2230; -; 2.
DR Gene3D; 2.30.30.490; -; 2.
DR Gene3D; 3.90.120.10; DNA Methylase, subunit A, domain 2; 1.
DR Gene3D; 3.40.50.150; Vaccinia Virus protein VP39; 1.
DR InterPro; IPR001025; BAH_dom.
DR InterPro; IPR043151; BAH_sf.
DR InterPro; IPR018117; C5_DNA_meth_AS.
DR InterPro; IPR001525; C5_MeTfrase.
DR InterPro; IPR031303; C5_meth_CS.
DR InterPro; IPR022702; Cytosine_MeTrfase1_RFD.
DR InterPro; IPR010506; DMAP1-bd.
DR InterPro; IPR017198; DNMT1-like.
DR InterPro; IPR029063; SAM-dependent_MTases_sf.
DR InterPro; IPR002857; Znf_CXXC.
DR PANTHER; PTHR10629; CYTOSINE-SPECIFIC METHYLTRANSFERASE; 1.
DR PANTHER; PTHR10629:SF52; DNA (CYTOSINE-5)-METHYLTRANSFERASE 1; 1.
DR Pfam; PF01426; BAH; 2.
DR Pfam; PF06464; DMAP_binding; 1.
DR Pfam; PF00145; DNA_methylase; 1.
DR Pfam; PF12047; DNMT1-RFD; 1.
DR Pfam; PF02008; zf-CXXC; 1.
DR PIRSF; PIRSF037404; DNMT1; 5.
DR PRINTS; PR00105; C5METTRFRASE.
DR SMART; SM00439; BAH; 2.
DR SMART; SM01137; DMAP_binding; 1.
DR SUPFAM; SSF53335; S-adenosyl-L-methionine-dependent methyltransferases; 1.
DR PROSITE; PS51038; BAH; 2.
DR PROSITE; PS00094; C5_MTASE_1; 1.
DR PROSITE; PS00095; C5_MTASE_2; 1.
DR PROSITE; PS51912; DMAP1_BIND; 1.
DR PROSITE; PS51679; SAM_MT_C5; 1.
DR PROSITE; PS51058; ZF_CXXC; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723,
KW ECO:0000256|PIRSR:PIRSR037404-3};
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603, ECO:0000256|PROSITE-
KW ProRule:PRU01016}; Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000092124};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691,
KW ECO:0000256|PROSITE-ProRule:PRU01016};
KW Transferase {ECO:0000256|ARBA:ARBA00022679, ECO:0000256|PROSITE-
KW ProRule:PRU01016};
KW Zinc {ECO:0000256|ARBA:ARBA00022833, ECO:0000256|PIRSR:PIRSR037404-3};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00509}.
FT DOMAIN 1..82
FT /note="DMAP1-binding"
FT /evidence="ECO:0000259|PROSITE:PS51912"
FT DOMAIN 457..503
FT /note="CXXC-type"
FT /evidence="ECO:0000259|PROSITE:PS51058"
FT DOMAIN 566..692
FT /note="BAH"
FT /evidence="ECO:0000259|PROSITE:PS51038"
FT DOMAIN 784..911
FT /note="BAH"
FT /evidence="ECO:0000259|PROSITE:PS51038"
FT REGION 76..185
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 517..537
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 76..103
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 105..154
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 163..177
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT ACT_SITE 1005
FT /evidence="ECO:0000256|PIRSR:PIRSR037404-1,
FT ECO:0000256|PROSITE-ProRule:PRU01016"
FT BINDING 188
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000256|PIRSR:PIRSR037404-3"
FT BINDING 191
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000256|PIRSR:PIRSR037404-3"
FT BINDING 249
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000256|PIRSR:PIRSR037404-3"
FT BINDING 253
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /evidence="ECO:0000256|PIRSR:PIRSR037404-3"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:OBS79400.1"
SQ SEQUENCE 1396 AA; 158091 MW; E5EDE62D61E338CA CRC64;
LKDLERDGLT EKECVKEKLN LLHEFLQTEI KSQLYDLETK LHKEELSEEG YLAKVKSLLN
KDLSLENGTL SLTRKANNSE ASSSSMATRR TTRQTTITSH FTKGPTKRKP KEECEERTSN
ESVAEKDQDK RRLRPRSQPK DLATKRRPKE EELEDLTAET PEDGDEEERK RAARTKSAMT
PKMDLPRCPQ CGQYLDDPDL KYQQHLPDAV EELQMLVSEK LSVYDETNSA GERIDSSPQH
KVTCFSVYCN RGHLCPVDTG LIEKNIELYF SGCAKAIYDD NPSLEGGINC KNLGPINEWW
ITGFDGGEKA LIGFSTSFAE YILMEPSPDY APIFGLMQXK IYISKIVVEF LQGNPHAVYE
DLINKIEFVV NQVESYDDAK DSDETPIFLT PCMRALIGLA GVTLGQRRAE RRHNLKHSAK
EKDKGPTKAT TTKLVYQIFD TFFSEQIEKD DKDEKENAFK RRRCGVCEVC QQPECGKCKA
CKDMVKFGGT GRSKQACLKR RCPNLAVKEA DDDEEVDDYI PEVPSPKKMH QGKKKKQNKD
RISWLGQPVK IEEKRTYYQK VCIDEETLEV GDCVSVIPDD SSKPLYLARV TALWEDKNGQ
MFHAHWFCAG TDTVLGATSD PLELFLVGEC ENMQLSYIHS KVKVVHRAPS ENWAMEGGMD
PETMLPGAED DGKTYFYQFW YNQDYARFES PPNTQPTEDN KHKFCLSCIR LGELRHKEMP
KVLEQLDEVD GRICCSSITK NGVVYRVGDS VYLPPEAFTF NIKLASPLKR TKKEPVDENL
YPEHYRKYSD YIKGSNLDAP EPYRIGRIKA IHCGKKNGKA NEADIKIRLN KFYRPENTHR
STSATYHSDI NLLYWSDEEA VVDFSDVQGR CTVEYGEDLP ESIQDYSQGG PDRFYFLEAK
CQVSEPREPE TTIKLPKLRT LDVFSGCGGL SEGFHQAGIS ETLWAIEMWD PAAQAFRLNN
PGSTVFTEDC NVLLKLVMAG EVTNSLGQRL PQKGDVEMLC GGPPCQGFSG MNRFNSRTYS
KFKNSLVVSF LSYCDYYRPR FFLLENVRNF VSFKRSMVLK LTLXCLVRMG YQCTFGVLQA
GQYGVAQTRR RAIILAAAPG EKLPLFPEPL HVFAPRACQL SVVVDDKKFV SNITRLSSGP
FRTITVRDTM SDLPEIQNGA SASEISYNGE PQSWFQRQLR GSHYQPILRD HICKDMSPLV
AARMRNIPLF PGSDWRDLPN IEVPLSDGTT TKKLRYTFHD RKNGCSSTGA MRGVCSCVEA
GKACDPTARQ FNTLIPWCLP HTGNRHNHWA GLYGRLEWDG FFSTTVTNPE PMGKQGRVLH
PEQHRVVSVR ECARSQGFPD TYRLFGNILD RHRQVGNAVP PPLAKAIGLE IKLCMLAKAQ
EIASAAVKVK ETATMD
//