ID R9K5C5_9FIRM Unreviewed; 1043 AA.
AC R9K5C5;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 42.
DE RecName: Full=DNA (cytosine-5-)-methyltransferase {ECO:0000256|ARBA:ARBA00011975};
DE EC=2.1.1.37 {ECO:0000256|ARBA:ARBA00011975};
GN ORFNames=C809_04341 {ECO:0000313|EMBL:EOS41441.1};
OS Lachnospiraceae bacterium MD335.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae.
OX NCBI_TaxID=1235793 {ECO:0000313|EMBL:EOS41441.1, ECO:0000313|Proteomes:UP000014081};
RN [1] {ECO:0000313|EMBL:EOS41441.1, ECO:0000313|Proteomes:UP000014081}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=COE1 {ECO:0000313|EMBL:EOS41441.1,
RC ECO:0000313|Proteomes:UP000014081};
RG The Broad Institute Genomics Platform;
RG The Broad Institute Genome Sequencing Center for Infectious Disease;
RA Earl A., Xavier R., Elson C., Duck W., Walker B., Young S., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Lachnospiraceae bacterium COE1.";
RL Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EOS41441.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ASSW01000077; EOS41441.1; -; Genomic_DNA.
DR AlphaFoldDB; R9K5C5; -.
DR STRING; 1235793.C809_04341; -.
DR PATRIC; fig|1235793.3.peg.4531; -.
DR eggNOG; COG0270; Bacteria.
DR eggNOG; COG1372; Bacteria.
DR HOGENOM; CLU_006958_13_0_9; -.
DR OrthoDB; 9813719at2; -.
DR Proteomes; UP000014081; Unassembled WGS sequence.
DR GO; GO:0003886; F:DNA (cytosine-5-)-methyltransferase activity; IEA:UniProtKB-EC.
DR GO; GO:0004519; F:endonuclease activity; IEA:InterPro.
DR GO; GO:0009307; P:DNA restriction-modification system; IEA:UniProtKB-KW.
DR GO; GO:0016539; P:intein-mediated protein splicing; IEA:InterPro.
DR CDD; cd00081; Hint; 2.
DR Gene3D; 3.90.120.10; DNA Methylase, subunit A, domain 2; 1.
DR Gene3D; 2.170.16.10; Hedgehog/Intein (Hint) domain; 2.
DR Gene3D; 3.10.28.10; Homing endonucleases; 1.
DR Gene3D; 3.40.50.150; Vaccinia Virus protein VP39; 2.
DR InterPro; IPR001525; C5_MeTfrase.
DR InterPro; IPR028992; Hedgehog/Intein_dom.
DR InterPro; IPR003586; Hint_dom_C.
DR InterPro; IPR003587; Hint_dom_N.
DR InterPro; IPR036844; Hint_dom_sf.
DR InterPro; IPR027434; Homing_endonucl.
DR InterPro; IPR006142; INTEIN.
DR InterPro; IPR030934; Intein_C.
DR InterPro; IPR004042; Intein_endonuc.
DR InterPro; IPR006141; Intein_N.
DR InterPro; IPR004860; LAGLIDADG_2.
DR InterPro; IPR029063; SAM-dependent_MTases_sf.
DR NCBIfam; TIGR01443; intein_Cterm; 1.
DR PANTHER; PTHR46098; TRNA (CYTOSINE(38)-C(5))-METHYLTRANSFERASE; 1.
DR PANTHER; PTHR46098:SF1; TRNA (CYTOSINE(38)-C(5))-METHYLTRANSFERASE; 1.
DR Pfam; PF00145; DNA_methylase; 2.
DR Pfam; PF13403; Hint_2; 1.
DR Pfam; PF14528; LAGLIDADG_3; 1.
DR PRINTS; PR00379; INTEIN.
DR SMART; SM00305; HintC; 1.
DR SMART; SM00306; HintN; 1.
DR SUPFAM; SSF51294; Hedgehog/intein (Hint) domain; 1.
DR SUPFAM; SSF55608; Homing endonucleases; 1.
DR SUPFAM; SSF53335; S-adenosyl-L-methionine-dependent methyltransferases; 2.
DR PROSITE; PS50818; INTEIN_C_TER; 1.
DR PROSITE; PS50819; INTEIN_ENDONUCLEASE; 1.
DR PROSITE; PS50817; INTEIN_N_TER; 1.
PE 4: Predicted;
KW Autocatalytic cleavage {ECO:0000256|ARBA:ARBA00022813};
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603};
KW Protein splicing {ECO:0000256|ARBA:ARBA00023000};
KW Reference proteome {ECO:0000313|Proteomes:UP000014081};
KW Restriction system {ECO:0000256|ARBA:ARBA00022747};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 206..345
FT /note="DOD-type homing endonuclease"
FT /evidence="ECO:0000259|PROSITE:PS50819"
FT DOMAIN 405..427
FT /note="Intein C-terminal splicing"
FT /evidence="ECO:0000259|PROSITE:PS50818"
SQ SEQUENCE 1043 AA; 114842 MW; 0CA8F7330115C5E6 CRC64;
MDENGKLTLG SLFSGSGAFE LGGMLAGIRP VFASEVEPFP IRVTTKRLPF VKHYGDVNSI
RGDEVEPVDI ITFGSPCFPA GTLVLTDKGY TEIERIEVGM RVLTHKGRWR KVTAAGSRQA
ETIVLKGNHY GLECTKNHPI YCSSESKIEN KIRIEEEKSW IPAADMKGRL WGVPRKIEKT
QMISPHYSGS RKQKPMPLMD GDFFYFVGRW LGDGWVRDGQ RPGRPEGQCS GQIYLCDSYD
KEDELRSIVE KVTSSYSVER CRTAIKFRFC GQVLCNWLTD NFGKYAGGKY IMPWVYTLPE
EYRQAILDGL FDSDGYRPKE NEWRVTTISK KLAEGLRILG EVQGYSTTVF RTVPCEYRMI
EGRKVTQKPC YMVAFSRNAS RPHLTDAAHA WYRVRSAEPT GEVKTVYNLT VEDDNSYVAD
GIVVHNCQNL SIAGKRAGLD GKQSSLFYQA IRIIKEMRCA TNGRYPRFIV WENVPGAFSS
NGGEDFRAVL EAVCSVKDGG IPVPEPPKGK WANAGCVMAD GFSLAWRVVD ACLWGVPQRR
KRIYLVADFT GGSAGKILFE SEGVSGYTPQ GFRAWQGAAG GAAPGIGEAG GICLNDQGGQ
YISVDSEMAC TLRAQSHGHP PCVMEAAGFC TEHSADSRGI GYEEETSPTL RAGTVPAAVA
LENHPTDSRV KVSEDNMVQT LTSRMGTGGG NVPLVMDAAT PKTLKIRAGG GNGGKGALVQ
DNKSATLSCN NDQTVFVPFC KGTRPHSAEE APTWENREVA NTLNTFDIGE SRCNELVVQA
FGICSKESNA MKSDNPHSGF YEAQTARTLD CNCNNPSANQ GGIAVVAVQG SMIGRDDRNG
PQGSGVNEDV CFSLTGADRH AVAYPTYCTS KNSYFMRAEK ELANTLVATD YKDPPVINDV
RTASGKDVFG TISASMGSKQ WLGNQEAFSG DYHIVEPDYI VRRLTPTECA RLQGFPDWWC
DGLGTENPTE EDMAFWREVF ETHRKVMGTS SKPKSDSQIR KWLKDPHSDS AEYRMWGNGC
ALPNVYFVLC GIVYYAQFPE FLL
//