Database: UniProt
Entry: R9K5C5_9FIRM
ID   R9K5C5_9FIRM            Unreviewed;      1043 AA.
AC   R9K5C5;
DT   24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT   24-JUL-2013, sequence version 1.
DT   24-JAN-2024, entry version 42.
DE   RecName: Full=DNA (cytosine-5-)-methyltransferase {ECO:0000256|ARBA:ARBA00011975};
DE            EC=2.1.1.37 {ECO:0000256|ARBA:ARBA00011975};
GN   ORFNames=C809_04341 {ECO:0000313|EMBL:EOS41441.1};
OS   Lachnospiraceae bacterium MD335.
OC   Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae.
OX   NCBI_TaxID=1235793 {ECO:0000313|EMBL:EOS41441.1, ECO:0000313|Proteomes:UP000014081};
RN   [1] {ECO:0000313|EMBL:EOS41441.1, ECO:0000313|Proteomes:UP000014081}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=COE1 {ECO:0000313|EMBL:EOS41441.1,
RC   ECO:0000313|Proteomes:UP000014081};
RG   The Broad Institute Genomics Platform;
RG   The Broad Institute Genome Sequencing Center for Infectious Disease;
RA   Earl A., Xavier R., Elson C., Duck W., Walker B., Young S., Zeng Q.,
RA   Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA   Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA   Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA   Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA   Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA   Birren B.;
RT   "The Genome Sequence of Lachnospiraceae bacterium COE1.";
RL   Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EOS41441.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ASSW01000077; EOS41441.1; -; Genomic_DNA.
DR   AlphaFoldDB; R9K5C5; -.
DR   STRING; 1235793.C809_04341; -.
DR   PATRIC; fig|1235793.3.peg.4531; -.
DR   eggNOG; COG0270; Bacteria.
DR   eggNOG; COG1372; Bacteria.
DR   HOGENOM; CLU_006958_13_0_9; -.
DR   OrthoDB; 9813719at2; -.
DR   Proteomes; UP000014081; Unassembled WGS sequence.
DR   GO; GO:0003886; F:DNA (cytosine-5-)-methyltransferase activity; IEA:UniProtKB-EC.
DR   GO; GO:0004519; F:endonuclease activity; IEA:InterPro.
DR   GO; GO:0009307; P:DNA restriction-modification system; IEA:UniProtKB-KW.
DR   GO; GO:0016539; P:intein-mediated protein splicing; IEA:InterPro.
DR   CDD; cd00081; Hint; 2.
DR   Gene3D; 3.90.120.10; DNA Methylase, subunit A, domain 2; 1.
DR   Gene3D; 2.170.16.10; Hedgehog/Intein (Hint) domain; 2.
DR   Gene3D; 3.10.28.10; Homing endonucleases; 1.
DR   Gene3D; 3.40.50.150; Vaccinia Virus protein VP39; 2.
DR   InterPro; IPR001525; C5_MeTfrase.
DR   InterPro; IPR028992; Hedgehog/Intein_dom.
DR   InterPro; IPR003586; Hint_dom_C.
DR   InterPro; IPR003587; Hint_dom_N.
DR   InterPro; IPR036844; Hint_dom_sf.
DR   InterPro; IPR027434; Homing_endonucl.
DR   InterPro; IPR006142; INTEIN.
DR   InterPro; IPR030934; Intein_C.
DR   InterPro; IPR004042; Intein_endonuc.
DR   InterPro; IPR006141; Intein_N.
DR   InterPro; IPR004860; LAGLIDADG_2.
DR   InterPro; IPR029063; SAM-dependent_MTases_sf.
DR   NCBIfam; TIGR01443; intein_Cterm; 1.
DR   PANTHER; PTHR46098; TRNA (CYTOSINE(38)-C(5))-METHYLTRANSFERASE; 1.
DR   PANTHER; PTHR46098:SF1; TRNA (CYTOSINE(38)-C(5))-METHYLTRANSFERASE; 1.
DR   Pfam; PF00145; DNA_methylase; 2.
DR   Pfam; PF13403; Hint_2; 1.
DR   Pfam; PF14528; LAGLIDADG_3; 1.
DR   PRINTS; PR00379; INTEIN.
DR   SMART; SM00305; HintC; 1.
DR   SMART; SM00306; HintN; 1.
DR   SUPFAM; SSF51294; Hedgehog/intein (Hint) domain; 1.
DR   SUPFAM; SSF55608; Homing endonucleases; 1.
DR   SUPFAM; SSF53335; S-adenosyl-L-methionine-dependent methyltransferases; 2.
DR   PROSITE; PS50818; INTEIN_C_TER; 1.
DR   PROSITE; PS50819; INTEIN_ENDONUCLEASE; 1.
DR   PROSITE; PS50817; INTEIN_N_TER; 1.
PE   4: Predicted;
KW   Autocatalytic cleavage {ECO:0000256|ARBA:ARBA00022813};
KW   Methyltransferase {ECO:0000256|ARBA:ARBA00022603};
KW   Protein splicing {ECO:0000256|ARBA:ARBA00023000};
KW   Reference proteome {ECO:0000313|Proteomes:UP000014081};
KW   Restriction system {ECO:0000256|ARBA:ARBA00022747};
KW   S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW   Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT   DOMAIN          206..345
FT                   /note="DOD-type homing endonuclease"
FT                   /evidence="ECO:0000259|PROSITE:PS50819"
FT   DOMAIN          405..427
FT                   /note="Intein C-terminal splicing"
FT                   /evidence="ECO:0000259|PROSITE:PS50818"
SQ   SEQUENCE   1043 AA;  114842 MW;  0CA8F7330115C5E6 CRC64;
     MDENGKLTLG SLFSGSGAFE LGGMLAGIRP VFASEVEPFP IRVTTKRLPF VKHYGDVNSI
     RGDEVEPVDI ITFGSPCFPA GTLVLTDKGY TEIERIEVGM RVLTHKGRWR KVTAAGSRQA
     ETIVLKGNHY GLECTKNHPI YCSSESKIEN KIRIEEEKSW IPAADMKGRL WGVPRKIEKT
     QMISPHYSGS RKQKPMPLMD GDFFYFVGRW LGDGWVRDGQ RPGRPEGQCS GQIYLCDSYD
     KEDELRSIVE KVTSSYSVER CRTAIKFRFC GQVLCNWLTD NFGKYAGGKY IMPWVYTLPE
     EYRQAILDGL FDSDGYRPKE NEWRVTTISK KLAEGLRILG EVQGYSTTVF RTVPCEYRMI
     EGRKVTQKPC YMVAFSRNAS RPHLTDAAHA WYRVRSAEPT GEVKTVYNLT VEDDNSYVAD
     GIVVHNCQNL SIAGKRAGLD GKQSSLFYQA IRIIKEMRCA TNGRYPRFIV WENVPGAFSS
     NGGEDFRAVL EAVCSVKDGG IPVPEPPKGK WANAGCVMAD GFSLAWRVVD ACLWGVPQRR
     KRIYLVADFT GGSAGKILFE SEGVSGYTPQ GFRAWQGAAG GAAPGIGEAG GICLNDQGGQ
     YISVDSEMAC TLRAQSHGHP PCVMEAAGFC TEHSADSRGI GYEEETSPTL RAGTVPAAVA
     LENHPTDSRV KVSEDNMVQT LTSRMGTGGG NVPLVMDAAT PKTLKIRAGG GNGGKGALVQ
     DNKSATLSCN NDQTVFVPFC KGTRPHSAEE APTWENREVA NTLNTFDIGE SRCNELVVQA
     FGICSKESNA MKSDNPHSGF YEAQTARTLD CNCNNPSANQ GGIAVVAVQG SMIGRDDRNG
     PQGSGVNEDV CFSLTGADRH AVAYPTYCTS KNSYFMRAEK ELANTLVATD YKDPPVINDV
     RTASGKDVFG TISASMGSKQ WLGNQEAFSG DYHIVEPDYI VRRLTPTECA RLQGFPDWWC
     DGLGTENPTE EDMAFWREVF ETHRKVMGTS SKPKSDSQIR KWLKDPHSDS AEYRMWGNGC
     ALPNVYFVLC GIVYYAQFPE FLL
//