ID D2MPZ7_9FIRM Unreviewed; 848 AA.
AC D2MPZ7;
DT 02-MAR-2010, integrated into UniProtKB/TrEMBL.
DT 02-MAR-2010, sequence version 1.
DT 28-JUN-2023, entry version 44.
DE SubName: Full=Mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase {ECO:0000313|EMBL:EFC05450.1};
DE Flags: Fragment;
GN ORFNames=HMPREF9013_0387 {ECO:0000313|EMBL:EFC05450.1};
OS Bulleidia extructa W1219.
OC Bacteria; Bacillota; Erysipelotrichia; Erysipelotrichales;
OC Erysipelotrichaceae; Bulleidia.
OX NCBI_TaxID=679192 {ECO:0000313|EMBL:EFC05450.1, ECO:0000313|Proteomes:UP000005017};
RN [1] {ECO:0000313|Proteomes:UP000005017}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=W1219 {ECO:0000313|Proteomes:UP000005017};
RA Madupu R., Durkin A.S., Torralba M., Methe B., Sutton G.G.,
RA Strausberg R.L., Nelson K.E.;
RT "Sequence of Clostridiales genomosp. BVAB3 str. UPII9-5.";
RL Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EFC05450.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADFR01000014; EFC05450.1; -; Genomic_DNA.
DR RefSeq; WP_006627460.1; NZ_ADFR01000014.1.
DR AlphaFoldDB; D2MPZ7; -.
DR STRING; 679192.HMPREF9013_0387; -.
DR eggNOG; COG4193; Bacteria.
DR eggNOG; COG5263; Bacteria.
DR Proteomes; UP000005017; Unassembled WGS sequence.
DR GO; GO:0004040; F:amidase activity; IEA:InterPro.
DR Gene3D; 1.10.530.10; -; 1.
DR Gene3D; 2.10.270.10; Cholin Binding; 6.
DR InterPro; IPR018337; Cell_wall/Cho-bd_repeat.
DR InterPro; IPR002901; MGlyc_endo_b_GlcNAc-like_dom.
DR Pfam; PF01473; Choline_bind_1; 4.
DR Pfam; PF19127; Choline_bind_3; 3.
DR Pfam; PF01832; Glucosaminidase; 1.
DR SMART; SM00047; LYZ2; 1.
DR SUPFAM; SSF69360; Cell wall binding repeat; 2.
DR PROSITE; PS51170; CW; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000005017};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..30
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 31..848
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003032877"
FT DOMAIN 277..412
FT /note="Mannosyl-glycoprotein endo-beta-N-
FT acetylglucosamidase-like"
FT /evidence="ECO:0000259|SMART:SM00047"
FT REPEAT 556..575
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REPEAT 662..681
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REPEAT 744..763
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT NON_TER 848
FT /evidence="ECO:0000313|EMBL:EFC05450.1"
SQ SEQUENCE 848 AA; 97210 MW; 22DEC2ADC7951461 CRC64;
MNLKKKIACI SLSFICALIL LPNRFTSVKA ANPYRMNCSN FEVALAKVDS KKQGYFEKVG
CYASFDAAKS VMKEKGHAAV VRHSSMDSSN KIIAMNTGVV YTDVSSSHLA LLKMVGHSGH
ITYARSNRMA FYHDTFHYDN GHGDVWLTLS GFKGTTDIHN LEFVPMVWMN NGTYSYVNGA
SVTKEVPYFS AYTSQGRKEI LFTAKDQVGN TYFNAAFGIA PSWMKENKRY YSSNDIDFYE
DPQLTKYSGT YYNYYQFLPL RTKSKIPASK YNEFLKKMGK NSSSKLWNTG DLFVKAQEHF
GLNALMVFSQ ACVESAYGNS YLATNRNNLF GWKAYDSNPN GATGYASVKQ CIDQAFRDNI
RDYVSTNEPV YYGEHFGNKG SGITMKYASS TTYGLTVAAV AYSFDKFAGL VDHQSVHFGV
LKDKNVNVYN QPSNKANVLY KAGYGMDNAS PYYNNHVVAV LGTFKDYYKI QSTDYHDGSK
VITSRNNHSD RAYDWNQMIG YVKKSDLQLV TDSDKKTGWV DRNGKKYWFN EKGKPEPGWK
TIQGNRYWFH LDGRAETGWS PIDGKIYHFG PDGKMESGWF SYKGHLYYLN TIEDGHMETG
WARIAGYKYH FGSDGKLDTG WFHYGGNTYY LDPKSNGRMV TGKVTIDGKD YVFGEDGVLV
QKNGWKQING KKYWFNEKGE PEPGWKDIDG KRYWFYPDGQ VETGWRTIEG KKYHFNDAGE
LDYGWFTYQG NQYYLNPELN GATQTGWKEI KGELYWFYPD GKMETGWSSI AGEQYHFKPD
GRLDTGWFEY KGNRYYLDPK SKGRMVKGLF QINQKTYYFD EKGWMKNDGW HTINGHRYWF
TVEGYAEV
//