ID A0A4R2BLV7_9BACI Unreviewed; 891 AA.
AC A0A4R2BLV7;
DT 31-JUL-2019, integrated into UniProtKB/TrEMBL.
DT 31-JUL-2019, sequence version 1.
DT 08-NOV-2023, entry version 16.
DE SubName: Full=Endo-beta-N-acetylglucosaminidase D {ECO:0000313|EMBL:TCN27522.1};
GN ORFNames=EV146_102476 {ECO:0000313|EMBL:TCN27522.1};
OS Mesobacillus foraminis.
OC Bacteria; Bacillota; Bacilli; Bacillales; Bacillaceae; Mesobacillus.
OX NCBI_TaxID=279826 {ECO:0000313|EMBL:TCN27522.1, ECO:0000313|Proteomes:UP000295689};
RN [1] {ECO:0000313|EMBL:TCN27522.1, ECO:0000313|Proteomes:UP000295689}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CV53 {ECO:0000313|EMBL:TCN27522.1,
RC ECO:0000313|Proteomes:UP000295689};
RX PubMed=26203337; DOI=10.1186/s40793-015-0017-x;
RA Whitman W.B., Woyke T., Klenk H.P., Zhou Y., Lilburn T.G., Beck B.J.,
RA De Vos P., Vandamme P., Eisen J.A., Garrity G., Hugenholtz P.,
RA Kyrpides N.C.;
RT "Genomic Encyclopedia of Bacterial and Archaeal Type Strains, Phase III:
RT the genomes of soil and plant-associated and newly described type
RT strains.";
RL Stand. Genomic Sci. 10:26-26(2015).
CC -!- SUBCELLULAR LOCATION: Cytoplasm, cytosol
CC {ECO:0000256|ARBA:ARBA00004514}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:TCN27522.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; SLVV01000002; TCN27522.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A4R2BLV7; -.
DR Proteomes; UP000295689; Unassembled WGS sequence.
DR GO; GO:0005829; C:cytosol; IEA:UniProtKB-SubCell.
DR GO; GO:0033925; F:mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase activity; IEA:UniProtKB-EC.
DR GO; GO:0008152; P:metabolic process; IEA:UniProtKB-KW.
DR CDD; cd06547; GH85_ENGase; 1.
DR CDD; cd00146; PKD; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 2.
DR InterPro; IPR032979; ENGase.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR005201; Glyco_hydro_85.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR022409; PKD/Chitinase_dom.
DR InterPro; IPR000601; PKD_dom.
DR InterPro; IPR035986; PKD_dom_sf.
DR PANTHER; PTHR13246:SF1; CYTOSOLIC ENDO-BETA-N-ACETYLGLUCOSAMINIDASE; 1.
DR PANTHER; PTHR13246; ENDO BETA N-ACETYLGLUCOSAMINIDASE; 1.
DR Pfam; PF00754; F5_F8_type_C; 1.
DR Pfam; PF03644; Glyco_hydro_85; 1.
DR Pfam; PF18911; PKD_4; 1.
DR SMART; SM00089; PKD; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR SUPFAM; SSF49299; PKD domain; 1.
DR PROSITE; PS50022; FA58C_3; 1.
DR PROSITE; PS50093; PKD; 1.
PE 4: Predicted;
KW Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Reference proteome {ECO:0000313|Proteomes:UP000295689};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..29
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 30..891
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5020643878"
FT DOMAIN 662..747
FT /note="PKD"
FT /evidence="ECO:0000259|PROSITE:PS50093"
FT DOMAIN 741..888
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
SQ SEQUENCE 891 AA; 100073 MW; 52E222ABE98AE6A9 CRC64;
MKNVKKGYRW LIFLCGIFLV SLLPNTSYAK QPESSYWYPE QLLNWTPEND PDAVFNRSQI
PLRKQEVLYK VNDNAQSEAR LVALSALNPT TSGVPSQGGK AFLANTFSYW QYVDLMVYWA
GSAGEGIIVP PSADVIDAAH KNGVPILGNV FFPPIVYGGK VEWLNEMLIQ HEDGSFPAAD
KLLEVARYYG FDGWFINQET EGGDANTAQK MKDFLTYLQQ NKPEGMQIMW YDSMTKDGGI
RWQNALTDQN SSFLQDGKTR VSDSMFLNFW WRDQQSSNNK AQELGRSPYD LFAGIDVEAN
GTSTKIAWDG IFPEGKAPLT SLGIYRPDWA FKTAATMEDF YQKENEFWVG KSQNPAITND
NGAWKGMAHY FTAKTAIQEL PFITHFNTGS GKFFAADGKV VSTQSWNNRS LQDILPTWRW
LKEGSQTVDV DFDWDTAFYG GSSLKLSGTI EADNPTHIKL YKTNLPIQKD TEISMTYKTD
LKKTNIKLGV SFTDHPEKFS FLEVKKQSHN QWTTDTFKLK KYSDKKIAAI SLWIESDKEV
KDFTTNIGEI KIYNKHAEHN SIHSPNEGKV NKIEFNKGIY ADIPLSWKPS DSDVSHYEIY
RKLPNNEKEF VGATPNQAYY ISNLKRNGKE SSTTLEIVAV SKEYKRSKPE EISFEWPAYP
KPEADFGADK TVAAPGQEIQ FLNQSTEVTE EVEWQFEGGT PAVSHEQNPI VTYDKEGVYS
VTLIAKNSEG ESVLTKKEMI TISEAVKDIK NIAMNKTASA SGQCAPAEAA AYALDGKVTD
NSKWCALGNA PHWLKVDLGA QYAISKFVVH HAQAGGEPQA FNTQALRIEV STDGENWTEA
VKVTDNTVAV SEHSISLTNG RYVRLWIDKP TQGSDQAARI FEFEVHGFLA P
//