ID A0A368VEI2_9ACTN Unreviewed; 840 AA.
AC A0A368VEI2;
DT 07-NOV-2018, integrated into UniProtKB/TrEMBL.
DT 07-NOV-2018, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE SubName: Full=Beta-glucosidase {ECO:0000313|EMBL:RCW39113.1};
GN ORFNames=DFQ14_1184 {ECO:0000313|EMBL:RCW39113.1};
OS Halopolyspora algeriensis.
OC Bacteria; Actinomycetota; Actinomycetes; Actinomycetes incertae sedis;
OC Halopolyspora.
OX NCBI_TaxID=1500506 {ECO:0000313|EMBL:RCW39113.1, ECO:0000313|Proteomes:UP000253495};
RN [1] {ECO:0000313|EMBL:RCW39113.1, ECO:0000313|Proteomes:UP000253495}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CECT 8575 {ECO:0000313|EMBL:RCW39113.1,
RC ECO:0000313|Proteomes:UP000253495};
RA Whitman W.;
RT "Genomic Encyclopedia of Type Strains, Phase III (KMG-III): the genomes of
RT soil and plant-associated and newly described type strains.";
RL Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 3 family.
CC {ECO:0000256|ARBA:ARBA00005336}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RCW39113.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QPJC01000018; RCW39113.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A368VEI2; -.
DR OrthoDB; 3187421at2; -.
DR Proteomes; UP000253495; Unassembled WGS sequence.
DR GO; GO:0008422; F:beta-glucosidase activity; IEA:UniProtKB-EC.
DR GO; GO:0102483; F:scopolin beta-glucosidase activity; IEA:UniProtKB-EC.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 3.40.50.1700; Glycoside hydrolase family 3 C-terminal domain; 1.
DR Gene3D; 3.20.20.300; Glycoside hydrolase, family 3, N-terminal domain; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR026891; Fn3-like.
DR InterPro; IPR002772; Glyco_hydro_3_C.
DR InterPro; IPR036881; Glyco_hydro_3_C_sf.
DR InterPro; IPR001764; Glyco_hydro_3_N.
DR InterPro; IPR036962; Glyco_hydro_3_N_sf.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR037524; PA14/GLEYA.
DR PANTHER; PTHR42715; BETA-GLUCOSIDASE; 1.
DR PANTHER; PTHR42715:SF10; BETA-GLUCOSIDASE F-RELATED; 1.
DR Pfam; PF14310; Fn3-like; 1.
DR Pfam; PF00933; Glyco_hydro_3; 1.
DR Pfam; PF01915; Glyco_hydro_3_C; 1.
DR PRINTS; PR00133; GLHYDRLASE3.
DR SMART; SM01217; Fn3_like; 1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF52279; Beta-D-glucan exohydrolase, C-terminal domain; 1.
DR PROSITE; PS51820; PA14; 1.
PE 3: Inferred from homology;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Reference proteome {ECO:0000313|Proteomes:UP000253495}.
FT DOMAIN 423..576
FT /note="PA14"
FT /evidence="ECO:0000259|PROSITE:PS51820"
FT REGION 299..320
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 840 AA; 90789 MW; 2B4EC8DA5E05960A CRC64;
MTSAPDVDRP EPDEEKLRDL VEELTVPEKA RLVTGADLWT LPPIPRIGLH RLVLSDGPNG
VRGPTWDERD NSLLLPVGSA VAATWDRSAA RRVGELLGDE ARRRGVHAVL APTVNVHRSP
FGGRNFENPS EDPRLISELG SEIVTGIQSR GVGAAPKHFV ANDSETERFS YDVDVDERTL
RELYLTPFEY IVTRTRPWML MTAYNSVNGR AMTENARLVN EILKGEWSWD GVNVSDWYAT
RDLEGAAHGG LDLAMPGPDS PFGGGALVQA VTEGRIPGEV LDAKVRRILR LAARTGALGE
PFDEQDDPSG TSGRPAAPDE ARPVIRHLAA RGFVLLRNES AEGTPLLPLD ASALSTVAVV
GENATLPAVQ GGGSSQVNPP RIVTPLEGIR GVLDRAEVVH RRGVRHRRLL DPIPRERVTD
PDEGGTGVRV DYLDENAQVL SSEVRGTNRL LWPGRQELPQ RATGIRLRGR VELDGPGEHT
FAVHGAGRFR VTVAGREVFD GRVPDEERPD RATGDSTANM PEQRISVVLM DSELDDHGTV
LLDVEHTPAA GAIFPGVGLG YFRQDRDEQS ELTAAVQAAR EADVAVVVVG THEEIETEGR
DRSSLELPGG QDDLVRAVAA VNPRTVVVLN AGAPVLMPWM SEVPAVLWTW FGTQEYGSAL
ADVLFGAEEP GGRLPMTLPA ALTDVPVPLP GVQPVEGTLT YSEGTLVGYR AYSARGVAPL
FPFGHGLGYT TWEYGGAAAE SAEDGVRLRV AVRNAGTRAG REIVQVYERG PEGEPPRLVG
FTVVEAEPGE GDTVSIDIEQ RAFAHYDVDL PGWSVLSGEH ELLIGRSLSD IRRTVIAQYP
//