ID A0A3N2NL46_9BACT Unreviewed; 999 AA.
AC A0A3N2NL46;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 17.
DE RecName: Full=Glycoside hydrolase family 97 protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=EEL53_03990 {ECO:0000313|EMBL:ROT23175.1};
OS Muribaculaceae bacterium Isolate-114 (HZI).
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Muribaculaceae.
OX NCBI_TaxID=2486475 {ECO:0000313|EMBL:ROT23175.1, ECO:0000313|Proteomes:UP000273368};
RN [1] {ECO:0000313|EMBL:ROT23175.1, ECO:0000313|Proteomes:UP000273368}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Isolate-114 (HZI) {ECO:0000313|Proteomes:UP000273368};
RA Clavel T., Strowig T.;
RT "Sequence and cultivation study of Muribaculaceae reveals novel species,
RT host preference, and functional potential of this yet undescribed family.";
RL Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- COFACTOR:
CC Name=Ca(2+); Xref=ChEBI:CHEBI:29108;
CC Evidence={ECO:0000256|ARBA:ARBA00001913};
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ROT23175.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; RIBN01000004; ROT23175.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A3N2NL46; -.
DR Proteomes; UP000273368; Unassembled WGS sequence.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR GO; GO:0003824; F:catalytic activity; IEA:InterPro.
DR Gene3D; 2.70.98.10; -; 1.
DR Gene3D; 3.20.20.70; Aldolase class I; 1.
DR Gene3D; 3.60.10.10; Endonuclease/exonuclease/phosphatase; 1.
DR InterPro; IPR013785; Aldolase_TIM.
DR InterPro; IPR036691; Endo/exonu/phosph_ase_sf.
DR InterPro; IPR005135; Endo/exonuclease/phosphatase.
DR InterPro; IPR014718; GH-type_carb-bd.
DR InterPro; IPR029483; GH97_C.
DR InterPro; IPR019563; GH97_catalytic.
DR InterPro; IPR029486; GH97_N.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR PANTHER; PTHR35803:SF1; GLUCAN 1,4-ALPHA-GLUCOSIDASE SUSB; 1.
DR PANTHER; PTHR35803; GLUCAN 1,4-ALPHA-GLUCOSIDASE SUSB-RELATED; 1.
DR Pfam; PF03372; Exo_endo_phos; 1.
DR Pfam; PF14509; GH97_C; 1.
DR Pfam; PF14508; GH97_N; 1.
DR Pfam; PF10566; Glyco_hydro_97; 1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF56219; DNase I-like; 1.
PE 4: Predicted;
KW Calcium {ECO:0000256|ARBA:ARBA00022837};
KW Reference proteome {ECO:0000313|Proteomes:UP000273368}.
FT DOMAIN 23..269
FT /note="Glycosyl-hydrolase 97 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF14508"
FT DOMAIN 275..575
FT /note="Glycosyl-hydrolase 97 catalytic"
FT /evidence="ECO:0000259|Pfam:PF10566"
FT DOMAIN 577..678
FT /note="Glycosyl-hydrolase 97 C-terminal oligomerisation"
FT /evidence="ECO:0000259|Pfam:PF14509"
FT DOMAIN 701..986
FT /note="Endonuclease/exonuclease/phosphatase"
FT /evidence="ECO:0000259|Pfam:PF03372"
SQ SEQUENCE 999 AA; 113139 MW; 04964FBC133642A7 CRC64;
MRKIFIALGL ICVALGVWGD TFLKSPDGNL KVSFALTNKG VPTYSLSFKG TEIVRTSPMG
FSFATGTSLD KGFKVDGVTR TECDSVWSPV WGENSSIREN YNGMVVGLAK GDRRMNIEFR
AFNDGIGFRY IFPESKDSLL VVKEEKTSFA MAGDHTAWWI PLDYDSQEYE YTESRLSAIP
PSKGVQTSLM MKTDAGIYVN IHEAALVNFP AMHLDYNRKS KTFKSHLTPR PDGTAAHVTT
PFKTPWRTVM VSDKATDILA SNMILNLNDA CKIEDTSWIH PIKYMGVWWE MITGKSRWSY
TKAENFDLAT FDYSKATPHG RHGANNENVR RYIDFASRHG FDQLLIEGWN VGWEDWYGKE
KDYVFDFVTP YPDFDIAALN GYAKGKGIKL MMHHETSGSV ENYERHLDAA YDLMNRYGYD
AVKSGYVGRI LPKGEFHYSQ DIVNHYQDAV EKAAAKHIMV NGHEAVRPTG LCRTWPNLVA
NESAMGQEYA EMTPRHVTIL PFTRLQGGPM DFTPGIFRMD ITSFAPGYQG KRKRATIANQ
LGLYLTMYSP IQMAADMPEH YERHMDAFQF IKDVPVDWSE SRYLDAEPGD YIVVARKDRN
SENWFVGGVT NEEARDYELR FDFLPEGVEY ECTIYADAPD SDGFTNPEKY EIKRSRVTVA
SVIPVRMARA GGFAVSLKRV FSMLQLNIWQ ECTMVKGGYE ALVDEIWRLK PDFVTLSEVR
NYGGVDFTGK LCGSLRDKGV AYYSYPSEDS GIISRFPIKE FSTVYPLKDD SGTIYRAEVD
LGNTKISVYT AHLDWHNYSC YNARGYDGAT WKETSKPKSA EEILKLGSAS RRVEEIEAFL
KDAKGRMAEG YEILMGGDFN EPSFKDWIQE TAQMFDHNGF AVEWPVTKLL DDNGFTDMWR
ALCPDPVKNP GITYPSDVPT LPAEKICWAP KADDRDRIDF IFGSRRVKPL EAVIVGPRAS
MCRSERTEET SADSIATPAC TWFSDHKGIL AKFVISDYF
//