ID A0A0A6PA32_9GAMM Unreviewed; 704 AA.
AC A0A0A6PA32;
DT 04-FEB-2015, integrated into UniProtKB/TrEMBL.
DT 04-FEB-2015, sequence version 1.
DT 24-JAN-2024, entry version 30.
DE RecName: Full=Glycoside hydrolase family 5 domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=PN36_34300 {ECO:0000313|EMBL:KHD11286.1};
OS Candidatus Thiomargarita nelsonii.
OC Bacteria; Pseudomonadota; Gammaproteobacteria; Thiotrichales;
OC Thiotrichaceae; Thiomargarita.
OX NCBI_TaxID=1003181 {ECO:0000313|EMBL:KHD11286.1, ECO:0000313|Proteomes:UP000030428};
RN [1] {ECO:0000313|EMBL:KHD11286.1, ECO:0000313|Proteomes:UP000030428}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Hydrate Ridge {ECO:0000313|EMBL:KHD11286.1};
RX PubMed=27199933; DOI=10.3389/fmicb.2016.00603;
RA Flood B.E., Fliss P., Jones D.S., Dick G.J., Jain S., Kaster A.K.,
RA Winkel M., Mussmann M., Bailey J.;
RT "Single-Cell (Meta-)Genomics of a Dimorphic Candidatus Thiomargarita
RT nelsonii Reveals Genomic Plasticity.";
RL Front. Microbiol. 7:603-603(2016).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KHD11286.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JSZA02000380; KHD11286.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0A6PA32; -.
DR Proteomes; UP000030428; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0071704; P:organic substance metabolic process; IEA:InterPro.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR InterPro; IPR044060; Bacterial_rp_domain.
DR InterPro; IPR001547; Glyco_hydro_5.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR PANTHER; PTHR34142:SF1; CELLULASE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR34142; ENDO-BETA-1,4-GLUCANASE A; 1.
DR Pfam; PF00150; Cellulase; 1.
DR Pfam; PF18998; Flg_new_2; 3.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
PE 4: Predicted;
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00023295};
KW Reference proteome {ECO:0000313|Proteomes:UP000030428};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..704
FT /note="Glycoside hydrolase family 5 domain-containing
FT protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5007387798"
FT DOMAIN 36..298
FT /note="Glycoside hydrolase family 5"
FT /evidence="ECO:0000259|Pfam:PF00150"
FT DOMAIN 384..433
FT /note="Bacterial repeat"
FT /evidence="ECO:0000259|Pfam:PF18998"
FT DOMAIN 438..512
FT /note="Bacterial repeat"
FT /evidence="ECO:0000259|Pfam:PF18998"
FT DOMAIN 538..583
FT /note="Bacterial repeat"
FT /evidence="ECO:0000259|Pfam:PF18998"
SQ SEQUENCE 704 AA; 79258 MW; 9ACAA87AB466BF40 CRC64;
MKRLILSPLM AIIFCTATNV SANLAQFQVI NGQIIDPSDQ PFVVKGTNVF WETSHTAAML
IDCWNFNTIR LNHYPDPDLP RDDLNYRFDY IVASYADRGI VVIFDLAHDG EGEDVGIGHY
WRSRQDELVE LYAYYADRYR DNPYVWFDLI NEPGTLDFDS TAWVELHQKL IRAIRNTGNN
NPILVEGWAW GQDAGNWEST PVPEENSAIL SLGDQILNFD DKTYQNIIFS HHVYDQWKYA
DVSRLADYVD RVRAKGYALV VGEYGSHNGS TDTLDPTKWM FEVAIPRDVG RIVWTWVAAD
NNDLTINTQM LGGGDGIDSC TAPTNLTELG ELIWADNHSA PPPPPNEFST LTLTKSKDRG
QIRAKQIGKR WTINCNLNCD EKSYDYSTGS EIILQARPKK CFEWLGWSGD CSGTEREISV
TLSSDMICIA NFRGGSKLTV QTLGNGKVIS QMIDCGEICE AYYPNQKNVL LKAIPETGYS
FVEWDCEGKK RTQPKMMVKM NSDITCTASF AQAVELTLVT VGEGRVIITP KGSDCGVNCN
AYDPGKKLRL RAIPSAGFLF VSWGGDCSGT KNLTTVTMDS SKRCIVSFIA KNANVLEKAI
RQMVSQFYET FPNFDGNALA LQEAFQLALP VIMTLDRQCP IWPTSSFNDS YISGDYTASI
NIMPDHSVEI LFREDVMPPI RIYYDVLGEN DEFANIFRWD LLLW
//