ID A0A1G0ZS43_9BACT Unreviewed; 1319 AA.
AC A0A1G0ZS43;
DT 15-FEB-2017, integrated into UniProtKB/TrEMBL.
DT 15-FEB-2017, sequence version 1.
DT 27-MAR-2024, entry version 14.
DE RecName: Full=Carbohydrate-binding domain-containing protein {ECO:0000259|Pfam:PF06452};
GN ORFNames=A2X45_14785 {ECO:0000313|EMBL:OGV61060.1};
OS Lentisphaerae bacterium GWF2_50_93.
OC Bacteria; Lentisphaerota.
OX NCBI_TaxID=1798574 {ECO:0000313|EMBL:OGV61060.1, ECO:0000313|Proteomes:UP000178497};
RN [1] {ECO:0000313|EMBL:OGV61060.1, ECO:0000313|Proteomes:UP000178497}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=27774985; DOI=10.1038/ncomms13219;
RA Anantharaman K., Brown C.T., Hug L.A., Sharon I., Castelle C.J.,
RA Probst A.J., Thomas B.C., Singh A., Wilkins M.J., Karaoz U., Brodie E.L.,
RA Williams K.H., Hubbard S.S., Banfield J.F.;
RT "Thousands of microbial genomes shed light on interconnected biogeochemical
RT processes in an aquifer system.";
RL Nat. Commun. 7:13219-13219(2016).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OGV61060.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MHBI01000001; OGV61060.1; -; Genomic_DNA.
DR STRING; 1798574.A2X45_14785; -.
DR Proteomes; UP000178497; Unassembled WGS sequence.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0016052; P:carbohydrate catabolic process; IEA:InterPro.
DR Gene3D; 2.60.40.1190; -; 2.
DR Gene3D; 2.60.40.4070; -; 1.
DR Gene3D; 2.120.10.30; TolB, C-terminal domain; 1.
DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR InterPro; IPR010502; Carb-bd_dom_fam9.
DR InterPro; IPR011044; Quino_amine_DH_bsu.
DR PANTHER; PTHR24104; E3 UBIQUITIN-PROTEIN LIGASE NHLRC1-RELATED; 1.
DR PANTHER; PTHR24104:SF25; GLYCOPROTEIN, PUTATIVE-RELATED; 1.
DR Pfam; PF06452; CBM9_1; 1.
DR SUPFAM; SSF49344; CBD9-like; 2.
DR SUPFAM; SSF50969; YVTN repeat-like/Quinoprotein amine dehydrogenase; 1.
PE 4: Predicted;
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..1319
FT /note="Carbohydrate-binding domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5009571507"
FT DOMAIN 1126..1316
FT /note="Carbohydrate-binding"
FT /evidence="ECO:0000259|Pfam:PF06452"
SQ SEQUENCE 1319 AA; 143537 MW; 269B01A637193771 CRC64;
MKGIMIALTA AMFGANFAAC AGENTIGRQS QNEGITAVPA PGKVTVDGDL SEWDWSGRIS
VFAEYSMRNR YSVDAAAMWD KDYLYLGARW KDPMPLNSTI DPAINPDEGW KADSWQMRIH
SDRNLWITVW QFSTKKQSVM HFAYWKNDKS ERDGTDVKML VSPEGSADLG EDAQMAYKMD
ADNKGFSQEI RIPWKLIYRT VPEIKAGNVM RLGCEFLWGD STGRTWPIHR YADNMQPGKT
SREFYWSAVN SWGNLTLSEK GNIPLREYVV ETDKIAGTVP VRATIPKDVA RFTMVIDDKD
GKRIRTLAAD RDPADYAVKG ADKDGMQTVE VKWDCLNDFG KLVEPGTFKV RGLTQKGISA
EYEMCFYNPG TPPWDTKDGR GAWGADHSAP VGIAAAGDWT MLGFEVPEGG SGVIGIDPTG
QKRWGDRRGT LKLAGDDKYA YAYVTHWYTK ETICRYDLKT GKTAAFVKDG KERTYDLTLK
EILGEENPGK LTGISAQGGK LAVAVDSGKI FILDAASAAV LQKFNATNPG DLAFSRDGKL
YGIVDGKVSA IDTTSGAAAP IAISGAGKIS GLAVDNDGNI LVADVGPDSQ VKAFSTDGKL
VYTCGKKGGR ALRGIYDEQG MVKMSSIAVD AKGQIWAVES WSFPRRVSVW GKDGKLIRDY
LGNTGYAGTG CFLHDQDPTL GYCGPLEFKL DKKAGTWKLN KILWVPDREN GECFTVSTGS
HVGPQRFTAN IAGKNREYLY LHDPGSEGVG QVLFMETEAG WKPVSAICNA GQALGDLAHA
GQVVKQPGGE LAGLNAYDMI IWNDGNGDGK IQRAECEVIP AKRPGDEKKG GESAFRTGNG
WGGRIDTKTL TIYVDGLTAI KPVEIKADGA PVYASKAFVK LPIAENGDLV PVPEENRLLC
LSFKGYAGPT SGMLGIDLAK NTVDWAYPNP YPGVHGSHNA TMPKPGLVIG PLKICGVANA
GEQAGNVFLL RGNLGQDFIF TTDGLYVGAM FQDGRLPCDT LPDKESMLKG MPLDNFTEGG
EPFNGWFGGQ SDGKIRLTTG WPRQAAMILL INGLDSIRKF SGTEVSVDMP TIVKADQENI
ARTSKKAEAC VYAIKPMPKA PNLDGTLKDW NDIPAIDVTR EGLPDRAKVK MAYDANNLYI
SYEVNDSTPW KNEGKDIARL FKTGDSVDLQ LSVNANAKKH AQPEAGDIRV VFAQLQNKPV
AVLMAPVDKK AAPEAKKNYT SPVGTKIFDR VEAIADANVK IKAEGSRYVV EAALPLKSIG
LEPKPGMKIR GDVGFISSDA NGMINTARTY WSNKHTNLVN DEPQESWLFP ETWGEFSFE
//