ID R6QB43_9FIRM Unreviewed; 1503 AA.
AC R6QB43;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 39.
DE SubName: Full=Alpha-galactosidase {ECO:0000313|EMBL:CDC19102.1};
GN ORFNames=BN582_00077 {ECO:0000313|EMBL:CDC19102.1};
OS Eubacterium sp. CAG:274.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Eubacteriaceae;
OC Eubacterium.
OX NCBI_TaxID=1262888 {ECO:0000313|EMBL:CDC19102.1, ECO:0000313|Proteomes:UP000017904};
RN [1] {ECO:0000313|EMBL:CDC19102.1, ECO:0000313|Proteomes:UP000017904}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:274 {ECO:0000313|Proteomes:UP000017904};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDC19102.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBEX010000022; CDC19102.1; -; Genomic_DNA.
DR STRING; 1262888.BN582_00077; -.
DR Proteomes; UP000017904; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0000272; P:polysaccharide catabolic process; IEA:InterPro.
DR CDD; cd14256; Dockerin_I; 1.
DR Gene3D; 2.60.40.2700; -; 3.
DR Gene3D; 1.10.1330.10; Dockerin domain; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR Gene3D; 2.60.120.1060; NPCBM/NEW2 domain; 1.
DR InterPro; IPR002105; Dockerin_1_rpt.
DR InterPro; IPR016134; Dockerin_dom.
DR InterPro; IPR036439; Dockerin_dom_sf.
DR InterPro; IPR018247; EF_Hand_1_Ca_BS.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR013222; Glyco_hyd_98_carb-bd.
DR InterPro; IPR038637; NPCBM_sf.
DR Pfam; PF00404; Dockerin_1; 1.
DR Pfam; PF00754; F5_F8_type_C; 2.
DR Pfam; PF08305; NPCBM; 1.
DR SMART; SM00776; NPCBM; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 3.
DR SUPFAM; SSF63446; Type I dockerin domain; 1.
DR PROSITE; PS51766; DOCKERIN; 1.
DR PROSITE; PS00018; EF_HAND_1; 2.
DR PROSITE; PS50022; FA58C_3; 1.
PE 4: Predicted;
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00023295};
KW Reference proteome {ECO:0000313|Proteomes:UP000017904};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..1503
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004428775"
FT DOMAIN 1273..1424
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 1425..1493
FT /note="Dockerin"
FT /evidence="ECO:0000259|PROSITE:PS51766"
SQ SEQUENCE 1503 AA; 167562 MW; 1C9F90F09CF34335 CRC64;
MNLKKTIALG LSTLMTLQIT PLNIFADETS QELTVSSISA SDTKMSSDYW QNHRVSTNSD
DYTYISAYSN NAGQYSTSYL KYAFDGNWST HWETGNGYGT VTNYVDVTFN KTTKIDRILY
ATRQDGAKGK GYPTTATIYS YNSETDTWDK VAVGSSSVSN SYVLITLPQA TDFEKLRFEF
TVANNGWASA SEFVFLRPDD TVLDGNVTIS GTSAIGKTLT ANDNLTVGNS KNLGYQWQYS
DDGTVWTDIE NATEKNYTIT EQKAYYRVKI YDKTNEYYGN LYSEAYNGSM VAKITGSLKV
GCVVTAETEY VNPEDTLYYK WQYSDDGITF IDIPSATSLK YTVPASMSNK YIRLGISGNN
KDYIYSDNSF VSVSAVLSGP CQVNSEILAQ LKGAENTEYT YQWQICDTAD GEFTDISSAT
SAKYTPTESQ LNKYIRVLLT ITETGELLYS QPRLVSSSGT YPDRTGDYMY ISDVPSSALI
SSSVGYSTLK YDTNVEDGII ALKVNGQKKQ FLKGLGAHAP ATLVYDISYL VDDYGYDTFT
GYLGVDYSKG SNGNGVIFNI YTSQDNSTWT QVKNTGVLKG DSECQYVTID LKNAKYLKFD
ISSNGDKGYD HSVIANGKFI SSDYKEPSAD NFKFIKTVSQ YESELAGLSD YESETYRKIL
YQKNFVERTG YDFLITLASE NENNREALEW FLDDFEALNY YANAGNTILG SYEKAAMVLT
QLYKNYSLDM SNSLYKRMFV ALMFSHAGSI YFWADSTKTS NPVRRYAIYK KLYENGCLVN
SVFENLTVPE MRWVMNSISD DNQIEWLNYY VRKKSAAKAG VAFDELEDNQ LILNSYSYMG
YITGYNYYLD KYYTDENKAS WTEKYHLVNT DDDTNDDIYD INIPFESGHP KLWIVFEEGA
VCGGISKVGV NLLTAFGIPS VVIGQPGHAA YLKYELDDPS KGENALAKWS IWNDVYGWTK
SERGDTLLLG WGAQSWAKGY RVSYVPLAQA ALNDYDNFQK AEDLVKIAQM ADSDKAIELY
RQALEIQNFN LDAWDGLITA YKSAEKTSDD YMELAKEIIS ALTYFPQPMN DMLQNAIKPE
ISSEADIADF NIMTLNALKS ASVATSEQSL QPDICKTMAD YILGNENLAI ATFSFDGENA
NTIVLSDQFT DGGTEFLYSI DGGENWVNAG TNNSHKLTDE EISKITADND LLVKLQGASS
YYTIDIKAGS APSGLYNNDN ENKVIGATSK YQWSEDGKTW TNFTDDTVFE GDRTVSVRIG
ANGTTLVGSS SLCTFTTDTD TADRSYISIS NVKVLAYSSA QSDNESASKS IDGNINTIWH
TTYTTNSDLN RFIAYEFNKP VLLTSIDYTP RQTGNFNGVF TKCSVYTSKD GTNWTKAGTA
TWASDRTKKT VNLDTPVYTK YVKVVGDEAG ANFGSAAMIE FYERLNSDNY DINKDNSVDN
KDVALLLKYV MGVNLSSDIS FENADFNGDG NIDMLDVITL KNYTDNNTVE TTTETTETTT
EQQ
//