ID R5ZYY2_9BACT Unreviewed; 1161 AA.
AC R5ZYY2;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 26.
DE RecName: Full=F5/8 type C domain-containing protein {ECO:0000259|PROSITE:PS50022};
GN ORFNames=BN693_01940 {ECO:0000313|EMBL:CDA44738.1};
OS Prevotella sp. CAG:5226.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Prevotellaceae;
OC Prevotella.
OX NCBI_TaxID=1262930 {ECO:0000313|EMBL:CDA44738.1, ECO:0000313|Proteomes:UP000018184};
RN [1] {ECO:0000313|EMBL:CDA44738.1, ECO:0000313|Proteomes:UP000018184}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:5226 {ECO:0000313|Proteomes:UP000018184};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDA44738.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBBW010000175; CDA44738.1; -; Genomic_DNA.
DR AlphaFoldDB; R5ZYY2; -.
DR Proteomes; UP000018184; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:UniProt.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:UniProt.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.20.20.70; Aldolase class I; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR InterPro; IPR013785; Aldolase_TIM.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR Pfam; PF00754; F5_F8_type_C; 1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR PROSITE; PS50022; FA58C_3; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Reference proteome {ECO:0000313|Proteomes:UP000018184};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..1161
FT /note="F5/8 type C domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004411812"
FT DOMAIN 112..236
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
SQ SEQUENCE 1161 AA; 129181 MW; 6C592422F3633C19 CRC64;
MKQAVILSLL LAYAPSMAQA QVKVSQTATQ VTIENQYLSR TFNITGNRLT PGVLVNKRAA
GATFTPGQGS EEFALNPQAR QHTLVSRKGW KAEADSWCNE SATVGNPNLA IDGDNGTMWH
TWYATPTPGG QKGNDKLPHS LIISLGKRTA FRAFGYLPRQ GAYGAMSNGN IKGYEFYISN
DKKKWTLVKK GQFNYNSVQT IWVALDKEYK AKYVKLTETS STNGGAFGAC AEFYLTTDVV
KSAAAPATGL KASAMKVKGV HVASTGNGKR VTFDLAPTAY TNPVDGVTST WDIDMVVEMN
DNDHFMRKYL LVKAADEATR ALPIDYIEME NLGTQQVPAN SKWTRQQAAG GEGGMSAYTI
TLGQPVYVDG MFFGSEFPQA ENEIDDQGMY HTRYYSGKSL KTLDLNEHRV NANGQFRTWP
NVTGATRSAT DHNVIRTDFF KYINTIARPI KARMQYNSWY DWMMNITEDR INASFREMER
GFTQYGLRPM DSYVVDDGWN NYDQVGTAES GTTHNTTGFW EFNSKFPNGL KGASDIAHRY
GSGFGIWLGP RGGYNFNQQW GKFLEQHGNG TYSTTTYDVV TGDSVYVAKL RDFFIKQQRD
YGVNYWKLDG FATQQPQAST NGRYITGGKN GNYYFTEHWE RWYRNLDAMY ADAKSRNQDL
WINLTCYVNP SPWILQWSNS VWIQNSRDMW HAVVDGRERE MDQQLSYRDD RYWVFINSQQ
LQFPQAHIFN HDPVYGKTGA VAPNAMTDAE FRAYLYMMAT RGTAFWEMLY SYNLMNEGNK
WLINAEALNF VDSHYETLRN AIYFGQSPLK GGIYGYSCWR QGSNGAADGI VSFRNPSNSE
QTYTYTLDKS VGVPEQAAGL CASLVMEYTG NPEQQTAEAE AYAAAANAKG LAYGSTLSIK
LKPGEIRVLR FAADSAVAAQ PVMARANKQH EVTLVMDQPV CTDGATFELL AGGKRTGKPA
TAVTTDADYR TLHLTFGSTL GETKPYSIRI KGLKNWQGAA TAAESPQFYF APDSVLECIE
KPVTLNEAVR LGSHKAVVGK GDFSIDFTLT TTDADVQLAS QGEAWSLALK GGKLCFKAGH
VAYTTPVSVH DGKAHHIVCC RERNGMVKVY VDGQLQGTAY DKAYVNEPLQ PAPIVLGQAE
RTFTFDTFTL RDGALSFDEV R
//