ID R5Q9W5_9FIRM Unreviewed; 1669 AA.
AC R5Q9W5;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 40.
DE SubName: Full=F5/8 type C domain protein {ECO:0000313|EMBL:CCZ25378.1};
GN ORFNames=BN734_01242 {ECO:0000313|EMBL:CCZ25378.1};
OS [Ruminococcus] torques CAG:61.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae;
OC Mediterraneibacter.
OX NCBI_TaxID=1263108 {ECO:0000313|EMBL:CCZ25378.1, ECO:0000313|Proteomes:UP000017998};
RN [1] {ECO:0000313|EMBL:CCZ25378.1, ECO:0000313|Proteomes:UP000017998}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:61 {ECO:0000313|Proteomes:UP000017998};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 20 family.
CC {ECO:0000256|ARBA:ARBA00006285}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCZ25378.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAZS010000009; CCZ25378.1; -; Genomic_DNA.
DR Proteomes; UP000017998; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0004563; F:beta-N-acetylhexosaminidase activity; IEA:UniProtKB-EC.
DR GO; GO:0102148; F:N-acetyl-beta-D-galactosaminidase activity; IEA:UniProtKB-EC.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR CDD; cd06564; GH20_DspB_LnbB-like; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 1.20.1270.90; AF1782-like; 2.
DR Gene3D; 3.30.379.10; Chitobiase/beta-hexosaminidase domain 2-like; 1.
DR Gene3D; 1.20.1270.70; Designed single chain three-helix bundle; 2.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 3.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR InterPro; IPR025705; Beta_hexosaminidase_sua/sub.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR015883; Glyco_hydro_20_cat.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR InterPro; IPR029018; Hex-like_dom2.
DR InterPro; IPR015882; HEX_bac_N.
DR PANTHER; PTHR43678:SF1; BETA-N-ACETYLHEXOSAMINIDASE; 1.
DR PANTHER; PTHR43678; PUTATIVE (AFU_ORTHOLOGUE AFUA_2G00640)-RELATED; 1.
DR Pfam; PF00754; F5_F8_type_C; 3.
DR Pfam; PF07554; FIVAR; 3.
DR Pfam; PF00728; Glyco_hydro_20; 1.
DR Pfam; PF02838; Glyco_hydro_20b; 1.
DR PRINTS; PR00738; GLHYDRLASE20.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF55545; beta-N-acetylhexosaminidase-like domain; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 3.
DR PROSITE; PS50022; FA58C_3; 3.
PE 3: Inferred from homology;
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000017998};
KW Signal {ECO:0000256|SAM:SignalP}; Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..30
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 31..1669
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004401137"
FT TRANSMEM 1644..1663
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 26..183
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 186..343
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 1236..1386
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT REGION 1530..1553
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT ACT_SITE 747
FT /note="Proton donor"
FT /evidence="ECO:0000256|PIRSR:PIRSR625705-1"
SQ SEQUENCE 1669 AA; 184476 MW; BCE112BF66FA8BE3 CRC64;
MHNVKKSFKR LLAVSLCASC VLSNGMLSLA EGAEDKAVNL ALGCQATANA QYNQNGNDMS
ASKAVDGNDE TRWSSEGAAP GWLQVDLGEQ KSFTQFRILS EGGTGVTVGK QLIGKFKIEG
SNDNSKWTLI HQSEDKQAEG FPQDTVVTLE KPVSYRYVKL TVESLKTGAF DSVSIREFEI
RDKEETTPEK PQDPEENVAL KKTAAADSTE DNSLIAAKAF DGNTKDRSSR WSSAVADAPH
WIYVDLGKEM DVKTVCIFWE TRKATDYKIQ IANTAEAPAE SDWKDVKHVQ DRPKALKDAI
VLDKVEKARY VRLYINSFTK NDPDNEGASW NSISIYEMEV YGGEPKVDIE EGISVDTPKK
GDKKLTVHIP EETKTEKVTY NGTDYEQVVD ADLNLYQPVV DTTVKVSFKI ENKENGSYRF
KEIPVTVPGE YETKEGDNAA PDVLPEIREW KGNAGTFAPN AGSRIVIKDA ELQEMADAFA
KDYEAIMGQT LPVVTADSAN AGDFFFALTK EGKGLQEEGY LMTVDEKTAV EAETTTGAFW
ATRTILQSLK ANGNIPQGVA RDYPLYKVRG FMLDVGRKTF TMDWLEDTVK QMSWYKMNDF
QIHLNDNLIP LEHYSQIGED PMQAYSAFRL ESDIKEGGKD GLYKADLTSK DVFYTKDEFR
NLIQESRVYG VDIVPEIDTP AHSLALTKVR PDLRHGTYGR DNDHLALKEK YDESLEFVQS
IFNEYMGKDL SDPVFDKDTV VHVGADEYTA APEAYRKFAD DMLKYVQDSG RTPRIWGSLS
TIKGETSVRS EGVQMNLWNF GWANMDKMYE QGYDLINCND GNYYIVPNAG YYYDYLNEDT
LYNLAINSIG GVTIPAGDKQ MIGGAIAVWN DMTDYLENGV SEYDVYDRID NEIALFGAKL
WGKGNKDLSA AKEDYAALGT APRTNFTYET EKNEEGAAVH YPMDNMKDAS GSGQDLKEGK
NAAIESVDGR NALKLEGKES YVSTDLATAG LGNDLRVKVK RTTDGDEEQI LFESSYGTIK
AVQKETGKVG FTRENHDYSF NYKLPVNEWV ELEFKNEQNK TYLYVNGELR DVLGDDERVE
GRPLLATTMF PIERIGSTKN AFTGYVDDVR LGTNADFAST MPLDYAVLTA NQVIGKTENA
QLAQLVKEAE AIFAAYNPDA SAINDLAAEI KAVLDDSDYK EADYSRIETL KKTIPSDLSP
FTEESAAWLE YVLSQIRTGL PEEMQSTVDG YEKMLADALA GLTLVEERNV NYVDNAKLTA
TASSHQDNGS APDKALDGDT NTIWHSKWDI TTMPHWIDLE MEEPMAVDGL TYVPRQTGTN
GNVTKYEIQI SNDGTNYTKH AEGTLKNNAD TKVIDFNKVT TKHVRLVYLE AANNNGAAAE
LKLHQADVPA DIEGLTAVIT EAKAIKNEGF TKESWDALQN KIAEAEELAS AENADANDVE
IMKRELSKAM TSLILEDKVT SDPEPGKVDK SKLQELYNKY KGIKADGYTA ESWTAFAEAR
TEAETVLANE KATQEKVDKA AENLEKAFKS LKKEETKPDP DPTPDPDPGA ADVSGLKNLY
EAYKDIKSDG YTAESWAAFD KARAEAEKIL ANPNATQDDV NAAKAALEAA YKGLVPKTQP
NPTPGGNGGN VGSTAVVTGD SANIAGYLTV LLAAGGIAVV TFFRRKRVK
//