ID F3ATR1_9FIRM Unreviewed; 1669 AA.
AC F3ATR1;
DT 28-JUN-2011, integrated into UniProtKB/TrEMBL.
DT 28-JUN-2011, sequence version 1.
DT 24-JAN-2024, entry version 58.
DE RecName: Full=F5/8 type C domain-containing protein {ECO:0000259|PROSITE:PS50022};
GN ORFNames=HMPREF1025_01115 {ECO:0000313|EMBL:EGG87078.1};
OS Lachnospiraceae bacterium 3_1_46FAA.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae.
OX NCBI_TaxID=665950 {ECO:0000313|EMBL:EGG87078.1, ECO:0000313|Proteomes:UP000005376};
RN [1] {ECO:0000313|EMBL:EGG87078.1, ECO:0000313|Proteomes:UP000005376}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=3_1_46FAA {ECO:0000313|EMBL:EGG87078.1,
RC ECO:0000313|Proteomes:UP000005376};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Daigneault M., Strauss J.,
RA Ambrose C., Allen-Vercoe E., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Alvarado L., Arachchi H.M.,
RA Berlin A., Brown A., Chapman S.B., Chen Z., Dunbar C., Freedman E.,
RA Gearin G., Gellesch M., Goldberg J., Griggs A., Gujja S., Heilman E.R.,
RA Heiman D., Howarth C., Larson L., Lui A., MacDonald P.J.P., Mehta T.,
RA Montmayeur A., Murphy C., Neiman D., Pearson M., Priest M., Roberts A.,
RA Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., White J.,
RA Yandava C., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Lachnospiraceae bacterium 3_1_46FAA.";
RL Submitted (MAR-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 20 family.
CC {ECO:0000256|ARBA:ARBA00006285}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EGG87078.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACWP01000026; EGG87078.1; -; Genomic_DNA.
DR eggNOG; COG1196; Bacteria.
DR eggNOG; COG2273; Bacteria.
DR eggNOG; COG3250; Bacteria.
DR eggNOG; COG3525; Bacteria.
DR HOGENOM; CLU_002275_1_0_9; -.
DR OrthoDB; 9816455at2; -.
DR Proteomes; UP000005376; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0004563; F:beta-N-acetylhexosaminidase activity; IEA:UniProtKB-EC.
DR GO; GO:0102148; F:N-acetyl-beta-D-galactosaminidase activity; IEA:UniProtKB-EC.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR CDD; cd06564; GH20_DspB_LnbB-like; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 1.20.1270.90; AF1782-like; 2.
DR Gene3D; 3.30.379.10; Chitobiase/beta-hexosaminidase domain 2-like; 1.
DR Gene3D; 1.20.1270.70; Designed single chain three-helix bundle; 2.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 3.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR InterPro; IPR025705; Beta_hexosaminidase_sua/sub.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR015883; Glyco_hydro_20_cat.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR InterPro; IPR029018; Hex-like_dom2.
DR InterPro; IPR015882; HEX_bac_N.
DR PANTHER; PTHR43678:SF1; BETA-N-ACETYLHEXOSAMINIDASE; 1.
DR PANTHER; PTHR43678; PUTATIVE (AFU_ORTHOLOGUE AFUA_2G00640)-RELATED; 1.
DR Pfam; PF00754; F5_F8_type_C; 3.
DR Pfam; PF07554; FIVAR; 3.
DR Pfam; PF00728; Glyco_hydro_20; 1.
DR Pfam; PF02838; Glyco_hydro_20b; 1.
DR PRINTS; PR00738; GLHYDRLASE20.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF55545; beta-N-acetylhexosaminidase-like domain; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 3.
DR PROSITE; PS50022; FA58C_3; 3.
PE 3: Inferred from homology;
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Membrane {ECO:0000256|SAM:Phobius}; Signal {ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..30
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 31..1669
FT /note="F5/8 type C domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003295728"
FT TRANSMEM 1644..1663
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 26..183
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 186..343
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 1236..1386
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT REGION 1530..1553
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT ACT_SITE 747
FT /note="Proton donor"
FT /evidence="ECO:0000256|PIRSR:PIRSR625705-1"
SQ SEQUENCE 1669 AA; 184458 MW; A120FE6E37AF9BA6 CRC64;
MHNVKKSFKR LLAVSLCASC VLSNGMLSLA EGAEDKAVNL ALGCQATANA QYNQNGNDMS
ASKAVDGNDE TRWSSEGAAP GWLQVDLGEQ KSFTQFRILS EGGTGVTVGK QLIGKFKIEG
SNDNSKWTLI HQSEDKQAEG FPQDTVVTLE KPVSYRYVKL TVESLKTGAF DSVSIREFEI
RDKEETTPEK PQDPEENVAL KKTAAADSTE DNSLIAAKAF DGNTKDRSSR WSSAVADAPH
WIYVDLGKEM DVKTVCIFWE TRKATDYKIQ IANTAEAPAE SDWKDVKHVQ DRPKALKDAI
VLDKVEKARY VRLYINSFTK NDPDNEGASW NSISIYEMEV YGGEPKVDIE EGISVDTPKK
GDKKLTVHIP EETKTEKVTY NGTDYEQVVD ADLNLYQPVV DTTVKVSFKI ENKENGSYRF
KEIPVTVPGE YETKEGDNAA PDVLPEIREW KGNAGTFAPN AGSRIVIKDA ELQEMADAFA
KDYEAIMGQT LPVVTADSAN AGDFFFALTK EGKGLQEEGY LMTVDEKTAV EAETTTGAFW
ATRTILQSLK ANGNIPQGVA RDYPLYKVRG FILDVGRKTF TMDWLEDTVK QMSWYKMNDF
QIHLNDNLIP LEHYSQIGED PMQAYSAFRL ESDIKEGGKD GLYKADLTSK DVFYTKDEFR
NLIQESRVYG VDIVPEIDTP AHSLALTKVR PDLRHGTYGR DNDHLALKEK YDESLEFVQS
IFNEYMGKDL SDPVFDKDTV VHVGADEYTA APEAYRKFAD DMLKYVQDSG RTPRIWGSLS
TIKGETSVRS EGVQMNLWNF GWANMDKMYE QGYDLINCND GNYYIVPNAG YYYDYLNEDT
LYNLAINSIG GVTIPAGDKQ MIGGAIAVWN DMTDYLENGV SEYDVYDRID NEIALFGAKL
WGKGNKDLSA AKEDYAALGT APRTNFTYET EKNEEGAAVH YPMDNMKDAS GSGQDLKEGK
NAAIESVDGR NALKLEGKES YVSTDLATAG LGNDLRVKVK RTTDGDEEQI LFESSYGTIK
AVQKETGKVG FTRENHDYSF NYKLPVNEWV ELEFKNEQNK TYLYVNGELR DVLGDDERVE
GRPLLATTMF PIERIGSTKN AFTGYVDDVR LGTNADFAST MPLDYAVLTA NQVIGKTENA
QLAQLVKEAE AIFAAYNPDA SAINDLAAEI KAVLDDSDYK EADYSRIETL KKTIPSDLSP
FTEESAAWLE YVLSQIRTGL PEEMQSTVDG YEKMLADALA GLTLVEERNV NYVDNAKLTA
TASSHQDNGS APDKALDGDT NTIWHSKWDI TTMPHWIDLE MEEPMAVDGL TYVPRQTGTN
GNVTKYEIQI SNDGTNYTKH AEGTLKNNAD TKVIDFNKVT TKHVRLVYLE AANNNGAAAE
LKLHQADVPA DIEGLTAVIT EAKAIKNEGF TKESWDALQN KIAEAEELAS AENADANDVE
IMKRELSKAM TSLILEDKVT SDPEPGKVDK SKLQELYNKY KGIKADGYTA ESWTAFAEAR
TEAETVLANE KATQEKVDKA AENLEKAFKS LKKEETKPDP DPTPDPDPGA ADVSGLKNLY
EAYKDIKSDG YTAESWAAFD KARAEAEKIL ANPNATQDDV NAAKAALEAA YKGLVPKTQP
NPTPGGNGGN VGSTAVVTGD SANIAGYLTV LLAAGGIAVV TFFRRKRVK
//