ID V8BNC4_9FIRM Unreviewed; 1347 AA.
AC V8BNC4;
DT 19-FEB-2014, integrated into UniProtKB/TrEMBL.
DT 19-FEB-2014, sequence version 1.
DT 24-JAN-2024, entry version 42.
DE RecName: Full=F5/8 type C domain-containing protein {ECO:0000259|PROSITE:PS50022};
GN ORFNames=HMPREF1202_02479 {ECO:0000313|EMBL:ETD16634.1};
OS [Ruminococcus] lactaris CC59_002D.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae;
OC Mediterraneibacter.
OX NCBI_TaxID=1073376 {ECO:0000313|EMBL:ETD16634.1, ECO:0000313|Proteomes:UP000018683};
RN [1] {ECO:0000313|EMBL:ETD16634.1, ECO:0000313|Proteomes:UP000018683}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CC59_002D {ECO:0000313|EMBL:ETD16634.1,
RC ECO:0000313|Proteomes:UP000018683};
RG The Broad Institute Genomics Platform;
RA Earl A., Allen-Vercoe E., Daigneault M., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Abouelleil A., Alvarado L., Chapman S.B., Gainer-Dewar J.,
RA Goldberg J., Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A.,
RA Ireland A., Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W.,
RA Priest M., Roberts A., Saif S., Shea T., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Ruminococcus lactaris CC59_002D.";
RL Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 20 family.
CC {ECO:0000256|ARBA:ARBA00006285}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ETD16634.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AZJE01000035; ETD16634.1; -; Genomic_DNA.
DR RefSeq; WP_023923060.1; NZ_KI669410.1.
DR STRING; 1073376.HMPREF1202_02479; -.
DR PATRIC; fig|1073376.3.peg.2541; -.
DR HOGENOM; CLU_002275_2_0_9; -.
DR OrthoDB; 9763537at2; -.
DR Proteomes; UP000018683; Unassembled WGS sequence.
DR GO; GO:0004563; F:beta-N-acetylhexosaminidase activity; IEA:UniProtKB-EC.
DR GO; GO:0102148; F:N-acetyl-beta-D-galactosaminidase activity; IEA:UniProtKB-EC.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR CDD; cd06564; GH20_DspB_LnbB-like; 1.
DR Gene3D; 1.20.1270.90; AF1782-like; 2.
DR Gene3D; 3.30.379.10; Chitobiase/beta-hexosaminidase domain 2-like; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR InterPro; IPR025705; Beta_hexosaminidase_sua/sub.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR015883; Glyco_hydro_20_cat.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR InterPro; IPR029018; Hex-like_dom2.
DR InterPro; IPR015882; HEX_bac_N.
DR PANTHER; PTHR43678:SF1; BETA-N-ACETYLHEXOSAMINIDASE; 1.
DR PANTHER; PTHR43678; PUTATIVE (AFU_ORTHOLOGUE AFUA_2G00640)-RELATED; 1.
DR Pfam; PF00754; F5_F8_type_C; 2.
DR Pfam; PF07554; FIVAR; 2.
DR Pfam; PF00728; Glyco_hydro_20; 1.
DR Pfam; PF02838; Glyco_hydro_20b; 1.
DR PRINTS; PR00738; GLHYDRLASE20.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF55545; beta-N-acetylhexosaminidase-like domain; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 2.
DR PROSITE; PS50022; FA58C_3; 2.
PE 3: Inferred from homology;
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..31
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 32..1347
FT /note="F5/8 type C domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004766729"
FT DOMAIN 25..177
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 968..1125
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT REGION 45..66
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1260..1320
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1260..1295
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT ACT_SITE 587
FT /note="Proton donor"
FT /evidence="ECO:0000256|PIRSR:PIRSR625705-1"
SQ SEQUENCE 1347 AA; 148334 MW; 7BB51A0F346C8DBF CRC64;
MKKKKDMKKP LLAAVLACSM VQIPAIPVAA AAPENLALNQ TVTASSYEQP TNNPEKTSPS
KAVDGDLTTR WGTAQNLAAN EWINVNLGSS KQIQQININF ERTDDAQNIL GYKVEIANGN
DTYTEIYRKE EKAKQKEVIK LTSPVSATDV KVTILSADAG TINWVNVGIN EIEIYGSINN
DYSLSDVAGM ITGGTTIAAD VADFPMPTVP EGFNIRINGA DFEQIISRDG KIVHPLTDKT
VKVSYVITKE ETGEELVTDD FTYTIAGQHT TNAAKNTKPS IIPEIAEWYS DSTDSVATDS
ITAVTYDNNA LEDVVDEFIA DYKDFSGIEL QKVKGNAKAN AFNFSLAAPD ELLGDEGYTM
NILSDRINVA SESTVGNMYG MQSILQMYKS NPDAFSIGTM RDYPRFSTRG FVLDAARKPV
SMDMLKEISR TMRYYKMNDF HVHLSDNYIW LEDYGKLDQE NQAFDAYDAF RLESGLTNDA
GESPTAEDYS FSKKEFKEFI QTERAVGVNI VPEIDVPAHA NSFTKVWPEL KVTNKTSSTS
ANRPLIDHLD ISKTETVQKI EEIFDDYTHG SDATFDSETT VHIGADEFLD NYSAYRNFIN
TIVPYVKQTN TVRMWGGLTW IKDNPVTQIN ADAIDGVEIN LWSKDWADGI EMYNMGYDLI
NTIDDYGYMV PDGSKTRANS YGDLLNTSRI FSSFEPNKLR SSSGYVAVPS GDDQMLGAAF
ALWNDNIDKR ASGLSESDLY WRFFDALPFY AEKTWAATGQ EKGSADALNT LATSMGTGPN
TNPYYQESSV DNIYESYDFN SGKGLDDTSE NDRDLTLASD STAKVQNKAL VLSGDKSYVE
TPIEQLGNGN ALSFDITLSS EPESGDILFE SDAAYGTHDI RIMSNGKLGF TRELYDYYFD
YTLPVGEKVN IKIVTQQQVT KLYVNGQFVS NATGEFNHNG TTKKSGITNA TFALPLQRIG
SETNAVQATI DNVNVTAADM YNKSKWTGTT NSETTYNNVE GLLRYAFDND YTSRWHSNWK
GATDKLTGSN SFYAEINFGQ KYTINQFSFT PRTDTASGYV TKADLYIKAN DSDEWTLVAQ
DQTFAADAAQ KTFTFDAQEV QYVKFVAKAS SDGWVAVSEF DIANTAQDPD QPEADKTALQ
ELVNQAVTDF TGYTSESVTA YKVALDNAKE VLTNEDATQA EVDEAIAALT NAVLVTDKSK
LQEKIDQAVT DFDGYTADSV AAYKAALARL NEVLADTDAT PAQVSEAIAA FDLARLVKDT
SDDGKKDDDK KDDDKKNDDK KDDNKKDDSK KEDGTNDNSS STNGKADKKD TASKAAKTGD
ASTALPYMLL MLASGSAVVI LKKKKEQ
//