ID V8CFA6_9FIRM Unreviewed; 1655 AA.
AC V8CFA6;
DT 19-FEB-2014, integrated into UniProtKB/TrEMBL.
DT 19-FEB-2014, sequence version 1.
DT 24-JAN-2024, entry version 40.
DE RecName: Full=F5/8 type C domain-containing protein {ECO:0000259|PROSITE:PS50022};
GN ORFNames=HMPREF1202_00139 {ECO:0000313|EMBL:ETD26088.1};
OS [Ruminococcus] lactaris CC59_002D.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae;
OC Mediterraneibacter.
OX NCBI_TaxID=1073376 {ECO:0000313|EMBL:ETD26088.1, ECO:0000313|Proteomes:UP000018683};
RN [1] {ECO:0000313|EMBL:ETD26088.1, ECO:0000313|Proteomes:UP000018683}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CC59_002D {ECO:0000313|EMBL:ETD26088.1,
RC ECO:0000313|Proteomes:UP000018683};
RG The Broad Institute Genomics Platform;
RA Earl A., Allen-Vercoe E., Daigneault M., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Abouelleil A., Alvarado L., Chapman S.B., Gainer-Dewar J.,
RA Goldberg J., Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A.,
RA Ireland A., Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W.,
RA Priest M., Roberts A., Saif S., Shea T., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Ruminococcus lactaris CC59_002D.";
RL Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ETD26088.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AZJE01000003; ETD26088.1; -; Genomic_DNA.
DR RefSeq; WP_023920464.1; NZ_KI669407.1.
DR STRING; 1073376.HMPREF1202_00139; -.
DR PATRIC; fig|1073376.3.peg.136; -.
DR HOGENOM; CLU_242130_0_0_9; -.
DR OrthoDB; 179563at2; -.
DR Proteomes; UP000018683; Unassembled WGS sequence.
DR GO; GO:0016798; F:hydrolase activity, acting on glycosyl bonds; IEA:UniProtKB-KW.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:UniProt.
DR Gene3D; 3.30.379.10; Chitobiase/beta-hexosaminidase domain 2-like; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 4.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR Gene3D; 1.20.120.670; N-acetyl-b-d-glucoasminidase; 1.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR029018; Hex-like_dom2.
DR InterPro; IPR007781; NAGLU.
DR InterPro; IPR024732; NAGLU_C.
DR InterPro; IPR024240; NAGLU_N.
DR InterPro; IPR024733; NAGLU_tim-barrel.
DR PANTHER; PTHR12872; ALPHA-N-ACETYLGLUCOSAMINIDASE; 1.
DR PANTHER; PTHR12872:SF1; ALPHA-N-ACETYLGLUCOSAMINIDASE; 1.
DR Pfam; PF00754; F5_F8_type_C; 2.
DR Pfam; PF05089; NAGLU; 1.
DR Pfam; PF12972; NAGLU_C; 1.
DR Pfam; PF12971; NAGLU_N; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 4.
DR PROSITE; PS50022; FA58C_3; 2.
PE 4: Predicted;
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..32
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 33..1655
FT /note="F5/8 type C domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004767867"
FT DOMAIN 327..477
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 1228..1384
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
SQ SEQUENCE 1655 AA; 182973 MW; 58B8384E209003A9 CRC64;
MKNAKFAGVQ RLLSAVFAAT VALTSVQIPV YATPQTQAAE QTSQLVNIAG DCEITVPSSQ
SGYGAERMTD GDSSTMWIQN DGTWPSTVSL KLPADNTKRI KKVVLKFESG QSAWGVDVTL
SHALNNVTSD LVIDNTATVT SFDDGYEFTY DTALSFTHTF IELSNPTNNG AAGGFWPALA
EVEIWAENES GEESLTNVAP AATITSVGGD YGTKSNLTDE DYSSLYVFNG GGMSTLPDGA
WIEMELDREY PVKSMEAAFE HLSSDENNFQ FTFDIYGKSS TDTEWQTLFA GVNATRLDDG
YLQTLTLDSI KNLKSVRIVI TSITNTAGDP WPALAEFKIF ADTSGSGSED TESIAYKKPV
HTNAGGVVSR INDGSTINTW TGERYPAYVD IDLEANYKLD EIQVYTPSAG YSQYSVYTSM
DGRDFEKLAE KSDKENCPAK GESYQANKKE ARIVRVYVEY QSESSKALIN EIRVLGTPSG
TAVQETPAVQ VEDFKNSAYN VTVTYQDTIN EVKGIIERRI GAAYKDWFTF ELADAANGYD
YYDLSQSNGK IHIKGNNGVS LATGLNYYLK YYCNVNISQV GDQVTMPKSI IPVEGTVHKE
TKFPVRYSYN YCTLSYSMAF WGEEEWRNEL DWLALNGVNV VLDATAQEEV WRRFLTELGY
THQEAKDFIA GPAYYAWAYM ANLSGYGGPV HDTWFTERTE LARKNQLIMR KLGMQPVLQG
YSGMVPVDIT SKDPSAEVIK QGTWCSFQRP SMLRTDSESF TKYAALFYKV QKEVYGDSAH
YYATDPFHEG GNTGGMDSAV ISQKVLASMM TADPHATWVI QSWQGNPTTA LLQGLGDNRN
HALVLDLYAE KTPHWNETNP GYYGGAEGGG EFLNTPWVYC MLNNFGGRLG LHGHIDNYVE
GIVNASKQAE HMAGIGITPE ASVNNPVLYD LFFETIWADD GNNLQKINLD EWFKNYVTRR
YGADSDSAYQ AMEILHDTVY NPAYNMKGQG APESVVNARP GLDIGAASTW GNAVVDYDKK
KLEKAAELLL ADYDKLKNSA GYQYDLANVL EQVLSNTAQE YQKKMAAAFR SGDAEEFSTL
SDKFLSIIDM VEKVTGTQKE FLVGTWINGA KKLAENSDDF TKELYELNAR SLITTWGSYD
QAISGGLIDY SNRQWAGLTN DYYKMRWEKW ITERKKELAG ESYTNYSAQD WFEMEWAWAR
GTNKYSGTPN GLDLQGLGTD VLANYSLTNM PKDPAEDDSR DLPLEGMTAT AGSEQATTGS
EGPASAVLDQ TTGTIWHSKW SGDARENLWI DIALGESKTV TGLRMLPRSG GGNGTITSYR
IEISNDHGKT YQEVATGTWN SSDSWKMAEF HAIQATNVRL YAVESVSDTS NIFASAAEIR
IMGPATAIVP AEETIVNIAT PSKEADLSSA QAAKETDKYT VSTVWKDATG TTVTAISKDK
NATHDYTAKI TLTPVTGYSF DKTSVPDTLT LKLNDQRTVE AIPVTDSVLN DDGTVTITYQ
FSNMFQGGSL RMDQSSPEKS TNMRFGYDFK LPEASSEKDE IRFKGCTWYY GVAEDDLKNT
FSPDKTNFIT NPDKKGAEYY RSNIVFTNLS SGAYKRSVYA RILVKYTVNG KERSVMGTFV
DSRSVSMIVE GILANTNADQ TEKDYAQKIK DAILK
//