ID F3BDJ6_9FIRM Unreviewed; 1071 AA.
AC F3BDJ6;
DT 28-JUN-2011, integrated into UniProtKB/TrEMBL.
DT 28-JUN-2011, sequence version 1.
DT 24-JAN-2024, entry version 51.
DE RecName: Full=F5/8 type C domain-containing protein {ECO:0000259|PROSITE:PS50022};
GN ORFNames=HMPREF9477_02029 {ECO:0000313|EMBL:EGG80283.1};
OS Lachnospiraceae bacterium 2_1_46FAA.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae.
OX NCBI_TaxID=742723 {ECO:0000313|EMBL:EGG80283.1, ECO:0000313|Proteomes:UP000018451};
RN [1] {ECO:0000313|EMBL:EGG80283.1, ECO:0000313|Proteomes:UP000018451}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=2_1_46FAA {ECO:0000313|EMBL:EGG80283.1,
RC ECO:0000313|Proteomes:UP000018451};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Daigneault M., Strauss J.,
RA Ambrose C., Allen-Vercoe E., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Alvarado L., Arachchi H.M.,
RA Berlin A.M., Chapman S.B., Dewar J., Goldberg J., Griggs A., Gujja S.,
RA Hansen M., Howarth C., Imamovic A., Larimer J., McCowan C., Murphy C.,
RA Neiman D., Pearson M., Priest M., Roberts A., Saif S., Shea T., Sisk P.,
RA Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Lachnospiraceae bacterium 2_1_46FAA.";
RL Submitted (DEC-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EGG80283.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADLB02000003; EGG80283.1; -; Genomic_DNA.
DR AlphaFoldDB; F3BDJ6; -.
DR STRING; 742723.HMPREF9477_02029; -.
DR eggNOG; COG3250; Bacteria.
DR HOGENOM; CLU_287559_0_0_9; -.
DR OrthoDB; 176168at2; -.
DR Proteomes; UP000018451; Unassembled WGS sequence.
DR GO; GO:0016798; F:hydrolase activity, acting on glycosyl bonds; IEA:UniProtKB-KW.
DR GO; GO:0008152; P:metabolic process; IEA:UniProtKB-KW.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 2.60.120.1060; NPCBM/NEW2 domain; 1.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR013222; Glyco_hyd_98_carb-bd.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR038637; NPCBM_sf.
DR InterPro; IPR036278; Sialidase_sf.
DR Pfam; PF00754; F5_F8_type_C; 1.
DR Pfam; PF08305; NPCBM; 1.
DR SMART; SM00776; NPCBM; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 2.
DR SUPFAM; SSF50939; Sialidases; 1.
DR PROSITE; PS50022; FA58C_3; 1.
PE 4: Predicted;
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00023295};
KW Reference proteome {ECO:0000313|Proteomes:UP000018451};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..29
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 30..1071
FT /note="F5/8 type C domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003291561"
FT DOMAIN 921..1014
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
SQ SEQUENCE 1071 AA; 122974 MW; D5447CB357826F5A CRC64;
MKKKKLMRLF AGLLVITMLA PNSIVSVYAE EPVQNEIQQN KNTKIQYLSD KQEVEKRVGY
GSFGKDRNTE MKQNGEGLQV KTRGEKVTFP KGIFAHAPST VIYDVSELKD KYPNFAGYLG
IDSRSGGGNG VIFKISVSDD KNSWEEIYNS GVVTINDAVY VNLSMKGKKY IKLEADSNGE
NSRDHVVYAD AGFVTNDYTP SENYNAPIKT VAEYDEELSK INFRDKNAVN NNTQTIYQRE
LVNQAGFYTI NKVYEMEDER GNKLYKDAIE YLLKDKEALY YYINGGPKPS QGTYENSLIA
FGKLYNAYKG ELDNNSENDL NIRLAVSVAS AYANPKTVRF WTENDNPNSG VEEVQKEDPV
RRYATYKKLS EQGNYMDKIS TLAKEPSRNM GNAIWSGKQF KELSVPMMRW VVDSRMHEDE
FEWLAQYITQ WASEEENKNK NFLDSYMYVH NKQGNWQYTD EKYYSQQKRA EWNKKYKFDD
FKSFDDNAKY GTKDLIRSWI VWEEGGVCGA YAKTYANLAE VAGRPSIVTG QPAHAAALTW
QWVSDGGPDH KGQYEWRIQN NAWSLRETSS EYEDYLLGWG NRRKMGNIDT NRNRASSYTL
LATDVIQDWD AYVMAKKYTL LANSLKDFSA KREVYYRALT ESPRYLDATY GMLDMYLNKP
DLTSAELNRF MKETASRYTY YPMVMADLLR EIELSGKLTD PVHMAELYME RQRALEEAYT
LRKDTKNLTA DDYEKTRQPW YAGDVARAIL EKDHSRIASF SFDGDNAGKI VLTKNLTEKG
LKMKYSLDGG KTWKTSDKDV ISLTNDELKS ITVENDIQVT VEGATEDMYG RLPICVIDIM
EQEAPKNIEA NDKEDLLLGD VSNLEYSEDG GNTWKPYEQN GLKNQNRFTG EKKVTVRRGS
HGQYVASKTA EYQFKNADDK NEEKYLQLQY VSMHEFSSQQ NEGNQAAKNL IDGQRNTNWH
SKWDVQDKKE YSVKFDKARR ISKLEYVPSV GGSNGRWEEV EIYGSNDGKT WTKIGKSGQL
ANDVNAKEIK VDSSKAWQYI KVKGLHSYSH DGNKDKYFSG SMLNFFEDTT K
//