ID C0EGH0_9FIRM Unreviewed; 1657 AA.
AC C0EGH0;
DT 05-MAY-2009, integrated into UniProtKB/TrEMBL.
DT 05-MAY-2009, sequence version 1.
DT 27-MAR-2024, entry version 65.
DE SubName: Full=F5/8 type C domain protein {ECO:0000313|EMBL:EEG29446.1};
GN ORFNames=CLOSTMETH_02963 {ECO:0000313|EMBL:EEG29446.1};
OS [Clostridium] methylpentosum DSM 5476.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae;
OC Oscillospiraceae incertae sedis.
OX NCBI_TaxID=537013 {ECO:0000313|EMBL:EEG29446.1, ECO:0000313|Proteomes:UP000003340};
RN [1] {ECO:0000313|EMBL:EEG29446.1, ECO:0000313|Proteomes:UP000003340}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 5476 {ECO:0000313|EMBL:EEG29446.1,
RC ECO:0000313|Proteomes:UP000003340};
RA Fulton L., Clifton S., Fulton B., Xu J., Minx P., Pepin K.H., Johnson M.,
RA Bhonagiri V., Nash W.E., Mardis E.R., Wilson R.K.;
RL Submitted (JAN-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:EEG29446.1, ECO:0000313|Proteomes:UP000003340}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 5476 {ECO:0000313|EMBL:EEG29446.1,
RC ECO:0000313|Proteomes:UP000003340};
RA Sudarsanam P., Ley R., Guruge J., Turnbaugh P.J., Mahowald M., Liep D.,
RA Gordon J.;
RT "Draft genome sequence of Clostridium methylpentosum (DSM 5476).";
RL Submitted (FEB-2009) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EEG29446.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACEC01000102; EEG29446.1; -; Genomic_DNA.
DR STRING; 537013.CLOSTMETH_02963; -.
DR eggNOG; COG0845; Bacteria.
DR eggNOG; COG1554; Bacteria.
DR HOGENOM; CLU_253695_0_0_9; -.
DR Proteomes; UP000003340; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0016798; F:hydrolase activity, acting on glycosyl bonds; IEA:UniProtKB-KW.
DR GO; GO:0030245; P:cellulose catabolic process; IEA:UniProtKB-KW.
DR Gene3D; 1.50.10.10; -; 1.
DR Gene3D; 1.20.1270.90; AF1782-like; 4.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 2.70.98.50; putative glycoside hydrolase family protein from bacillus halodurans; 1.
DR InterPro; IPR008928; 6-hairpin_glycosidase_sf.
DR InterPro; IPR012341; 6hp_glycosidase-like_sf.
DR InterPro; IPR049053; AFCA-like_C.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR027414; GH95_N_dom.
DR InterPro; IPR013783; Ig-like_fold.
DR PANTHER; PTHR31084; ALPHA-L-FUCOSIDASE 2; 1.
DR PANTHER; PTHR31084:SF19; GLYCOSYL HYDROLASE FAMILY 95 N-TERMINAL DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF00754; F5_F8_type_C; 1.
DR Pfam; PF07554; FIVAR; 4.
DR Pfam; PF14498; Glyco_hyd_65N_2; 2.
DR Pfam; PF21307; Glyco_hydro_95_C; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 2.
DR SUPFAM; SSF48208; Six-hairpin glycosidases; 1.
PE 4: Predicted;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023001};
KW Cellulose degradation {ECO:0000256|ARBA:ARBA00023001};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00023295};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Polysaccharide degradation {ECO:0000256|ARBA:ARBA00023001};
KW Reference proteome {ECO:0000313|Proteomes:UP000003340};
KW Signal {ECO:0000256|SAM:SignalP}; Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..29
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 30..1657
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5002896145"
FT TRANSMEM 1633..1651
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 55..100
FT /note="Glycosyl hydrolase family 95 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF14498"
FT DOMAIN 117..263
FT /note="Glycosyl hydrolase family 95 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF14498"
FT DOMAIN 706..802
FT /note="Alpha fucosidase A-like C-terminal"
FT /evidence="ECO:0000259|Pfam:PF21307"
FT DOMAIN 996..1082
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|Pfam:PF00754"
SQ SEQUENCE 1657 AA; 180836 MW; 04C39CAF86F1FA6A CRC64;
MKLAKRFQIK RLLAGILAAA LVLPVGVNAA TGAIPVSEQL SVLRADATVE KELKLWYDEP
APNSDAGWEQ WSLPLGCGYM GANVFGITDT ERIQLTENSL CGNNGFEGGL NNFSETYLDF
GHDYSGVSNY TRDLILNDAT AHVRYDYGGV TYSREYFTSY PDKVMAIKLS ASESGKLSFT
LRPTIPYLNE KKSGTVSAQG DTITLSGRMH GYEVDFEGQY KVIPSGGSAS MQAANDADGD
NGTIQVTGAD SAVILIAIGT NYEFDPQVFL NPDATKLEGF EHPHAKVTER IEQASAQSYE
QLRSNHTADY QNLFDRTRFD LGGAVPQLTT DELMNAYKAG SNDRYLEELY FQYGRYLLIS
SSRKGALPPN LQGVWNMYEQ APWTAGYWHN INIQMNYWPV FSTNLAELFD SYIDYYNAYL
PAVRNSSNQF IAQQHPDNYD PGGDNGWSIG TGAGPYSVYA PNGQGTDGNG TGALMAQVFW
EYYDFTRDPD ILENITYPAV SGAANFMSRV MEPHGDYLLA DPSASPEQME NGNYVVTVGT
AWDQQLAYEM EQNTLEAAEL LGRQDEALPQ RLADQIDKLD PVQVGFSGQI KEFREENFYG
EIAEYNHRHI SQLVGLYPGT LINSTTPAWM DAAKVSLNLR GDKSTGWAMA HRLNAWARTK
DGNRTYSIYQ TLLKNGTLNN LWDTHPPFQI DGNFGGTAGV SEMLLQSHEG YIAPMPAIPD
AWAQGSYRGL VARGNFTVGA DWSNGQADQF TITSNAGGVC KLSYFNIADA VVTDSDGNTI
SFEKDSTDLI SFDTVQGKTY TVTQIPSYRA TQAASDLELH YLDNGQAVVM NWTASADAKS
YNIYRADGND ADYTLLESGV TDTSYTSRNA ELDQKEQHTF RVTAVGQDGR ESSGVTAIMF
PLSPSASVSG VFLDETTVQL SIDPVASAEQ YNIYRKTADG YEKLMSTKYN TAVVGNVSVG
DAFAVSVESE TRESPKTDAV ITSQITLDNV LQGKAITSTR PALGDYPLSN ALDGNPDTRY
ALPDEEGPYS VTIDLDGVYE LHNFKILEFQ NPPGETRSNE TTIELSSDGG TSWTTVIDKQ
SLNPGSGLMG ITEFDLGGAA GSLMRITFHH TIGTTSSTAT IHEIICSGSV GAPTDKQALQ
EVLFQADKID TSSYDPSAVS AFEDTYQRAT LLFNNPAAGQ EQVDSITAEL TNSIKNLKQV
NVAFQKPITA NHEAIDPFYG IDKMVDGDHE SRYAGSDIYT ELEVEIDLQG NYLVNRVNVE
EYLDGGTRGG ETTIQVYNGE EWVTVVDKQS LSGQRFTVLS FEEIVGSKIR IQFKNTQSQR
LITIYEIEVM GLAQLDLSML QAKIQEAEAI QPTSCTPDSY QGLLEAIASA KDVLENATAQ
QQVNQAVQSL QSAIDALEPA SPADKTILQK VIDRADELIL GEEYASAIES VQASFAASLA
EAKEINANRF ATQQQVDNAW IALMTEIHKL GFQKGDLTSL RTLCEYVAGL QLDRYIDNQA
KADLPGALAA GRAVLEDGDA MADTIDSAVD ALLGVLTNLR LKADKTYLQQ ALDRAQSIDL
NDYSAQSVDA FQRMVKYGEH INCKDDATQE EVDAAAKQID AAIAALSPVQ RVAGDGVQAA
SQTQSSPRTG ERGMLPGLSV LLLAAGLLLF GQHKRKE
//