ID D4CQC5_9FIRM Unreviewed; 928 AA.
AC D4CQC5;
DT 18-MAY-2010, integrated into UniProtKB/TrEMBL.
DT 18-MAY-2010, sequence version 1.
DT 27-MAR-2024, entry version 49.
DE SubName: Full=Glycosyl hydrolase family 3 N-terminal domain protein {ECO:0000313|EMBL:EFE90685.1};
GN ORFNames=GCWU000341_02572 {ECO:0000313|EMBL:EFE90685.1};
OS Oribacterium sp. oral taxon 078 str. F0262.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae;
OC Oribacterium.
OX NCBI_TaxID=608534 {ECO:0000313|EMBL:EFE90685.1, ECO:0000313|Proteomes:UP000004602};
RN [1] {ECO:0000313|EMBL:EFE90685.1, ECO:0000313|Proteomes:UP000004602}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=F0262 {ECO:0000313|EMBL:EFE90685.1,
RC ECO:0000313|Proteomes:UP000004602};
RA Weinstock G., Sodergren E., Clifton S., Fulton L., Fulton B., Courtney L.,
RA Fronick C., Harrison M., Strong C., Farmer C., Delahaunty K., Markovic C.,
RA Hall O., Minx P., Tomlinson C., Mitreva M., Nelson J., Hou S., Wollam A.,
RA Pepin K.H., Johnson M., Bhonagiri V., Zhang X., Suruliraj S., Warren W.,
RA Chinwalla A., Mardis E.R., Wilson R.K.;
RL Submitted (FEB-2010) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 3 family.
CC {ECO:0000256|ARBA:ARBA00005336}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EFE90685.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACIQ02000032; EFE90685.1; -; Genomic_DNA.
DR RefSeq; WP_009216016.1; NZ_GG729936.1.
DR AlphaFoldDB; D4CQC5; -.
DR STRING; 608534.GCWU000341_02572; -.
DR eggNOG; COG1472; Bacteria.
DR HOGENOM; CLU_005235_2_0_9; -.
DR Proteomes; UP000004602; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR Gene3D; 3.40.50.1700; Glycoside hydrolase family 3 C-terminal domain; 1.
DR Gene3D; 3.20.20.300; Glycoside hydrolase, family 3, N-terminal domain; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR026891; Fn3-like.
DR InterPro; IPR002772; Glyco_hydro_3_C.
DR InterPro; IPR036881; Glyco_hydro_3_C_sf.
DR InterPro; IPR001764; Glyco_hydro_3_N.
DR InterPro; IPR036962; Glyco_hydro_3_N_sf.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR InterPro; IPR013783; Ig-like_fold.
DR PANTHER; PTHR42715; BETA-GLUCOSIDASE; 1.
DR PANTHER; PTHR42715:SF10; BETA-GLUCOSIDASE F-RELATED; 1.
DR Pfam; PF14310; Fn3-like; 1.
DR Pfam; PF00933; Glyco_hydro_3; 1.
DR Pfam; PF01915; Glyco_hydro_3_C; 1.
DR SMART; SM01217; Fn3_like; 1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF52279; Beta-D-glucan exohydrolase, C-terminal domain; 1.
PE 3: Inferred from homology;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000313|EMBL:EFE90685.1}.
FT DOMAIN 423..494
FT /note="Fibronectin type III-like"
FT /evidence="ECO:0000259|SMART:SM01217"
FT REGION 909..928
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 928 AA; 102849 MW; 0A208CA3C2A3598A CRC64;
MSGTDWERIR PAVFLRRGDG KRGGGLFRRR SRSAASLAEE KWKERGRTET EGGLLMGERA
ERKLLDYEIR HRALARRAAA EGIVLLKNEG ILPLAAGQSL ALYGAGAVGT VKGGSGSGDV
NARESVNIWQ GLKSAGFTIE NEDWLSDYEK RFQEARLSWR DRILSEFEEG NGNDAEFFEI
YTKNPFRSPS GGKIVKTEAD CAIYVLSRTA GEGKDRRKEP GDYFLSKEEE RELSELSELY
RDLILLLNTG GPVDLSFTER LPNIRAILFL SQPGMEGGNA VSDILSGAVS PSGRLTDSWA
LRYEDYPNAE SFSYLSGDLS REEYREGIYV GYRYFDSFSV PLRYGFGEGL SYTDFGIRLE
SLRFLEGEAE SALSYGSTLL SGLGLQSGDQ ENRREQAESA GVEPALELSI RVENTGSRYA
GRECVQIYAS LPAGELEKEH RRLIGFRKTA LLSPGESEEF LLRIPIYLLA SYEEKRSAYL
LERGCYGIWI GGSLRASKAV WKLRLGREAV LLKLRPELSI REELSEIRRS REMQEKLCAA
WEELSLPETE LPAEKLCLRE LSYREEGPLP GRRAAEIAAG LSEEELLHLV TGRIREQDGK
EAFPQSVPGA AGETIALDTA FGKVESMMLA DGPAGLRLEA EPREKDGRLL SGGLISALEC
GFFRKAEERD PAERIVYQYC TAFPVGTLLA QSFDPRLLRE IGEAVSEEME LFHIAIWLAP
GMNIHRNPLC GRNFEYYSED PLLSGVMAAS LSLGVQERKG HFVTIKHLAG NNQEDNRMGS
DDIVSERALR EIYLRGFELA VKNAAPGAIM SSYNLINGVH SANNRALCTE IARKEWGFRG
LIMSDWGTTN MSTDAALCTA SGCIRAGNDL IMPGAPEDLE NMREALRKGE LSLELLRLSA
SRILEAAGKT ETGAEAVRAE GRKGGEQR
//