ID V2XJ82_9FIRM Unreviewed; 832 AA.
AC V2XJ82;
DT 22-JAN-2014, integrated into UniProtKB/TrEMBL.
DT 22-JAN-2014, sequence version 1.
DT 27-MAR-2024, entry version 35.
DE SubName: Full=Glycosyl hydrolase family 3 protein {ECO:0000313|EMBL:ESL02209.1};
GN ORFNames=GCWU0000282_002343 {ECO:0000313|EMBL:ESL02209.1};
OS Catonella morbi ATCC 51271.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae; Catonella.
OX NCBI_TaxID=592026 {ECO:0000313|EMBL:ESL02209.1, ECO:0000313|Proteomes:UP000018227};
RN [1] {ECO:0000313|EMBL:ESL02209.1, ECO:0000313|Proteomes:UP000018227}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 51271 {ECO:0000313|EMBL:ESL02209.1,
RC ECO:0000313|Proteomes:UP000018227};
RA Weinstock G., Sodergren E., Clifton S., Fulton L., Fulton B., Courtney L.,
RA Fronick C., Harrison M., Strong C., Farmer C., Delahaunty K., Markovic C.,
RA Hall O., Minx P., Tomlinson C., Mitreva M., Nelson J., Hou S., Wollam A.,
RA Pepin K.H., Johnson M., Bhonagiri V., Nash W.E., Warren W., Chinwalla A.,
RA Mardis E.R., Wilson R.K.;
RL Submitted (JUN-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 3 family.
CC {ECO:0000256|ARBA:ARBA00005336}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ESL02209.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACIL03000016; ESL02209.1; -; Genomic_DNA.
DR RefSeq; WP_023355203.1; NZ_KI535369.1.
DR AlphaFoldDB; V2XJ82; -.
DR STRING; 592026.GCWU0000282_002343; -.
DR eggNOG; COG1472; Bacteria.
DR HOGENOM; CLU_005235_2_0_9; -.
DR OrthoDB; 98455at2; -.
DR Proteomes; UP000018227; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR Gene3D; 3.40.50.1700; Glycoside hydrolase family 3 C-terminal domain; 1.
DR Gene3D; 3.20.20.300; Glycoside hydrolase, family 3, N-terminal domain; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR026891; Fn3-like.
DR InterPro; IPR002772; Glyco_hydro_3_C.
DR InterPro; IPR036881; Glyco_hydro_3_C_sf.
DR InterPro; IPR001764; Glyco_hydro_3_N.
DR InterPro; IPR036962; Glyco_hydro_3_N_sf.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR InterPro; IPR013783; Ig-like_fold.
DR PANTHER; PTHR42715; BETA-GLUCOSIDASE; 1.
DR PANTHER; PTHR42715:SF10; BETA-GLUCOSIDASE F-RELATED; 1.
DR Pfam; PF14310; Fn3-like; 1.
DR Pfam; PF00933; Glyco_hydro_3; 1.
DR Pfam; PF01915; Glyco_hydro_3_C; 1.
DR PRINTS; PR00133; GLHYDRLASE3.
DR SMART; SM01217; Fn3_like; 1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF52279; Beta-D-glucan exohydrolase, C-terminal domain; 1.
PE 3: Inferred from homology;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000313|EMBL:ESL02209.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000018227}.
FT DOMAIN 339..410
FT /note="Fibronectin type III-like"
FT /evidence="ECO:0000259|SMART:SM01217"
SQ SEQUENCE 832 AA; 91227 MW; B65A4007F6E13974 CRC64;
MKKLKTRTFS GTTNAALSER EKLGMDIARE AAAAGIVLLK NENNILPIAK GSKIALYGIG
AANTFKGGTG SGDVNERQSV SIFEGLKNAG YIITNEAQAK ESVEFYKKAR EAWRDDILKK
CEGKDGAEFF IIYSTNPFSV PENKQSITKT DTDTAIYVVS RVAGEGADRH NKKGDFLLTE
AEEKELDSIC SLYEKVVVVL NTGGVMDLSP LDKYPNIYGI INLGQAGMEG GNALSDIVSG
EVNPSGKLAA SWAFKYEDYP NASEFSHNNG NVNQEYYNEG IYVGYRYFDS FNIPVRYGFG
FGLSYTDFSV EFEGIKEEKG KGVILDVRVK NTGKVAGREV IQVYVSCPEG KLEKERRRLA
AFKKTELLAA GEEKDYSLAI PFDVLTSYNE DEPGWMLEKG LYGFYVGNSL GSSVLKAIME
LDADVITEKT MHICKPEKPV EEYKVNSKLV EERKTEIKKL AETLPVVRIK AESIAVKETK
YLSNEELAAA EEIELVKSLS EEQIKKLATG DPGRAQEESA LGSAGISVPG SASETSHCAE
NKGIAGIVLA DGPAGLRLNR YYFVRDGKMV PMSFMFSLEG GLLVPDADKT EGERFYQYCT
AFPVGTVLAQ TWDEEVVRKV GQHVAKEMVE FGVTLWLAPG MNIQRNPLCG RNFEYYSEDP
FITGKIAAAM TGGVQSVKGC GTTVKHFACN NNEDNRMGCD SILSERALRE IYLKGFGIAI
KEAQPMSIMT SYNKINGIHA ANNYDLCTNV ARNEFGFKGM IMTDWTTTHN GTDCTAAGCI
RAGNDAVMPG CEDDQINLTE ELASGKLQKT ALEACVSRLV RCVLQSNEYE EA
//