ID A0A367F2N0_9ACTN Unreviewed; 461 AA.
AC A0A367F2N0;
DT 07-NOV-2018, integrated into UniProtKB/TrEMBL.
DT 07-NOV-2018, sequence version 1.
DT 24-JAN-2024, entry version 21.
DE RecName: Full=Glycosyl hydrolase family 5 {ECO:0008006|Google:ProtNLM};
GN ORFNames=DQ384_33920 {ECO:0000313|EMBL:RCG24045.1};
OS Sphaerisporangium album.
OC Bacteria; Actinomycetota; Actinomycetes; Streptosporangiales;
OC Streptosporangiaceae; Sphaerisporangium.
OX NCBI_TaxID=509200 {ECO:0000313|EMBL:RCG24045.1, ECO:0000313|Proteomes:UP000253094};
RN [1] {ECO:0000313|EMBL:RCG24045.1, ECO:0000313|Proteomes:UP000253094}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCTCC AA 208026 {ECO:0000313|EMBL:RCG24045.1,
RC ECO:0000313|Proteomes:UP000253094};
RA Li L.;
RT "Sphaerisporangium craniellae sp. nov., isolated from a marine sponge in
RT the South China Sea.";
RL Submitted (JUN-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 12 (cellulase H) family.
CC {ECO:0000256|ARBA:ARBA00005519, ECO:0000256|RuleBase:RU361163}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RCG24045.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QOIL01000025; RCG24045.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A367F2N0; -.
DR OrthoDB; 2557744at2; -.
DR Proteomes; UP000253094; Unassembled WGS sequence.
DR GO; GO:0008810; F:cellulase activity; IEA:InterPro.
DR GO; GO:0030247; F:polysaccharide binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000272; P:polysaccharide catabolic process; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 1.
DR Gene3D; 2.60.120.180; -; 1.
DR Gene3D; 2.60.40.290; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR001919; CBD2.
DR InterPro; IPR008965; CBM2/CBM3_carb-bd_dom_sf.
DR InterPro; IPR012291; CBM2_carb-bd_dom_sf.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013319; GH11/12.
DR InterPro; IPR002594; GH12.
DR InterPro; IPR013783; Ig-like_fold.
DR PANTHER; PTHR34002; BLR1656 PROTEIN; 1.
DR PANTHER; PTHR34002:SF9; XYLOGLUCAN-SPECIFIC ENDO-BETA-1,4-GLUCANASE A; 1.
DR Pfam; PF00553; CBM_2; 1.
DR Pfam; PF00041; fn3; 1.
DR Pfam; PF01670; Glyco_hydro_12; 1.
DR SMART; SM00637; CBD_II; 1.
DR SMART; SM00060; FN3; 1.
DR SUPFAM; SSF49384; Carbohydrate-binding domain; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 1.
DR PROSITE; PS51173; CBM2; 1.
DR PROSITE; PS50853; FN3; 1.
PE 3: Inferred from homology;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023277,
KW ECO:0000256|RuleBase:RU361163};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295, ECO:0000256|RuleBase:RU361163};
KW Hydrolase {ECO:0000256|RuleBase:RU361163};
KW Polysaccharide degradation {ECO:0000256|ARBA:ARBA00023326,
KW ECO:0000256|RuleBase:RU361163};
KW Reference proteome {ECO:0000313|Proteomes:UP000253094};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..461
FT /note="Glycosyl hydrolase family 5"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5039121545"
FT DOMAIN 267..352
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 352..461
FT /note="CBM2"
FT /evidence="ECO:0000259|PROSITE:PS51173"
FT REGION 334..355
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 334..351
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 461 AA; 47298 MW; 8BF24EC72407151C CRC64;
MKRRALRALT AMIAAAGTLL MTPVVAGQAH ADTVICEKYG ATTIAGGKYV VINNNWGDDT
QQCINVTSTG FSITQASHNK PQNGAPGSYP AVYAGCHYAN CSTGSQLPMR VSDSRFPTIQ
TSVSMTYPSS GVYDAAYDVW FDPTARTDGQ NTGAELMVWL NHTGSVQPVG SRVGTASIAG
GTWDVWFGNI GWNVVSYVRT SPTSSINFAV NSFYTDMVSR GYAQNSWYIT SVQAGFEPWV
GGAGLAVNTF SYSVGGTGGG DTTPPSAPSG LTASNVTSSG ATLSWNASTD NVGVTGYDVS
RNGTVIGTAV GTSYNVTGLS ASTAYSFSVR ARDAAGNTSS PSNTVSVTTP PGGGGGGSCR
VAYVKNEWPG GFTANLTITN TGTAAVNGWT LAFAFPGDQR ITSSWNATVS QSGTSVTARN
AAHNGSIPPG GNATFGFQGT WNTSDASPAS YTLNGSACTV G
//