ID D6X5Q0_STRE2 Unreviewed; 1392 AA.
AC D6X5Q0;
DT 10-AUG-2010, integrated into UniProtKB/TrEMBL.
DT 10-AUG-2010, sequence version 1.
DT 27-MAR-2024, entry version 53.
DE RecName: Full=Glycosyl hydrolase family 98 putative carbohydrate-binding module domain-containing protein {ECO:0000259|SMART:SM00776};
GN ORFNames=SSDG_06180 {ECO:0000313|EMBL:EFH30958.1};
OS Streptomyces pristinaespiralis (strain ATCC 25486 / DSM 40338 / CBS 914.69
OS / JCM 4507 / NBRC 13074 / NRRL 2958 / 5647).
OC Bacteria; Actinomycetota; Actinomycetes; Kitasatosporales;
OC Streptomycetaceae; Streptomyces.
OX NCBI_TaxID=457429 {ECO:0000313|EMBL:EFH30958.1, ECO:0000313|Proteomes:UP000002805};
RN [1] {ECO:0000313|Proteomes:UP000002805}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 25486 / DSM 40338 / CBS 914.69 / JCM 4507 / NBRC 13074 /
RC NRRL 2958 / 5647 {ECO:0000313|Proteomes:UP000002805};
RG The Broad Institute Genome Sequencing Platform;
RA Fischbach M., Ward D., Young S., Jaffe D., Gnerre S., Berlin A., Heiman D.,
RA Hepburn T., Sykes S., Alvarado L., Kodira C.D., Straight P., Clardy J.,
RA Hung D., Kolter R., Mekalanos J., Walker S., Walsh C.T., Lander E.,
RA Galagan J., Nusbaum C., Birren B.;
RL Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000002805}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 25486 / DSM 40338 / CBS 914.69 / JCM 4507 / NBRC 13074 /
RC NRRL 2958 / 5647 {ECO:0000313|Proteomes:UP000002805};
RG The Broad Institute Genome Sequencing Platform;
RG Broad Institute Microbial Sequencing Center;
RA Fischbach M., Godfrey P., Ward D., Young S., Zeng Q., Koehrsen M.,
RA Alvarado L., Berlin A.M., Bochicchio J., Borenstein D., Chapman S.B.,
RA Chen Z., Engels R., Freedman E., Gellesch M., Goldberg J., Griggs A.,
RA Gujja S., Heilman E.R., Heiman D.I., Hepburn T.A., Howarth C., Jen D.,
RA Larson L., Lewis B., Mehta T., Park D., Pearson M., Richards J.,
RA Roberts A., Saif S., Shea T.D., Shenoy N., Sisk P., Stolte C., Sykes S.N.,
RA Thomson T., Walk T., White J., Yandava C., Straight P., Clardy J., Hung D.,
RA Kolter R., Mekalanos J., Walker S., Walsh C.T., Wieland-Brown L.C.,
RA Haas B., Nusbaum C., Birren B.;
RT "The genome sequence of Streptomyces pristinaespiralis strain ATCC 25486.";
RL Submitted (OCT-2009) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM000950; EFH30958.1; -; Genomic_DNA.
DR eggNOG; COG0366; Bacteria.
DR eggNOG; COG1874; Bacteria.
DR HOGENOM; CLU_002413_1_1_11; -.
DR Proteomes; UP000002805; Chromosome.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR GO; GO:0033926; F:glycopeptide alpha-N-acetylgalactosaminidase activity; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:UniProt.
DR CDD; cd14244; GH_101_like; 1.
DR Gene3D; 2.70.98.10; -; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 2.60.120.1060; NPCBM/NEW2 domain; 1.
DR InterPro; IPR018905; A-galactase_NEW3.
DR InterPro; IPR025706; Endoa_GalNAc.
DR InterPro; IPR040633; Gal_mutarotas_3.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR014718; GH-type_carb-bd.
DR InterPro; IPR049314; GH101_dom-5.
DR InterPro; IPR040502; GH101_dom-6.
DR InterPro; IPR035364; Glyco_hyd_101_beta.
DR InterPro; IPR013222; Glyco_hyd_98_carb-bd.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR038637; NPCBM_sf.
DR Pfam; PF18080; Gal_mutarotas_3; 1.
DR Pfam; PF17974; GalBD_like; 1.
DR Pfam; PF21466; GH101_dom-5; 1.
DR Pfam; PF17451; Glyco_hyd_101C; 1.
DR Pfam; PF12905; Glyco_hydro_101; 1.
DR Pfam; PF08305; NPCBM; 1.
DR Pfam; PF10633; NPCBM_assoc; 1.
DR SMART; SM00776; NPCBM; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000002805};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..1392
FT /note="Glycosyl hydrolase family 98 putative carbohydrate-
FT binding module domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5039418900"
FT DOMAIN 1132..1254
FT /note="Glycosyl hydrolase family 98 putative carbohydrate-
FT binding module"
FT /evidence="ECO:0000259|SMART:SM00776"
FT REGION 1148..1170
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1203..1275
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1368..1392
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1203..1231
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1246..1271
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1368..1382
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1392 AA; 149697 MW; 92C8F65DBF221593 CRC64;
MPRRFRAAGA LAAASCAVFA LAGPAFPASP ADGPGAVTAA GAGPVISSGA LSVTVADDFP
RVLSYTDRAS GAVLLGSTAP VTEVTLNGKP YAVRPAGAPR VERSAAHYTL VFPALPGVEM
DAVLAVSGRA VTFKVTAVRD TEAFRVGTID IPGHDLVSVG STERGAATAF TRLDPDSTRT
ADVFADVTAG TPAEAAPVGA SYAIVNTGAL AAAVESNSSY DRPSGPTGGD DARFWHQARK
AADGSVRVGV WSGQWTYRGA GAPRPESGDG LPWAKVVVTP DANGDRKVDW QDGAVAFRTI
GVTAPGSDAT PERVVTHIPF NFASQATHPF LRTLDDVKRV SLATDGLGQL AVLKGYGAEG
HDSAHPDYGG NYNERAGGLT DLNKLLEEGE KWGADFGVHV NATESYPEAN AFDETLVDRT
KPGWNWLNQS YYIDQRRDIN SGDLARRFQQ LRDETHRNLD FLYIDVYYTH GWIADKTIRA
VQQQGWTVGT EWADKFERAS LWSHWANDLD YGGATNKGLN SQIIRFIRNG EKDVWNNHPV
LGQTALEDFE GWTGETDWNA FYDNIWQRDL PAKYLQHERI TRWNGNDIRF TGGLRGTVED
GRRTFYDDGR KVLDGDRYLL PWAGGEKLYH YNKAGGTSSW AVDGNGTYTV YKLGDNGRVK
AGIVRPEGGR ITLTATAGQP YVLYPDRAPE QRDARWGEGG PVADPGFNDA SLGAWAKDGG
VTRDTDGHGR NSAALEGPGA ASLSQRITGL EAGERYTASA WIEVEPGASR RTTLSVGGRS
VTVERSTAKD QVAASDWHGT HFQRAKVNFT APDEAVTLRI EAAAGSAARV RADDVRIVAN
APTTKQGALV YEDFEDVDQG WGPFLKGDAG GATDPRTHVS RLHAPYTQAG WNGKLIDDVL
GGKESLKAHD ENTGLVYRTA PWTVPMQDGH SYRVEYDYQS SHSGAYEWVT GYDRTTGGSV
ERRRTPVEQQ RTTGHFSETV TAGCGDTWTG LRKRADAPDG ADFVMDGFTV TDLGPAARPT
ACGTLAVTAP EALEPGDRNE VEAVFANDER TAAADVRLAL QLPEGWTAEP AGPVSFETVA
PGAKVTGTWQ VTPPADAEYR AYSLGSTATY TVGGSPRELT AATSVRTLPP PPTTDAWASD
LDWTSATNGW GPVERDLSNG ETGTGDGGPL RIGGVVYEKG LGSHAPAKVR YYLGGEVHVL
HRAGGRGRRP DDTGDRTVRR RRGRHGEGGL PGAEGRRPGL VPDRGRHRSR LRRTGRRRRR
GRQRQRPRRL GCRPLPLREL SRAEHLRRAA RYAGPPAGLT RAAYAARPGC EATLSSHACR
TSTKQNRPEL VVGALCGMVL FKPDRRTGAR LYRVDGRVSL VGSVGSMHDR FGRDHQRADR
RPGSGGRAAG VA
//