ID A0A550HQ03_9ACTN Unreviewed; 839 AA.
AC A0A550HQ03;
DT 16-OCT-2019, integrated into UniProtKB/TrEMBL.
DT 16-OCT-2019, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE SubName: Full=Carbohydrate-binding protein {ECO:0000313|EMBL:TRO60854.1};
GN ORFNames=E4K73_28415 {ECO:0000313|EMBL:TRO60854.1};
OS Streptomyces sp. IB201691-2A2.
OC Bacteria; Actinomycetota; Actinomycetes; Kitasatosporales;
OC Streptomycetaceae; Streptomyces.
OX NCBI_TaxID=2561920 {ECO:0000313|EMBL:TRO60854.1, ECO:0000313|Proteomes:UP000317862};
RN [1] {ECO:0000313|EMBL:TRO60854.1, ECO:0000313|Proteomes:UP000317862}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=IB201691-2A2 {ECO:0000313|EMBL:TRO60854.1,
RC ECO:0000313|Proteomes:UP000317862};
RA Voitsekhovskaya I., Paulus C., Dahlem C., Rebets Y., Nadmid S.,
RA Axenov-Gribanov D.V., Ruckert C., Timofeyev M.A., Kalinowski J.,
RA Kiemer A.K., Luzhetskyy A.;
RT "Baikalomycins A-C, new aquayamycin type angucyclines isolated from Lake
RT Baikal derived Streptomyces sp. IB201691-2A.";
RL Submitted (MAR-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:TRO60854.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; SPQF01000012; TRO60854.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A550HQ03; -.
DR OrthoDB; 6402258at2; -.
DR Proteomes; UP000317862; Unassembled WGS sequence.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:UniProt.
DR CDD; cd04084; CBM6_xylanase-like; 1.
DR CDD; cd00146; PKD; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 2.120.10.30; TolB, C-terminal domain; 1.
DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR InterPro; IPR006584; Cellulose-bd_IV.
DR InterPro; IPR005084; CMB_fam6.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR012938; Glc/Sorbosone_DH.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR022409; PKD/Chitinase_dom.
DR InterPro; IPR000601; PKD_dom.
DR InterPro; IPR035986; PKD_dom_sf.
DR InterPro; IPR011041; Quinoprot_gluc/sorb_DH.
DR PANTHER; PTHR19328; HEDGEHOG-INTERACTING PROTEIN; 1.
DR PANTHER; PTHR19328:SF13; HIPL1 PROTEIN; 1.
DR Pfam; PF03422; CBM_6; 1.
DR Pfam; PF07995; GSDH; 1.
DR Pfam; PF18911; PKD_4; 1.
DR SMART; SM00606; CBD_IV; 1.
DR SMART; SM00089; PKD; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR SUPFAM; SSF49299; PKD domain; 1.
DR SUPFAM; SSF50952; Soluble quinoprotein glucose dehydrogenase; 1.
DR PROSITE; PS51175; CBM6; 1.
DR PROSITE; PS50093; PKD; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000317862};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..49
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 50..839
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5022238481"
FT DOMAIN 517..598
FT /note="PKD"
FT /evidence="ECO:0000259|PROSITE:PS50093"
FT DOMAIN 711..837
FT /note="CBM6"
FT /evidence="ECO:0000259|PROSITE:PS51175"
FT REGION 1..20
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 839 AA; 88539 MW; 5D4942FBC007F00D CRC64;
MHGNDHISTT RTESHRRRPR LRLRKALALF TGALLAGASL TLATPPAGAA VADPAAAPAA
PAAAEDFQQV TLAKGEPEVG EPMSLAVLPD RSVLHTSRDG ELRITDSAGN TKLAGKLAVY
SHDEEGLQGV GVDPGFADNR FIYLYYAPPL DTPAGDAPET GTAADFAPFD GVNRLSRFVL
NADGTLDNAS EKKILDVPAT RGICCHVGGD IDFDAAGNLY LSTGDDSNPF QSDGYSPIDE
RANRNPVFDA QRTSGNTNDL RGKILRIKVN ADGSYAVPDG NLFAPGTDKT KPEIYAMGFR
NPFRFSVDKK TGILYVGDYG PDAGAADPAR GPAGQVEFAR VTEPGNFGWP YCTGNNDAYV
DYDFGTGASG ASFDCSAPKN TSPNNTGLTD LPPAQTAWIP YDGGSVPEFG TGSESPMGGP
VYDYDASLDS PVKFPEAYDG DFFAGEFGRR WIKRIASDDS GTVQSINDVP WTGTQVMDMA
FGPDGALYVL DYGLAWFGGD ENSALYRIEN ATDGHSPVAQ AAADRTSGQA KLKVKFSSAG
TTDADGDALT YSWDFGDGGK STSANPTYTY KRNGTYTATL TAKDATGRTG SASVRIVVGN
TAPKVTLQLP EDGQLFSFGD AVPFKVKVTD PEDGRSIDCA KVKVSFVLGH DSHGHPLTTA
NGCSGTIQTS ADGGHDEDAN IFGVFDAEYT DNGGGGQEAL TTHDQNVVQP RHRQAEHYGK
SEGVTIQTKT TAHGGRTVGD INNRDWISFE PYVLSNATKI TARIASAGTG GKLEVRAGSP
TGRLLGTATV PVTGGWENFQ DVTANLTKAP RGTTTLYLVF KGSGTGSLYD VDDFTFTTR
//