ID Q21N34_SACD2 Unreviewed; 436 AA.
AC Q21N34;
DT 18-APR-2006, integrated into UniProtKB/TrEMBL.
DT 18-APR-2006, sequence version 1.
DT 27-MAR-2024, entry version 95.
DE SubName: Full=Chitin-binding protein {ECO:0000313|EMBL:ABD79895.1};
GN Name=cbpA {ECO:0000313|EMBL:ABD79895.1};
GN OrderedLocusNames=Sde_0633 {ECO:0000313|EMBL:ABD79895.1};
OS Saccharophagus degradans (strain 2-40 / ATCC 43961 / DSM 17024).
OC Bacteria; Pseudomonadota; Gammaproteobacteria; Cellvibrionales;
OC Cellvibrionaceae; Saccharophagus.
OX NCBI_TaxID=203122 {ECO:0000313|EMBL:ABD79895.1, ECO:0000313|Proteomes:UP000001947};
RN [1] {ECO:0000313|EMBL:ABD79895.1, ECO:0000313|Proteomes:UP000001947}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=2-40 / ATCC 43961 / DSM 17024
RC {ECO:0000313|Proteomes:UP000001947};
RX PubMed=18516288; DOI=10.1371/journal.pgen.1000087;
RA Weiner R.M., Taylor L.E.II., Henrissat B., Hauser L., Land M.,
RA Coutinho P.M., Rancurel C., Saunders E.H., Longmire A.G., Zhang H.,
RA Bayer E.A., Gilbert H.J., Larimer F., Zhulin I.B., Ekborg N.A., Lamed R.,
RA Richardson P.M., Borovok I., Hutcheson S.;
RT "Complete genome sequence of the complex carbohydrate-degrading marine
RT bacterium, Saccharophagus degradans strain 2-40 T.";
RL PLoS Genet. 4:E1000087-E1000087(2008).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP000282; ABD79895.1; -; Genomic_DNA.
DR RefSeq; WP_011467116.1; NC_007912.1.
DR AlphaFoldDB; Q21N34; -.
DR SMR; Q21N34; -.
DR STRING; 203122.Sde_0633; -.
DR CAZy; AA10; Auxiliary Activities 10.
DR CAZy; CBM2; Carbohydrate-Binding Module Family 2.
DR KEGG; sde:Sde_0633; -.
DR eggNOG; COG3397; Bacteria.
DR HOGENOM; CLU_036068_0_0_6; -.
DR OrthoDB; 3675244at2; -.
DR Proteomes; UP000001947; Chromosome.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0030247; F:polysaccharide binding; IEA:UniProtKB-UniRule.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR CDD; cd21177; LPMO_AA10; 1.
DR Gene3D; 2.60.40.290; -; 1.
DR Gene3D; 2.60.40.3440; -; 1.
DR Gene3D; 2.70.50.50; chitin-binding protein cbp21; 1.
DR InterPro; IPR001919; CBD2.
DR InterPro; IPR008965; CBM2/CBM3_carb-bd_dom_sf.
DR InterPro; IPR012291; CBM2_carb-bd_dom_sf.
DR InterPro; IPR018366; CBM2_CS.
DR InterPro; IPR004302; Cellulose/chitin-bd_N.
DR InterPro; IPR014756; Ig_E-set.
DR PANTHER; PTHR34823:SF1; CHITIN-BINDING TYPE-4 DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR34823; GLCNAC-BINDING PROTEIN A; 1.
DR Pfam; PF17963; Big_9; 1.
DR Pfam; PF00553; CBM_2; 1.
DR Pfam; PF03067; LPMO_10; 1.
DR SMART; SM00637; CBD_II; 1.
DR SUPFAM; SSF49384; Carbohydrate-binding domain; 1.
DR SUPFAM; SSF81296; E set domains; 1.
DR PROSITE; PS51173; CBM2; 1.
DR PROSITE; PS00561; CBM2_A; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Reference proteome {ECO:0000313|Proteomes:UP000001947};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..24
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 25..436
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004199787"
FT DOMAIN 335..436
FT /note="CBM2"
FT /evidence="ECO:0000259|PROSITE:PS51173"
FT REGION 316..341
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 436 AA; 46195 MW; B2B7B44703A87E4F CRC64;
MFAKKITYST IALAIAGLSG NALSHGLMVD PPSRNALCGM IEKPDQATSP ACQQAFQNDF
NGGYQFMSVL THDIGRQGGT SNNVCGFDSE TWNGGATPWD AAIDWPTTQI SSGPLEIDWN
ISWGPHWDDT EEFVYYITKP DFVYQVGVPL SWSDFEATPF CQLDYSDANP NANPGVSTTK
SANLFHTQCN VPARSGRHVI YGEWGRNYFT YERFHGCMDV TFGGSNPPPS NQAPTANAQS
VNVSSGSSVS ITLSGSDVDG VISSYAIAAA PSNGSLSGSG AQRLYTPNGN FSGSDSFQFT
VTDDDGATSN AATVSINVSS QPEPEPEPEP EPEPEPGTGA SCEHVVVNAW DSGFQGAIRI
TNTSDQNING WNVSWSYNNG TTISQLWNAN FSGSNPYSAS NLGWNATIQP GQTVEFGFTG
NGSVPAAPAV TGAVCN
//