ID A0A1E4SVZ1_9ASCO Unreviewed; 223 AA.
AC A0A1E4SVZ1;
DT 18-JAN-2017, integrated into UniProtKB/TrEMBL.
DT 18-JAN-2017, sequence version 1.
DT 22-FEB-2023, entry version 15.
DE RecName: Full=CBM1 domain-containing protein {ECO:0000259|PROSITE:PS51164};
DE Flags: Fragment;
GN ORFNames=CANARDRAFT_9242 {ECO:0000313|EMBL:ODV83675.1};
OS [Candida] arabinofermentans NRRL YB-2248.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Pichiaceae; Ogataea; Ogataea/Candida clade.
OX NCBI_TaxID=983967 {ECO:0000313|EMBL:ODV83675.1, ECO:0000313|Proteomes:UP000094801};
RN [1] {ECO:0000313|Proteomes:UP000094801}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=NRRL YB-2248 {ECO:0000313|Proteomes:UP000094801};
RG DOE Joint Genome Institute;
RA Riley R., Haridas S., Wolfe K.H., Lopes M.R., Hittinger C.T., Goker M.,
RA Salamov A., Wisecaver J., Long T.M., Aerts A.L., Barry K., Choi C.,
RA Clum A., Coughlan A.Y., Deshpande S., Douglass A.P., Hanson S.J.,
RA Klenk H.-P., Labutti K., Lapidus A., Lindquist E., Lipzen A.,
RA Meier-Kolthoff J.P., Ohm R.A., Otillar R.P., Pangilinan J., Peng Y.,
RA Rokas A., Rosa C.A., Scheuner C., Sibirny A.A., Slot J.C., Stielow J.B.,
RA Sun H., Kurtzman C.P., Blackwell M., Grigoriev I.V., Jeffries T.W.;
RT "Comparative genomics of biotechnologically important yeasts.";
RL Submitted (APR-2016) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KV453861; ODV83675.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1E4SVZ1; -.
DR OrthoDB; 548101at2759; -.
DR Proteomes; UP000094801; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR GO; GO:0030248; F:cellulose binding; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR InterPro; IPR000254; Cellulose-bd_dom_fun.
DR Pfam; PF00734; CBM_1; 2.
DR SMART; SM00236; fCBD; 2.
DR PROSITE; PS51164; CBM1_2; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000094801};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..223
FT /note="CBM1 domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5009162941"
FT DOMAIN 30..67
FT /note="CBM1"
FT /evidence="ECO:0000259|PROSITE:PS51164"
FT DOMAIN 122..159
FT /note="CBM1"
FT /evidence="ECO:0000259|PROSITE:PS51164"
FT REGION 72..123
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 170..191
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 223
FT /evidence="ECO:0000313|EMBL:ODV83675.1"
SQ SEQUENCE 223 AA; 22814 MW; D1893B2E92952655 CRC64;
MLIKSKLLAV NILALSFLFT AQSADAARTT CVSNYGQCGG TTYTGATNCC NSNFYCSTQN
AYWAGCESKT TSTTIAPTTG KTSTSTTCST SSKTTSTSLK QTTSSSSTST STSCSTSTSS
TSCVPNYSQC GGTTYTGATN CCNSNFYCST QNAYWAGCET RTSTTLSVST SASTSTSSSS
TTSTISNSAT GSSTSYCFVT GTYPSSSAVP SNIVYASCST LNF
//