ID A0A1Y2BUS7_9FUNG Unreviewed; 1086 AA.
AC A0A1Y2BUS7;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 24-JAN-2024, entry version 18.
DE RecName: Full=CBM1 domain-containing protein {ECO:0000259|PROSITE:PS51164};
GN ORFNames=LY90DRAFT_704436 {ECO:0000313|EMBL:ORY38493.1};
OS Neocallimastix californiae.
OC Eukaryota; Fungi; Fungi incertae sedis; Chytridiomycota;
OC Chytridiomycota incertae sedis; Neocallimastigomycetes; Neocallimastigales;
OC Neocallimastigaceae; Neocallimastix.
OX NCBI_TaxID=1754190 {ECO:0000313|EMBL:ORY38493.1, ECO:0000313|Proteomes:UP000193920};
RN [1] {ECO:0000313|EMBL:ORY38493.1, ECO:0000313|Proteomes:UP000193920}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=G1 {ECO:0000313|EMBL:ORY38493.1,
RC ECO:0000313|Proteomes:UP000193920};
RG DOE Joint Genome Institute;
RA Haitjema C.H., Gilmore S.P., Henske J.K., Solomon K.V., De Groot R.,
RA Kuo A., Mondo S.J., Salamov A.A., Labutti K., Zhao Z., Chiniquy J.,
RA Barry K., Brewer H.M., Purvine S.O., Wright A.T., Boxma B., Van Alen T.,
RA Hackstein J.H., Baker S.E., Grigoriev I.V., O'Malley M.A.;
RT "A Parts List for Fungal Cellulosomes Revealed by Comparative Genomics.";
RL Submitted (AUG-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ORY38493.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MCOG01000136; ORY38493.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1Y2BUS7; -.
DR Proteomes; UP000193920; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR GO; GO:0030248; F:cellulose binding; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR InterPro; IPR035971; CBD_sf.
DR InterPro; IPR000254; Cellulose-bd_dom_fun.
DR PANTHER; PTHR37435:SF3; DUMPY: SHORTER THAN WILD-TYPE; 1.
DR PANTHER; PTHR37435; PROTEIN CBG14344; 1.
DR Pfam; PF00734; CBM_1; 1.
DR SMART; SM00236; fCBD; 1.
DR SUPFAM; SSF57180; Cellulose-binding domain; 1.
DR PROSITE; PS00562; CBM1_1; 1.
DR PROSITE; PS51164; CBM1_2; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000193920};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..1086
FT /note="CBM1 domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012734123"
FT DOMAIN 1051..1086
FT /note="CBM1"
FT /evidence="ECO:0000259|PROSITE:PS51164"
FT REGION 115..134
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 275..294
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 524..543
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 115..133
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 275..293
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 524..542
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1086 AA; 121278 MW; 7D10813E068DFBDA CRC64;
MKFKFVKTIL ILSSIAYGKA DDLNIEEIPK NIKTITKTVI YTSKSTPYKP YKGSAFDDVT
PCEEGDEECF TELINECTPE IQYECMASNS IVIQAKCRQL SDVCEDIWYP KTTTTTTTTT
TTTTTSTKPT SHKPYEGSAF DDVTPCDEGD EECFGKLINE CTPEIQYECM ASNSIVIQAK
CGQLSDVCED IWYPKATTTT TTTKPTSHKP YEGSAFDDVT PCEEGDEECF KRLINECTPE
IQYECMASNS IVIQAKCRQL SDVCEDIWYP KSTTTTTTTT TTTTTSTKPT SHKPYEGSAF
DDVTPCEEGD EECFTELINE CTPEIQYECM ASNSIVIQAK CRQLSDVCED IWYPKTTTTT
TTTTTTTSTK PTSHKPYEGS AFDDVTPCEE GDEECFKRLI NECTPEIQYE CMASNSIVIQ
AKCGQLSDVC EDIWYPKSTI TNTTTTTTTT TTKPTSHKPY EGSAFDDVTP CEEGDEECFT
ELINECTPEI QYECMASNSI VIQAKCRQLS DVCEDIWYPK STTTTTTTTT TTTTSTKPTS
HKPYEGSAFD DVTPCEEGDE ECFTELINEC TPEIQYECMT SNSIVIQAKC RQLSDVCEDI
WYPKSTTTNT TTTTTTTTKP TSHKPYEGSA FDDVTPCDEG DEECFTELIN ECTPEIQYEC
MTSNSIVIQA KCRQLSDVCE DIWYPKSTTT NTTTTTTTTT KPTSHKPYEG SAFDDVTPCE
EGDEECFTEL INECTPEIQY ECMASNSIVI QAKCRQLSDV CEDIWYPKTT TTTTTTTTTT
STKPTSHKPY EGSAFDDVTP CDEGDEECFT ELINECTPEI QYECMASNSI VIQAKCRQLS
DVCEDIWYPK STTTTTTTKP TSHKPYEGSA FDDVTPCDED DEECFTELIN ECTPEIQYEC
MASNSIVIQA KCRQLSDVCE DIWYPKTTTT TTTTTTTTTT STKPTSHKPY EGSAFDDIIL
CDESSDESDD ECFDKLINEC TPEIQHECMA SNSMMIQAKC RQLSEICDDI WNPRFTTTTI
TTPTTTSLNK DSVTVTTTEK EIATEAMDMK HCAPKWGQCG GIGYKGPTCC QSGDCHEYNP
WYSQCL
//