ID A0A1Y2ARN2_9FUNG Unreviewed; 823 AA.
AC A0A1Y2ARN2;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 22-FEB-2023, entry version 15.
DE SubName: Full=Cellulase-domain-containing protein {ECO:0000313|EMBL:ORY25136.1};
GN ORFNames=LY90DRAFT_514294 {ECO:0000313|EMBL:ORY25136.1};
OS Neocallimastix californiae.
OC Eukaryota; Fungi; Fungi incertae sedis; Chytridiomycota;
OC Chytridiomycota incertae sedis; Neocallimastigomycetes; Neocallimastigales;
OC Neocallimastigaceae; Neocallimastix.
OX NCBI_TaxID=1754190 {ECO:0000313|EMBL:ORY25136.1, ECO:0000313|Proteomes:UP000193920};
RN [1] {ECO:0000313|EMBL:ORY25136.1, ECO:0000313|Proteomes:UP000193920}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=G1 {ECO:0000313|EMBL:ORY25136.1,
RC ECO:0000313|Proteomes:UP000193920};
RG DOE Joint Genome Institute;
RA Haitjema C.H., Gilmore S.P., Henske J.K., Solomon K.V., De Groot R.,
RA Kuo A., Mondo S.J., Salamov A.A., Labutti K., Zhao Z., Chiniquy J.,
RA Barry K., Brewer H.M., Purvine S.O., Wright A.T., Boxma B., Van Alen T.,
RA Hackstein J.H., Baker S.E., Grigoriev I.V., O'Malley M.A.;
RT "A Parts List for Fungal Cellulosomes Revealed by Comparative Genomics.";
RL Submitted (AUG-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 5 (cellulase A) family.
CC {ECO:0000256|ARBA:ARBA00005641}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ORY25136.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MCOG01000215; ORY25136.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1Y2ARN2; -.
DR STRING; 1754190.A0A1Y2ARN2; -.
DR OrthoDB; 1329388at2759; -.
DR Proteomes; UP000193920; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0071704; P:organic substance metabolic process; IEA:InterPro.
DR Gene3D; 3.90.1220.10; Cellulose docking domain, dockering; 2.
DR Gene3D; 3.20.20.80; Glycosidases; 2.
DR InterPro; IPR002883; CBM10/Dockerin_dom.
DR InterPro; IPR009034; Dockerin_dom_fun_sf.
DR InterPro; IPR001547; Glyco_hydro_5.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR PANTHER; PTHR31297:SF17; ENDOGLUCANASE; 1.
DR PANTHER; PTHR31297; GLUCAN ENDO-1,6-BETA-GLUCOSIDASE B; 1.
DR Pfam; PF02013; CBM_10; 2.
DR Pfam; PF00150; Cellulase; 2.
DR SUPFAM; SSF51445; (Trans)glycosidases; 2.
DR SUPFAM; SSF64571; Cellulose docking domain, dockering; 2.
DR PROSITE; PS51763; CBM10; 2.
PE 3: Inferred from homology;
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Reference proteome {ECO:0000313|Proteomes:UP000193920};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 740..778
FT /note="CBM10"
FT /evidence="ECO:0000259|PROSITE:PS51763"
FT DOMAIN 783..819
FT /note="CBM10"
FT /evidence="ECO:0000259|PROSITE:PS51763"
FT REGION 348..378
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 823 AA; 94048 MW; A5C6CB0E08FAFED7 CRC64;
MNFGWNLGNT MDAQCIEYLN YEKDQTASET CWGNPKTTED MFKVLIDNQF NVFRIPTTWS
GHFGEAPDYK IDEKWLKRVH EVVDYPYKNG AFVILNLHHE TWNHAFSETL DTAKEILEKI
WSQIAEEFKD YDEHLIFEGS NEPRKNDTPV EWTGGDQEGW DAVNAMNAVF LKTIRSAGGN
NPKRHLMIPP YAAACNENSF NNFIFPEDDD KVIASVHAYA PYNFALNNGE GAVDKFDAAG
KRDLEWNINL MKKRFVDQGI PMILGEYGAM NRDNEEDRAV WAEFYMEKVT AIGVPQIWWD
NGVFEGTGER FGLLDRKNLK IVYPTIIAAL QKGRGLEVNV VHAVEKKPEE PTKTIKPTKP
TETTSPEEST KPEEPTGNIR DISSKELIKE MNFGWNLGNT MDAQCIEYLN YEKDQTASET
CWGNPKTTED MFKVLIDNQF NVFRIPTTWS GHFGEAPDYK IDEKWLKRVH EVVDYPYKNG
AFVILNLHHE TWNHAFSETL DTAKEILEKI WSQIAEEFKD YDEHLIFEGS NEPRKNDTPV
EWTGGDQEGW DAVNAMNAVF LKTIRSAGGN NPKRHLMIPP YAAACNENSF NNFIFPEDDD
KVIASVHAYA PYNFALNNGE GAVDKFDAAG KRDLEWNINL MKKRFIDQGI PMILGEYGAM
NRDNEEDRAV WAEFYMEKVT AIGVPQIWWD NGVFEGTGER FGLLDRKNLK IVYPTIIAAL
QKGRGLEVNV VHAVEKETEE CWSEKYGYEC CSPNNTKVVV SDENGKWGVE NGNWCGVLKY
TETCWSLPFG YPCCPHCNSL TKDENGKWGE LNGEWCGIVA DKC
//