ID A0A1Y2F3F9_9FUNG Unreviewed; 775 AA.
AC A0A1Y2F3F9;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 24-JAN-2024, entry version 19.
DE SubName: Full=Arabinanase/levansucrase/invertase {ECO:0000313|EMBL:ORY77866.1};
GN ORFNames=LY90DRAFT_698413 {ECO:0000313|EMBL:ORY77866.1};
OS Neocallimastix californiae.
OC Eukaryota; Fungi; Fungi incertae sedis; Chytridiomycota;
OC Chytridiomycota incertae sedis; Neocallimastigomycetes; Neocallimastigales;
OC Neocallimastigaceae; Neocallimastix.
OX NCBI_TaxID=1754190 {ECO:0000313|EMBL:ORY77866.1, ECO:0000313|Proteomes:UP000193920};
RN [1] {ECO:0000313|EMBL:ORY77866.1, ECO:0000313|Proteomes:UP000193920}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=G1 {ECO:0000313|EMBL:ORY77866.1,
RC ECO:0000313|Proteomes:UP000193920};
RG DOE Joint Genome Institute;
RA Haitjema C.H., Gilmore S.P., Henske J.K., Solomon K.V., De Groot R.,
RA Kuo A., Mondo S.J., Salamov A.A., Labutti K., Zhao Z., Chiniquy J.,
RA Barry K., Brewer H.M., Purvine S.O., Wright A.T., Boxma B., Van Alen T.,
RA Hackstein J.H., Baker S.E., Grigoriev I.V., O'Malley M.A.;
RT "A Parts List for Fungal Cellulosomes Revealed by Comparative Genomics.";
RL Submitted (AUG-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 43 family.
CC {ECO:0000256|ARBA:ARBA00009865}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ORY77866.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MCOG01000018; ORY77866.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1Y2F3F9; -.
DR STRING; 1754190.A0A1Y2F3F9; -.
DR OrthoDB; 5470935at2759; -.
DR Proteomes; UP000193920; Unassembled WGS sequence.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR CDD; cd04084; CBM6_xylanase-like; 1.
DR CDD; cd09003; GH43_XynD-like; 1.
DR CDD; cd00161; RICIN; 1.
DR Gene3D; 2.80.10.50; -; 2.
DR Gene3D; 3.90.1220.10; Cellulose docking domain, dockering; 2.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR InterPro; IPR002883; CBM10/Dockerin_dom.
DR InterPro; IPR006584; Cellulose-bd_IV.
DR InterPro; IPR009034; Dockerin_dom_fun_sf.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR006710; Glyco_hydro_43.
DR InterPro; IPR023296; Glyco_hydro_beta-prop_sf.
DR InterPro; IPR035992; Ricin_B-like_lectins.
DR InterPro; IPR000772; Ricin_B_lectin.
DR PANTHER; PTHR43772; ENDO-1,4-BETA-XYLANASE; 1.
DR PANTHER; PTHR43772:SF2; PUTATIVE (AFU_ORTHOLOGUE AFUA_2G04480)-RELATED; 1.
DR Pfam; PF02013; CBM_10; 2.
DR Pfam; PF04616; Glyco_hydro_43; 1.
DR Pfam; PF14200; RicinB_lectin_2; 1.
DR SMART; SM00606; CBD_IV; 1.
DR SMART; SM00458; RICIN; 1.
DR SUPFAM; SSF75005; Arabinanase/levansucrase/invertase; 1.
DR SUPFAM; SSF64571; Cellulose docking domain, dockering; 2.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR SUPFAM; SSF50370; Ricin B-like lectins; 1.
DR PROSITE; PS51763; CBM10; 2.
DR PROSITE; PS50231; RICIN_B_LECTIN; 1.
PE 3: Inferred from homology;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023277};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Reference proteome {ECO:0000313|Proteomes:UP000193920};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..775
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5013277028"
FT DOMAIN 686..723
FT /note="CBM10"
FT /evidence="ECO:0000259|PROSITE:PS51763"
FT DOMAIN 734..770
FT /note="CBM10"
FT /evidence="ECO:0000259|PROSITE:PS51763"
FT REGION 484..513
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 656..682
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 775 AA; 82198 MW; 968D7678DDF2FA55 CRC64;
MKTTLFSYFV LVSLTLFKNV FSAGAGSFNG VSPGKSIKST SNHNPTITHR YSADPGVMVY
NGRVYVYATN DGDAATRYNS ENTYAQINTI NVMSSADLVN WMDHGSINAT GGGGAAKWAK
NSWAPAAAWK KINGKDKFFL YFADNASGIG VLTADSPTGP FRDPIGRALI SRQTPNSNVE
WLFDPAVFVD SDGTGYLYYG GGVPKGREAN PNTIRVAKLG GDMTSISGTP ANIDAPWVFE
DSGINKIGNT YVYSYCTNWN GGPYGNARIA YMTSNNPMGK FTFQGTCFNN PGDFFQTTGN
NHHTIIEFKN KYYIFYHAEW LNKQILGGQK GYRTTHVDEL PVNGSKLGNA KGTLTGVSQL
QNVDGSALNY AASFAWQSGI SVKGQGAVTQ VNYGRGAWTG VSNVNLGNAK SITLKASSSG
GATVKICAGS ENGTVLGYVD IPAGGSLQNV SGNLSGASGT KNLFFIASGN LTIESWQLGG
SSGNTANTGN TANGGNASNT NTGNTNTNTD TRTSTSNVVD GWYYIKNTGS NKYLQTVGST
ANSNVEISSF TGSAAQKWKV TKNSEGYITL LNGVGDFNLD VANGKNENGV NILIYIAYGG
EAQQFALSAG SNNSFIITTR VSNGDKALDV YEHRTEDGAN VCQWTTSGKP NQTWVFEKVD
GGSSNNSNNS SNSNNSSSSN NSSNATCWSK ALGYECCKSC SPVYFTDEDG EWGVVGNDWC
GIPASCKTNS AASSCRGAQG YPCCKKSCAI YSTDGDGDWS IENNDWCLID NSICK
//