ID A0A1Y1VUU1_9FUNG Unreviewed; 473 AA.
AC A0A1Y1VUU1;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE RecName: Full=Carbohydrate esterase 2 N-terminal domain-containing protein {ECO:0000259|Pfam:PF17996};
GN ORFNames=BCR32DRAFT_272858 {ECO:0000313|EMBL:ORX65059.1};
OS Anaeromyces robustus.
OC Eukaryota; Fungi; Fungi incertae sedis; Chytridiomycota;
OC Chytridiomycota incertae sedis; Neocallimastigomycetes; Neocallimastigales;
OC Neocallimastigaceae; Anaeromyces.
OX NCBI_TaxID=1754192 {ECO:0000313|EMBL:ORX65059.1, ECO:0000313|Proteomes:UP000193944};
RN [1] {ECO:0000313|EMBL:ORX65059.1, ECO:0000313|Proteomes:UP000193944}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S4 {ECO:0000313|EMBL:ORX65059.1,
RC ECO:0000313|Proteomes:UP000193944};
RG DOE Joint Genome Institute;
RA Haitjema C.H., Gilmore S.P., Henske J.K., Solomon K.V., De Groot R.,
RA Kuo A., Mondo S.J., Salamov A.A., Labutti K., Zhao Z., Chiniquy J.,
RA Barry K., Brewer H.M., Purvine S.O., Wright A.T., Boxma B., Van Alen T.,
RA Hackstein J.H., Baker S.E., Grigoriev I.V., O'Malley M.A.;
RT "A Parts List for Fungal Cellulosomes Revealed by Comparative Genomics.";
RL Submitted (AUG-2016) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ORX65059.1, ECO:0000313|Proteomes:UP000193944}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S4 {ECO:0000313|EMBL:ORX65059.1,
RC ECO:0000313|Proteomes:UP000193944};
RG DOE Joint Genome Institute;
RA Mondo S.J., Dannebaum R.O., Kuo R.C., Labutti K., Haridas S., Kuo A.,
RA Salamov A., Ahrendt S.R., Lipzen A., Sullivan W., Andreopoulos W.B.,
RA Clum A., Lindquist E., Daum C., Ramamoorthy G.K., Gryganskyi A., Culley D.,
RA Magnuson J.K., James T.Y., O'Malley M.A., Stajich J.E., Spatafora J.W.,
RA Visel A., Grigoriev I.V.;
RT "Pervasive Adenine N6-methylation of Active Genes in Fungi.";
RL Submitted (AUG-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ORX65059.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MCFG01000481; ORX65059.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1Y1VUU1; -.
DR OrthoDB; 23761at2759; -.
DR Proteomes; UP000193944; Unassembled WGS sequence.
DR GO; GO:0052689; F:carboxylic ester hydrolase activity; IEA:InterPro.
DR CDD; cd01831; Endoglucanase_E_like; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 3.40.50.1110; SGNH hydrolase; 1.
DR InterPro; IPR040794; CE2_N.
DR InterPro; IPR037461; CtCE2-like_dom.
DR InterPro; IPR001087; GDSL.
DR InterPro; IPR036514; SGNH_hydro_sf.
DR PANTHER; PTHR37834; GDSL-LIKE LIPASE/ACYLHYDROLASE DOMAIN PROTEIN (AFU_ORTHOLOGUE AFUA_2G00620); 1.
DR PANTHER; PTHR37834:SF2; PUTATIVE-RELATED; 1.
DR Pfam; PF17996; CE2_N; 1.
DR Pfam; PF00657; Lipase_GDSL; 1.
DR SUPFAM; SSF52266; SGNH hydrolase; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000193944};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..17
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 18..473
FT /note="Carbohydrate esterase 2 N-terminal domain-containing
FT protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012530776"
FT DOMAIN 116..220
FT /note="Carbohydrate esterase 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF17996"
SQ SEQUENCE 473 AA; 53620 MW; 298B4AF3224B6F15 CRC64;
MKFNLLLLAA ISNLTLAIPL SNEKNDNPYL VCKKNDFNCK IEQSKICYQD VYSCFKTNSK
KYMDCIKLSN LCSEIWLNNK PEPQPTQTNE QEPQPTQINE PVINSFEPIK DNVKIIGRAK
YINDSLWFGQ TDSGIEFKIN GKTVTFVVST DSIYGSLSKE SPARIFIYGD DKLYLDTLTT
ESTMELNVEF DEVGEHTIRF LKVSECLFGS IYIDEIRTDS NVITPTETKT KKIEFIGDSI
TCAFGAMDTE GDFTTTTEDG TKSYAYKVAQ KFNADYSLFA FSGYGVYSGC DFEGIRNTNS
LIPPIYDKLG DLQWNSIHPE NVTHSMNSEE WDSNEFEPDL IIINLGTNDA AYINSVPDND
KREEEKINFT NYYKDFIGQV RSIHSKAEIL CTLGAMGQDL YQEIEVAVDN YLKENNDNKV
NVFPLNLQDI EKNGVGILFH PNALSQVDIA NEIIEKIETL YSWVSDPNVD ISE
//