ID F0XZ41_AURAN Unreviewed; 1811 AA.
AC F0XZ41;
DT 03-MAY-2011, integrated into UniProtKB/TrEMBL.
DT 03-MAY-2011, sequence version 1.
DT 27-MAR-2024, entry version 48.
DE SubName: Full=Uncharacterized protein GLB1 {ECO:0000313|EMBL:EGB12414.1};
GN Name=GLB1 {ECO:0000313|EMBL:EGB12414.1};
GN ORFNames=AURANDRAFT_61114 {ECO:0000313|EMBL:EGB12414.1};
OS Aureococcus anophagefferens (Harmful bloom alga).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Pelagophyceae; Pelagomonadales;
OC Aureococcus.
OX NCBI_TaxID=44056 {ECO:0000313|Proteomes:UP000002729};
RN [1] {ECO:0000313|EMBL:EGB12414.1, ECO:0000313|Proteomes:UP000002729}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP 1984 {ECO:0000313|Proteomes:UP000002729};
RX PubMed=21368207; DOI=10.1073/pnas.1016106108;
RA Gobler C.J., Berry D.L., Dyhrman S.T., Wilhelm S.W., Salamov A.,
RA Lobanov A.V., Zhang Y., Collier J.L., Wurch L.L., Kustka A.B., Dill B.D.,
RA Shah M., VerBerkmoes N.C., Kuo A., Terry A., Pangilinan J., Lindquist E.A.,
RA Lucas S., Paulsen I.T., Hattenrath-Lehmann T.K., Talmage S.C., Walker E.A.,
RA Koch F., Burson A.M., Marcoval M.A., Tang Y.Z., Lecleir G.R., Coyne K.J.,
RA Berg G.M., Bertrand E.M., Saito M.A., Gladyshev V.N., Grigoriev I.V.;
RT "Niche of harmful alga Aureococcus anophagefferens revealed through
RT ecogenomics.";
RL Proc. Natl. Acad. Sci. U.S.A. 108:4352-4357(2011).
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 2 family.
CC {ECO:0000256|ARBA:ARBA00007401}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL833121; EGB12414.1; -; Genomic_DNA.
DR RefSeq; XP_009033447.1; XM_009035199.1.
DR EnsemblProtists; EGB12414; EGB12414; AURANDRAFT_61114.
DR GeneID; 20223323; -.
DR KEGG; aaf:AURANDRAFT_61114; -.
DR eggNOG; KOG2024; Eukaryota.
DR eggNOG; KOG2797; Eukaryota.
DR eggNOG; KOG4475; Eukaryota.
DR InParanoid; F0XZ41; -.
DR OrthoDB; 5487253at2759; -.
DR Proteomes; UP000002729; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR CDD; cd04905; ACT_CM-PDT; 1.
DR Gene3D; 3.30.70.260; -; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 3.
DR InterPro; IPR045865; ACT-like_dom_sf.
DR InterPro; IPR036156; Beta-gal/glucu_dom_sf.
DR InterPro; IPR032311; DUF4982.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR040605; Glyco_hydro2_dom5.
DR InterPro; IPR023232; Glyco_hydro_2_AS.
DR InterPro; IPR006103; Glyco_hydro_2_cat.
DR InterPro; IPR006102; Glyco_hydro_2_Ig-like.
DR InterPro; IPR006104; Glyco_hydro_2_N.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR025924; YHYH_dom.
DR PANTHER; PTHR42732; BETA-GALACTOSIDASE; 1.
DR PANTHER; PTHR42732:SF1; BETA-MANNOSIDASE; 1.
DR Pfam; PF16355; DUF4982; 1.
DR Pfam; PF18565; Glyco_hydro2_C5; 1.
DR Pfam; PF00703; Glyco_hydro_2; 1.
DR Pfam; PF02836; Glyco_hydro_2_C; 2.
DR Pfam; PF02837; Glyco_hydro_2_N; 1.
DR Pfam; PF14240; YHYH; 1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF55021; ACT-like; 1.
DR SUPFAM; SSF49303; beta-Galactosidase/glucuronidase domain; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR PROSITE; PS00608; GLYCOSYL_HYDROL_F2_2; 1.
PE 3: Inferred from homology;
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Reference proteome {ECO:0000313|Proteomes:UP000002729};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..17
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 18..1811
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003264288"
FT DOMAIN 200..256
FT /note="YHYH"
FT /evidence="ECO:0000259|Pfam:PF14240"
FT DOMAIN 1002..1171
FT /note="Glycosyl hydrolases family 2 sugar binding"
FT /evidence="ECO:0000259|Pfam:PF02837"
FT DOMAIN 1208..1297
FT /note="Glycoside hydrolase family 2 immunoglobulin-like
FT beta-sandwich"
FT /evidence="ECO:0000259|Pfam:PF00703"
FT DOMAIN 1304..1381
FT /note="Glycoside hydrolase family 2 catalytic"
FT /evidence="ECO:0000259|Pfam:PF02836"
FT DOMAIN 1388..1433
FT /note="Glycoside hydrolase family 2 catalytic"
FT /evidence="ECO:0000259|Pfam:PF02836"
FT DOMAIN 1615..1673
FT /note="DUF4982"
FT /evidence="ECO:0000259|Pfam:PF16355"
FT DOMAIN 1705..1791
FT /note="Glycoside hydrolase family 2"
FT /evidence="ECO:0000259|Pfam:PF18565"
FT REGION 675..903
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 757..780
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 821..838
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 845..868
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1811 AA; 190499 MW; E70513845E4894DF CRC64;
MAARALVALL AVDVASACWT LDLGHNLNPG SAHATCPGVP NGQVCAPDGS SNYHFWDGAY
HNGIFRGVFQ TNQCSNDKWG YCPLCDPPQY LVYANHSANC VNQTVPAPQY DPKDGPHGAQ
LRGRVGMTLA GVNIYGPEEA GFGFGPNPMP CERTARRFDR FPVDEPRSTD GTGYCYAGVD
VPTCESALDI MCSATGSKVV HELMLDSCGG HAFPYHYHND NACDYDHSLK QHSVLIGIAL
DGHGIYGLYE SYNETSEEQV TPDDLDACNG HTHGVPANAT YGVDEAAVYA RTGIYPIYTP
ESPEGYCYDA DCPCFEKSND RYGRNSNLTV AECSCYNVCD EVSNTDCGSV SCDAMLASGD
FHCADDFCPT CPKKNWCDKS CGFGACADPA FDLLLQHDLR IYGEIVVTEG PLDYTRFLVL
STTDAAPPLA RPKTSLAFAV IDRPGALAEA LGEFGRRQGS KRERNSQLQR LISRPFSTRR
RGINLLKLES RPRRRTTGAG FTYVFFLGFE GTLRDEPCVA AVKALLERCA FVRFLGSYDA
APPVEREGSG NRLVCCESTM STVDPLRSGA IFRCDIEKIR SPLPRAAGHA GAAGAPPPAR
GPAAALMAAP APGGRISSDF ADLIAMSPVP HSVRPPARSL PPPKNAPTKV TAGSLGVLKV
LGLEGSPRAQ ALQGLGAAAE VTPPKAPAPR CDARDFKSAG RTPRPAGGGG LLAGYKAPPA
KKAKKRPAAK APPPTRAPTR PAEATPEKAA RPAAATPPAA ATPPARPPAA ATPPAKKRPA
ATPPAKKRPA ADDYDFPGLP SDDEFGDFVS AAARGPRPAK KRRVAPPEPR KNAPREAASP
KKAKVASPAK KKKAAPASPA KKPKKAAKKT TAPWPTAAPT RKPAAKKKKA PKAPRAPAAA
EVATPVVTTT RSGRRSFGAL DWWRSERIVV RPGADAVVVA GSAAAEDYLG AAKIMMAALL
LLGCLLGGAA RRDGSSMPSA AARRDGNSMP GGVKLYERKI YSIDRSWRFS LCEDDACAQG
SPASCPEGAW CAASLDDDAW RRVDVPHDFV VEGTFNNESD MAHGYLPYGV GLYRKALAVS
AAEAKAIRDG SLVAHVEFDG AQTSTTAYLN GSLLGTHGSG YTNFRFPLSG DHAEALAAGA
VLALRVDATQ PDGWWYDGGG IYRHARLLLT PATHLSVLGG AYLPSTVAGP IEAGVADAIV
SPRIAVASTS PTAEPFEVTV TVAEAATGRV VGSASAKGAT TPPVGPSGAN FTTLDLPPVA
VARAALWSPE SPFLYDVTVE VATAASRDVT THSIGVRDVA WKADSGLWLN GAPYKIRGAA
NHQDFAGVGV AVPDALQAFR VRKLKSMGAN AWRTAHNAPT PALLDACDAE GMLVWDETHR
NGNPGEQTRL VLRDRNHPSV VIWSICNEKL CETNDTLGDG KAAAALYREL DPSFGRAVSA
NYNPWRPPDY PGDVVGIDYA TSTYDAEHAK NASMPFISSE TSSAVSDRGE YASDAVAAHV
AGYDTTAPTW GEVAEDAWGG VGEADAQGIL TRPFVSGGFT WTGFDYKGEP TPYSWPNVNS
HFGILDIAGF EKDRFYYYQS VFKPEEPMVH LFPHWNWAVE EKGAHLAECE GLCDGSAVEV
WAFTNGHSAE LTVNGVSQGA KLVPALGHVQ WTVDYAPGAV AVVVRDAQNA TVASDAVETT
GPAAALRLSF KDGVGAGGVD ADCGVALVQV EVVDAEGRVV PTADANVTLA SSAALRFIGG
GNGDPSEHTA DKSASRPAFH GLLLGVFEAT GGAAAAATVT ATAAGVDAAE LDVAIVESDA
ATYWCPRYPA L
//