ID R1BBG2_EMIHU Unreviewed; 616 AA.
AC R1BBG2;
DT 26-JUN-2013, integrated into UniProtKB/TrEMBL.
DT 26-JUN-2013, sequence version 1.
DT 24-JAN-2024, entry version 52.
DE RecName: Full=Glycoside hydrolase family 2 catalytic domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=EMIHUDRAFT_96903 {ECO:0000313|EMBL:EOD07028.1};
OS Emiliania huxleyi (Coccolithophore) (Pontosphaera huxleyi).
OC Eukaryota; Haptista; Haptophyta; Prymnesiophyceae; Isochrysidales;
OC Noelaerhabdaceae; Emiliania.
OX NCBI_TaxID=2903 {ECO:0000313|EMBL:EOD07028.1};
RN [1] {ECO:0000313|EMBL:EOD07028.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CCMP1516 {ECO:0000313|EMBL:EOD07028.1};
RG DOE Joint Genome Institute;
RA Read B., Kegel J., Klute M., Kuo A., Lefebvre S.C., Maumus F., Mayer C.,
RA Miller J., Allen A., Bidle K., Borodovsky M., Bowler C., Brownlee C.,
RA Claverie J.-M., Cock M., De Vargas C., Elias M., Frickenhaus S.,
RA Gladyshev V.N., Gonzalez K., Guda C., Hadaegh A., Herman E.,
RA Iglesias-Rodriguez D., Jones B., Lawson T., Leese F., Lin Y.-C.,
RA Lindquist E., Lobanov A., Lucas S., Malik S.-H.B., Marsh M.E., Mock T.,
RA Monier A., Moreau H., Mueller-Roeber B., Napier J., Ogata H., Parker M.,
RA Probert I., Quesneville H., Raines C., Rensing S., Riano-Pachon D.M.,
RA Richier S., Rokitta S., Salamov A., Sarno A.F., Schmutz J., Schroeder D.,
RA Shiraiwa Y., Soanes D.M., Valentin K., Van Der Giezen M., Van Der Peer Y.,
RA Vardi A., Verret F., Von Dassow P., Wheeler G., Williams B., Wilson W.,
RA Wolfe G., Wurch L.L., Young J., Dacks J.B., Delwiche C.F., Dyhrman S.,
RA Glockner G., John U., Richards T., Worden A.Z., Zhang X., Grigoriev I.V.;
RT "Genome variability drives Emilianias global distribution.";
RL Submitted (JUL-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000013827}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CCMP1516 {ECO:0000313|Proteomes:UP000013827};
RX PubMed=23760476; DOI=10.1038/nature12221;
RA Read B.A., Kegel J., Klute M.J., Kuo A., Lefebvre S.C., Maumus F.,
RA Mayer C., Miller J., Monier A., Salamov A., Young J., Aguilar M.,
RA Claverie J.M., Frickenhaus S., Gonzalez K., Herman E.K., Lin Y.C.,
RA Napier J., Ogata H., Sarno A.F., Shmutz J., Schroeder D., de Vargas C.,
RA Verret F., von Dassow P., Valentin K., Van de Peer Y., Wheeler G.,
RA Dacks J.B., Delwiche C.F., Dyhrman S.T., Glockner G., John U., Richards T.,
RA Worden A.Z., Zhang X., Grigoriev I.V., Allen A.E., Bidle K., Borodovsky M.,
RA Bowler C., Brownlee C., Cock J.M., Elias M., Gladyshev V.N., Groth M.,
RA Guda C., Hadaegh A., Iglesias-Rodriguez M.D., Jenkins J., Jones B.M.,
RA Lawson T., Leese F., Lindquist E., Lobanov A., Lomsadze A., Malik S.B.,
RA Marsh M.E., Mackinder L., Mock T., Mueller-Roeber B., Pagarete A.,
RA Parker M., Probert I., Quesneville H., Raines C., Rensing S.A.,
RA Riano-Pachon D.M., Richier S., Rokitta S., Shiraiwa Y., Soanes D.M.,
RA van der Giezen M., Wahlund T.M., Williams B., Wilson W., Wolfe G.,
RA Wurch L.L.;
RT "Pan genome of the phytoplankton Emiliania underpins its global
RT distribution.";
RL Nature 499:209-213(2013).
RN [3] {ECO:0000313|EnsemblProtists:EOD07028}
RP IDENTIFICATION.
RG EnsemblProtists;
RL Submitted (JUL-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 2 family.
CC {ECO:0000256|ARBA:ARBA00007401}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB869421; EOD07028.1; -; Genomic_DNA.
DR RefSeq; XP_005759457.1; XM_005759400.1.
DR STRING; 2903.R1BBG2; -.
DR PaxDb; 2903-EOD07028; -.
DR EnsemblProtists; EOD07028; EOD07028; EMIHUDRAFT_96903.
DR GeneID; 17253303; -.
DR KEGG; ehx:EMIHUDRAFT_96903; -.
DR eggNOG; ENOG502S3MC; Eukaryota.
DR HOGENOM; CLU_006501_0_1_1; -.
DR Proteomes; UP000013827; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR036156; Beta-gal/glucu_dom_sf.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR023232; Glyco_hydro_2_AS.
DR InterPro; IPR006103; Glyco_hydro_2_cat.
DR InterPro; IPR006102; Glyco_hydro_2_Ig-like.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR InterPro; IPR013783; Ig-like_fold.
DR PANTHER; PTHR42732; BETA-GALACTOSIDASE; 1.
DR PANTHER; PTHR42732:SF1; BETA-GALACTOSIDASE (EUROFUNG); 1.
DR Pfam; PF00703; Glyco_hydro_2; 1.
DR Pfam; PF02836; Glyco_hydro_2_C; 1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF49303; beta-Galactosidase/glucuronidase domain; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR PROSITE; PS00608; GLYCOSYL_HYDROL_F2_2; 1.
PE 3: Inferred from homology;
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Reference proteome {ECO:0000313|Proteomes:UP000013827}.
FT DOMAIN 235..317
FT /note="Glycoside hydrolase family 2 immunoglobulin-like
FT beta-sandwich"
FT /evidence="ECO:0000259|Pfam:PF00703"
FT DOMAIN 411..465
FT /note="Glycoside hydrolase family 2 catalytic"
FT /evidence="ECO:0000259|Pfam:PF02836"
FT REGION 29..54
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 616 AA; 65789 MW; F6F26A81C3EB8DFC CRC64;
MLLLCFLSLS APRDEVSFDF GWRFTTGLST HPAQSDEPPP ASADPGLHPA EAQTGYDDSG
WAEVQLPHDG LIAAAASSAA CRDGCSGKSY IPRHVLWYRK AFTLPAEWAG SLLFLDFEGS
FRKTTAHDCG YTPFRVRLDN ATSLRRRLGS GDSAHTIAVF VDPDNGDEGA RSHGSGWWYE
GGGLYRSVAL LRVPPLHIER DGLFAYSNLS WRASGGMEEE GEQEGEQPSG GVLHASAAVA
NAGSSAQTIC VAFSLSPPGG GRLLAAASTA AVSVPPGGRA TVSLALSVAE PEVWSAASPH
LYTVHAAVMQ SGCASSSSSA SASSSSSSAA AASLSDGVVD AVSTTHGFRS LRYDADAGFF
LNQRHFKVRG FCDHNSFAVV GMAVPPRVDL FRAQALRVVV MDENRLFANS SKYVANMGAL
VRRDRNHPSV VIWSFCNEAG CEGWRERGGP RFQEVAHRLD GSRPTLANMF TFDDLLSHTV
DVQGFSHQSR QKLDACHARL PDKPIYMSEC CSCETLRDNP HTACTQKSFN ARCAESNTAT
NASDGAAYAV GDMACGTMVW TLFDYYGEPP VGGFEVSSSY GQYDLCGFPK AAASLAAASW
YRTQWLLGVP DGPDKP
//