ID A0A267GEE4_9PLAT Unreviewed; 684 AA.
AC A0A267GEE4;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 24-JAN-2024, entry version 16.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:PAA83669.1};
GN ORFNames=BOX15_Mlig019851g3 {ECO:0000313|EMBL:PAA83669.1};
OS Macrostomum lignano.
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes;
OC Rhabditophora; Macrostomorpha; Macrostomida; Macrostomidae; Macrostomum.
OX NCBI_TaxID=282301 {ECO:0000313|EMBL:PAA83669.1, ECO:0000313|Proteomes:UP000215902};
RN [1] {ECO:0000313|EMBL:PAA83669.1, ECO:0000313|Proteomes:UP000215902}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DV1 {ECO:0000313|EMBL:PAA83669.1};
RC TISSUE=Whole organism {ECO:0000313|EMBL:PAA83669.1};
RA Berezikov E.;
RT "A platform for efficient transgenesis in Macrostomum lignano, a flatworm
RT model organism for stem cell research.";
RL Submitted (JUN-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 35 family.
CC {ECO:0000256|ARBA:ARBA00009809, ECO:0000256|RuleBase:RU003679}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PAA83669.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NIVC01000412; PAA83669.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A267GEE4; -.
DR STRING; 282301.A0A267GEE4; -.
DR Proteomes; UP000215902; Unassembled WGS sequence.
DR GO; GO:0004565; F:beta-galactosidase activity; IEA:UniProtKB-EC.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR InterPro; IPR026283; B-gal_1-like.
DR InterPro; IPR048912; BetaGal1-like_ABD1.
DR InterPro; IPR048913; BetaGal_gal-bd.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR031330; Gly_Hdrlase_35_cat.
DR InterPro; IPR001944; Glycoside_Hdrlase_35.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR PANTHER; PTHR23421:SF165; BETA-GALACTOSIDASE; 1.
DR PANTHER; PTHR23421; BETA-GALACTOSIDASE RELATED; 1.
DR Pfam; PF21317; BetaGal_ABD_1; 1.
DR Pfam; PF21467; BetaGal_gal-bd; 1.
DR Pfam; PF01301; Glyco_hydro_35; 1.
DR PIRSF; PIRSF006336; B-gal; 1.
DR PRINTS; PR00742; GLHYDRLASE35.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
PE 3: Inferred from homology;
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Reference proteome {ECO:0000313|Proteomes:UP000215902};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..684
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012899119"
FT DOMAIN 41..357
FT /note="Glycoside hydrolase 35 catalytic"
FT /evidence="ECO:0000259|Pfam:PF01301"
FT DOMAIN 404..533
FT /note="Beta-galactosidase 1-like first all-beta"
FT /evidence="ECO:0000259|Pfam:PF21317"
FT DOMAIN 560..619
FT /note="Beta-galactosidase galactose-binding"
FT /evidence="ECO:0000259|Pfam:PF21467"
FT ACT_SITE 190
FT /note="Proton donor"
FT /evidence="ECO:0000256|PIRSR:PIRSR006336-1"
FT ACT_SITE 269
FT /note="Nucleophile"
FT /evidence="ECO:0000256|PIRSR:PIRSR006336-1"
SQ SEQUENCE 684 AA; 75930 MW; 04CA84729C2FEFE0 CRC64;
MARKIDYSAL LFAAWFISFS IAVQHDSSVQ RSFIIDYDNN TFLKDGQPFQ YISGSIHYFR
LLPDSWDDRL AKMRAAGLNA VQIYVPWNFH MPRPGQHRFT GSADLDLFLA LAQKHDLLAI
VRAGPYICAE WDFGGLPAWL LQRNASMVMR SSDPQFLAAV DEWMAVLLPI IARRLYSSGG
NVIMVQLENE YGSTGQCDLA YIRHLYQLGR RYLGNQTVLF STDNTLTCVA KERQFYATID
FGVDGTRSLK DTFAPLRALQ PKGPLVNSEY YAGWLDHWSS PHVRVGAQLV ADGLARMLAY
GVNVNIYMFF GGTNFGYWNG ANSPPFEPQP TSYDYDAPIA ENGDTRYKYQ LIRQVIKDAT
GRQPSPVPRN SSASSFGRVK LQRSASLLGN LDSLIVNRLD GKLPVTMESL GQFQGFALYG
TQISEDFKAV APSLTSSYSL RVEKVRDFVY AFASSSSKPS LGQFLGFISY AESAAVLNFT
ASDLSSIRNI WLLVENCGHV NSGPDIQGNV KGIIGQVTLN GAPLNGWRSY SLDFDRLAAK
LPYAKRLAAS EEPQAAQASL YMGRLILDKN NLTDTYANLS TFCKGQLWIN GFNLGRYWPC
AGPQVTLYVP RAVLREGSNS VAVLELLQAP CLSDSSCRID FLREPDVGFR SKSVADSAIS
LRFRGKFAVP SSADQLPSVD DERI
//