ID V4MXF9_EUTSA Unreviewed; 540 AA.
AC V4MXF9;
DT 22-JAN-2014, integrated into UniProtKB/TrEMBL.
DT 22-JAN-2014, sequence version 1.
DT 27-MAR-2024, entry version 27.
DE RecName: Full=Thioglucosidase {ECO:0008006|Google:ProtNLM};
GN ORFNames=EUTSA_v10002470mg {ECO:0000313|EMBL:ESQ37141.1};
OS Eutrema salsugineum (Saltwater cress) (Sisymbrium salsugineum).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Eutremeae; Eutrema.
OX NCBI_TaxID=72664 {ECO:0000313|EMBL:ESQ37141.1, ECO:0000313|Proteomes:UP000030689};
RN [1] {ECO:0000313|EMBL:ESQ37141.1, ECO:0000313|Proteomes:UP000030689}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23518688; DOI=10.3389/fpls.2013.00046;
RA Yang R., Jarvis D.E., Chen H., Beilstein M.A., Grimwood J., Jenkins J.,
RA Shu S., Prochnik S., Xin M., Ma C., Schmutz J., Wing R.A.,
RA Mitchell-Olds T., Schumaker K.S., Wang X.;
RT "The Reference Genome of the Halophytic Plant Eutrema salsugineum.";
RL Front. Plant Sci. 4:46-46(2013).
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 1 family.
CC {ECO:0000256|ARBA:ARBA00010838, ECO:0000256|RuleBase:RU003690}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KI517609; ESQ37141.1; -; Genomic_DNA.
DR RefSeq; XP_006418705.1; XM_006418642.1.
DR AlphaFoldDB; V4MXF9; -.
DR EnsemblPlants; ESQ37141; ESQ37141; EUTSA_v10002470mg.
DR Gramene; ESQ37141; ESQ37141; EUTSA_v10002470mg.
DR KEGG; eus:EUTSA_v10002470mg; -.
DR Proteomes; UP000030689; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR InterPro; IPR001360; Glyco_hydro_1.
DR InterPro; IPR018120; Glyco_hydro_1_AS.
DR InterPro; IPR033132; Glyco_hydro_1_N_CS.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR PANTHER; PTHR10353; GLYCOSYL HYDROLASE; 1.
DR PANTHER; PTHR10353:SF218; MYROSINASE 1-RELATED; 1.
DR Pfam; PF00232; Glyco_hydro_1; 1.
DR PRINTS; PR00131; GLHYDRLASE1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR PROSITE; PS00572; GLYCOSYL_HYDROL_F1_1; 1.
DR PROSITE; PS00653; GLYCOSYL_HYDROL_F1_2; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Glycosidase {ECO:0000256|RuleBase:RU004468};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU004468};
KW Reference proteome {ECO:0000313|Proteomes:UP000030689};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..540
FT /note="Thioglucosidase"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004722696"
FT ACT_SITE 421
FT /note="Nucleophile"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU10055"
SQ SEQUENCE 540 AA; 61474 MW; 42158E56DF5BE115 CRC64;
MKLLGLALVF LLAVATCKAN EEITCEENLP FTCSNTDRLN SKSFGKDFIF GVASSAYQVE
GGRGRGLNTW DAFTHRYPEK AGPDLGNGDT TCESYTRWQK DIDVMDELNA TGYRFSFAWS
RILPKGKVSR GVNPGGLKYY HDLIDGLLAK KITPFVTLYH WDLPQTLQDE YEGFLDRRII
DDFRDYADLC FKEFGGKVKN WITINQLYTV PTRGYAIGTD APGRCSPAVD ERCYGGNSST
EPYIVAHNQL LAHAAVVDLY RTKYKFQGGK IGTVMITRWF LPYDVNDKAS IEATERMKEF
FFGWFMEPLT KGRYPDIMRQ IVGSRLPNFT EAEARSVAGS YDFLGLNYYV TQYVQENKNT
GPPEKHTAMM DADTNASGHI IGPLFAEDKI GGNSYYYPKG IYYTMDHFKT RYGDPLIYIT
ENGISTPSEE TREQAVADSS RIDYLCSHLC FLRKVIKEKR VNVKGYFAWA LGDNYEFCKG
FTVRFGLSYV NWTDLDDRNL KDSGKWFQRF INVTTIKPPS AKQEFLRSSL SFQNKKLADA
//