ID A0A061EUV0_THECC Unreviewed; 809 AA.
AC A0A061EUV0;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 24-JAN-2024, entry version 27.
DE SubName: Full=Glycosyl hydrolases family 31 protein isoform 2 {ECO:0000313|EMBL:EOY08860.1};
GN ORFNames=TCM_024111 {ECO:0000313|EMBL:EOY08860.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY08860.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY08860.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 31 family.
CC {ECO:0000256|ARBA:ARBA00007806, ECO:0000256|RuleBase:RU361185}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001883; EOY08860.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061EUV0; -.
DR EnsemblPlants; EOY08860; EOY08860; TCM_024111.
DR Gramene; EOY08860; EOY08860; TCM_024111.
DR Proteomes; UP000026915; Chromosome 5.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR CDD; cd06594; GH31_glucosidase_YihQ; 1.
DR CDD; cd14752; GH31_N; 1.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR Gene3D; 2.60.40.1760; glycosyl hydrolase (family 31); 1.
DR InterPro; IPR011013; Gal_mutarotase_sf_dom.
DR InterPro; IPR000322; Glyco_hydro_31_TIM.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR InterPro; IPR044112; YihQ_TIM-like.
DR PANTHER; PTHR46959; SULFOQUINOVOSIDASE; 1.
DR PANTHER; PTHR46959:SF2; SULFOQUINOVOSIDASE; 1.
DR Pfam; PF01055; Glyco_hydro_31_2nd; 1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF74650; Galactose mutarotase-like; 1.
PE 3: Inferred from homology;
KW Glycosidase {ECO:0000256|RuleBase:RU361185};
KW Hydrolase {ECO:0000256|RuleBase:RU361185, ECO:0000313|EMBL:EOY08860.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT DOMAIN 440..751
FT /note="Glycoside hydrolase family 31 TIM barrel"
FT /evidence="ECO:0000259|Pfam:PF01055"
SQ SEQUENCE 809 AA; 93001 MW; 75D62FB469D52F54 CRC64;
MHNQNSPTLI MSTLKITKKH HKHLNNPFPS TPRYLPSIQG NLFINSQTLP PHQIFPVGKD
FQLLWSTRNG GSISISHQSQ PSKSLWSTIP GQAFMSAALA ETEVEESRGS FVVKDRDVHL
VCQHQTLDDI ILINPFDDKD NDFLPDHLEL DRLKIDSKIA DPPVLVITGH IFSKRKKKRL
QSSGIYKDIK FEKREPAASA RYWVLFDQKN CNQIGFQVKI GQPNFQLLHQ KASPLTASGW
YRRLRRKLGR YRKRKLGWSW VFTRTKGLVT VSSSEEELGE LNVAEPSAEF NRVCFTYASE
GNERFFGFGE QFSRMDFKGK RVPIFVQEQG IGRGDQPITF AANLVSYRAG GDWSTTYAPS
PFYMTSKMRS LYLEGYNYSI FDLTQHDRVQ VQIHGNAIQG RILHGNSPLE IIEHFTEAIG
RPPKLPEWMI SGAVVGMQGG TETVRCVWDK LTTYKVPISV FWLQDWVGQR ETLIGSQLWW
NWEVDTTRYP GWQQLVKDLS THSIKVMTYC NPCLALMDEK PNKRRNLFEE AKELDILVRD
QHGEPYMVPN TAFDVGMLDL THPLTANWFK QILLEMVNDG VRGWMADFGE GLPVDAVLYS
GEDPISAHNR YPELWAQINR EFVEEWKSNH VGNEREDPEE GLVFFMRAGF RNSPRWGMLF
WEGDQMVSWQ ANDGIKSSVV GLLSSGLSGY AFNHSDIGGY CAINLPIIKY HRSEELLLRW
MELNAFTIVF RTHEGNKPSC NSQFYSNDQT LSHFARFAKV YKAWKFYRVQ LVKLLKRAGL
SAVTYFFTTQ MMSRFRGSVT SSSWWAVRS
//