ID A0A061EPB2_THECC Unreviewed; 797 AA.
AC A0A061EPB2;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 43.
DE SubName: Full=Glycosyl hydrolase family protein {ECO:0000313|EMBL:EOY06890.1};
GN ORFNames=TCM_021476 {ECO:0000313|EMBL:EOY06890.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY06890.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY06890.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001882; EOY06890.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061EPB2; -.
DR STRING; 3641.A0A061EPB2; -.
DR EnsemblPlants; EOY06890; EOY06890; TCM_021476.
DR Gramene; EOY06890; EOY06890; TCM_021476.
DR eggNOG; ENOG502QR4D; Eukaryota.
DR HOGENOM; CLU_004542_5_3_1; -.
DR InParanoid; A0A061EPB2; -.
DR OMA; WTYLQPP; -.
DR Proteomes; UP000026915; Chromosome 4.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0046556; F:alpha-L-arabinofuranosidase activity; IBA:GO_Central.
DR GO; GO:0009044; F:xylan 1,4-beta-xylosidase activity; IBA:GO_Central.
DR GO; GO:0031222; P:arabinan catabolic process; IBA:GO_Central.
DR GO; GO:0045493; P:xylan catabolic process; IBA:GO_Central.
DR Gene3D; 3.40.50.1700; Glycoside hydrolase family 3 C-terminal domain; 1.
DR Gene3D; 3.20.20.300; Glycoside hydrolase, family 3, N-terminal domain; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR044993; BXL.
DR InterPro; IPR026891; Fn3-like.
DR InterPro; IPR002772; Glyco_hydro_3_C.
DR InterPro; IPR036881; Glyco_hydro_3_C_sf.
DR InterPro; IPR001764; Glyco_hydro_3_N.
DR InterPro; IPR036962; Glyco_hydro_3_N_sf.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR InterPro; IPR013783; Ig-like_fold.
DR PANTHER; PTHR42721:SF11; FN3_LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR42721; SUGAR HYDROLASE-RELATED; 1.
DR Pfam; PF14310; Fn3-like; 1.
DR Pfam; PF00933; Glyco_hydro_3; 1.
DR Pfam; PF01915; Glyco_hydro_3_C; 1.
DR PRINTS; PR00133; GLHYDRLASE3.
DR SMART; SM01217; Fn3_like; 1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF52279; Beta-D-glucan exohydrolase, C-terminal domain; 1.
PE 4: Predicted;
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000313|EMBL:EOY06890.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..797
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5001601295"
FT DOMAIN 716..786
FT /note="Fibronectin type III-like"
FT /evidence="ECO:0000259|SMART:SM01217"
SQ SEQUENCE 797 AA; 88169 MW; 00354B609FA03FD1 CRC64;
MAKKMLFMVL LSHIILCFSV VSSLNLNASL VDDKGIPRFI YVCDPERFNI LGLDMAQFAY
CDKSLPYNVR AKDLVDRLTL VEKAQQMTDN SSVDIPRIGL PHYKWWSEAL HGVAETGDGT
HFDSLVPSAT VFPNVILTTA SFNKTLWKTI GQAVSTEARA MHNLGRAGLT FWSPTINVVR
DPRWGRTLET PGEDPYVVGV YAVNYVRGLQ DIEGQENTSD PNSRPLKVSA CCKHFAAYDL
EEFQGVRRLE FDAKVVTEQD MVETFNRPFE MCVKDGDVSS VMCSFNRVNG IPTCADAYLL
KKLVREDWNL HGYVVADCDS INEIVKNHKW LNDTVEEASA QVLKAGMDLD CGKSYLKLVD
AVKQGLVKEA DMDKSLNYLY VVLMRLGWFD GIPSLASLGK KDMCTEENVE LAAEAAREGI
VLLQNDNETL PLDPAKFKSF ALIGPHANAT DVMKGNYAGF PCKFITPVEG FSAFGQVTYE
LGCLDAKCPN DTTIQSAVDI AKNADATFLF VGLSTAIEAE WRDRKDLLLP ANQTLLVNKA
AEASKGPVIL VIMAATGIDI SFAKTNPKIK SILWVGYPGE QGGRAIAEVV FGMHNPGGRL
PITWYENNYV DKLPMTSMAL RPVGDYPGRT YKFFNGSTVY PFGYGLSYTS FKYEYNSADM
SLDIKLNRLQ HCQGLPYNDT NYKQNCSSVS IDDLTCNDEI TFEITVQNVG SRDGSDAVLV
YSVPPEGIVG TPFKQVVGFE RVYLQANESV NVKFVLNVCQ SLNIVDVSGY RLLPSGLHKI
VVGDNAISIP VKISYSR
//