ID A0A061F416_THECC Unreviewed; 500 AA.
AC A0A061F416;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 49.
DE SubName: Full=Aspartic proteinase {ECO:0000313|EMBL:EOY11771.1};
GN ORFNames=TCM_026840 {ECO:0000313|EMBL:EOY11771.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY11771.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY11771.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SIMILARITY: Belongs to the peptidase A1 family.
CC {ECO:0000256|ARBA:ARBA00007447, ECO:0000256|RuleBase:RU000454}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001883; EOY11771.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061F416; -.
DR STRING; 3641.A0A061F416; -.
DR EnsemblPlants; EOY11771; EOY11771; TCM_026840.
DR Gramene; EOY11771; EOY11771; TCM_026840.
DR eggNOG; KOG1339; Eukaryota.
DR HOGENOM; CLU_013253_3_1_1; -.
DR InParanoid; A0A061F416; -.
DR OMA; KYDHDAS; -.
DR Proteomes; UP000026915; Chromosome 5.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IBA:GO_Central.
DR GO; GO:0006629; P:lipid metabolic process; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IBA:GO_Central.
DR CDD; cd06098; phytepsin; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 2.
DR Gene3D; 1.10.225.10; Saposin-like; 1.
DR InterPro; IPR001461; Aspartic_peptidase_A1.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR033121; PEPTIDASE_A1.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR033869; Phytepsin.
DR InterPro; IPR007856; SapB_1.
DR InterPro; IPR008138; SapB_2.
DR InterPro; IPR011001; Saposin-like.
DR InterPro; IPR008139; SaposinB_dom.
DR PANTHER; PTHR47966; BETA-SITE APP-CLEAVING ENZYME, ISOFORM A-RELATED; 1.
DR PANTHER; PTHR47966:SF28; PLASMEPSIN X; 1.
DR Pfam; PF00026; Asp; 1.
DR Pfam; PF05184; SapB_1; 1.
DR Pfam; PF03489; SapB_2; 1.
DR PRINTS; PR00792; PEPSIN.
DR SMART; SM00741; SapB; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF47862; Saposin; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 2.
DR PROSITE; PS51767; PEPTIDASE_A1; 1.
DR PROSITE; PS50015; SAP_B; 2.
PE 3: Inferred from homology;
KW Aspartyl protease {ECO:0000256|ARBA:ARBA00022750,
KW ECO:0000256|RuleBase:RU000454};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157,
KW ECO:0000256|PIRSR:PIRSR601461-2};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU000454};
KW Protease {ECO:0000256|RuleBase:RU000454};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..500
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5001597594"
FT DOMAIN 76..497
FT /note="Peptidase A1"
FT /evidence="ECO:0000259|PROSITE:PS51767"
FT DOMAIN 306..346
FT /note="Saposin B-type"
FT /evidence="ECO:0000259|PROSITE:PS50015"
FT DOMAIN 370..411
FT /note="Saposin B-type"
FT /evidence="ECO:0000259|PROSITE:PS50015"
FT ACT_SITE 94
FT /evidence="ECO:0000256|PIRSR:PIRSR601461-1"
FT ACT_SITE 281
FT /evidence="ECO:0000256|PIRSR:PIRSR601461-1"
FT DISULFID 107..113
FT /evidence="ECO:0000256|PIRSR:PIRSR601461-2"
SQ SEQUENCE 500 AA; 54464 MW; D07B5C9306ABDB5A CRC64;
MGHKLLQMTF CLWAITCLLL PSPSVGLSRI TLKKQRLDLQ GIKAARIAMH GEDMLHNFGS
SDGEVMPLKN YLDAQYYGVI GIGSPPQNFT VIFDTGSSNL WVPSSKCYFS IACYFHSKYK
SSRSSTYTKI GKSCEINYGS GSISGFLSQD NVKVGGLVVK DQVFIEATRE GSLTFALAKF
DGILGLGFQE ISVGNATPVW YNMLNQDLVR EDVFSFWLNR DPLAQVGGEI VFGGVDPKHY
KGKHTYVPVS RKGYWQFDMG DFLIGNHSTG VCETGCAAIV DSGTSLLAGP TTVVAEINQA
IGARGVVSAE CKEVVSQYGD LIWQLLVSGV LPDKVCTQIG LCPLKGVQSM STGIETVVDK
KNMEGLSAGD KVLCTACEMT VIWIQSQLRQ KETKDRVLNY VNELCESLPS PMGESAIDCA
KISEMPHITF TIGDKPFKLT PEQYVLKTGE DITTVCLSGF TALDVPPPRG PLWILGDVFM
GVYHTVFDYG NLEIGFAEAA
//