GenomeNet

Database: UniProt
Entry: A0A061EZ29_THECC
LinkDB: A0A061EZ29_THECC
Original site: A0A061EZ29_THECC 
ID   A0A061EZ29_THECC        Unreviewed;       435 AA.
AC   A0A061EZ29;
DT   03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT   03-SEP-2014, sequence version 1.
DT   24-JAN-2024, entry version 49.
DE   SubName: Full=Eukaryotic aspartyl protease family protein, putative {ECO:0000313|EMBL:EOY10345.1};
GN   ORFNames=TCM_025719 {ECO:0000313|EMBL:EOY10345.1};
OS   Theobroma cacao (Cacao) (Cocoa).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX   NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY10345.1, ECO:0000313|Proteomes:UP000026915};
RN   [1] {ECO:0000313|EMBL:EOY10345.1, ECO:0000313|Proteomes:UP000026915}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX   PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA   Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA   Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA   Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA   Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA   Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA   Kuhn D.N.;
RT   "The genome sequence of the most widely cultivated cacao type and its use
RT   to identify candidate genes regulating pod color.";
RL   Genome Biol. 14:R53.1-R53.24(2013).
CC   -!- SIMILARITY: Belongs to the peptidase A1 family.
CC       {ECO:0000256|ARBA:ARBA00007447}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM001883; EOY10345.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A061EZ29; -.
DR   EnsemblPlants; EOY10345; EOY10345; TCM_025719.
DR   Gramene; EOY10345; EOY10345; TCM_025719.
DR   eggNOG; KOG1339; Eukaryota.
DR   HOGENOM; CLU_005738_1_3_1; -.
DR   InParanoid; A0A061EZ29; -.
DR   OMA; CYKSSES; -.
DR   Proteomes; UP000026915; Chromosome 5.
DR   GO; GO:0005576; C:extracellular region; IBA:GO_Central.
DR   GO; GO:0004190; F:aspartic-type endopeptidase activity; IBA:GO_Central.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd05476; pepsin_A_like_plant; 1.
DR   Gene3D; 2.40.70.10; Acid Proteases; 2.
DR   InterPro; IPR001969; Aspartic_peptidase_AS.
DR   InterPro; IPR034161; Pepsin-like_plant.
DR   InterPro; IPR033121; PEPTIDASE_A1.
DR   InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR   InterPro; IPR032799; TAXi_C.
DR   InterPro; IPR032861; TAXi_N.
DR   PANTHER; PTHR47967:SF66; ASPARTIC PROTEINASE CDR1-RELATED; 1.
DR   PANTHER; PTHR47967; OS07G0603500 PROTEIN-RELATED; 1.
DR   Pfam; PF14541; TAXi_C; 1.
DR   Pfam; PF14543; TAXi_N; 1.
DR   SUPFAM; SSF50630; Acid proteases; 1.
DR   PROSITE; PS00141; ASP_PROTEASE; 1.
DR   PROSITE; PS51767; PEPTIDASE_A1; 1.
PE   3: Inferred from homology;
KW   Hydrolase {ECO:0000313|EMBL:EOY10345.1};
KW   Protease {ECO:0000313|EMBL:EOY10345.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..28
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           29..435
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5001601513"
FT   DOMAIN          93..428
FT                   /note="Peptidase A1"
FT                   /evidence="ECO:0000259|PROSITE:PS51767"
SQ   SEQUENCE   435 AA;  46351 MW;  8643586652EECEF5 CRC64;
     MAATANTTSM FFIGFAILVL SCFCLIEAQK GGFSVELIHR DSPKSPLYNP LETASNRVAN
     ALRRSFNRAQ RFKPSSISTK AVDADLIADS GEYLMNVSIG TPAFDIVAIA DTGSDLIWTQ
     CKPCSQCFRQ DAPLFDPSKS STFRTFSCSA SQCENLEGSS CSSNNTCRYS VTYGDNSFSN
     GDVAADTLTL PSTTGRPVAF RNTIIGCGHN NDGTFDENTS GIIGLGGGDV SLISQLGTSI
     AGKFSYCLLP LSDAGESNKM NFGTDAIVSG AGVVSTPLTK KFPSTFYFLT LEAVSVGSKR
     IKFTGSSLGT DDGNIIIDSG TTLTLLPEDF YSELESAVAS QIKARRVDGP QGLSLCYDAT
     TDFAVPNITI HFTNADVKLA PLNTFVLVSD TVSCFTFSSL QGFAIYGNLA QMNFLVGYDT
     EKQTVSFKPT DCSKN
//
DBGET integrated database retrieval system