ID A0A061GQC2_THECC Unreviewed; 1433 AA.
AC A0A061GQC2;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 40.
DE SubName: Full=Calpain-type cysteine protease family isoform 5 {ECO:0000313|EMBL:EOY31681.1};
GN ORFNames=TCM_038725 {ECO:0000313|EMBL:EOY31681.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY31681.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY31681.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SIMILARITY: Belongs to the peptidase C2 family.
CC {ECO:0000256|ARBA:ARBA00007623}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001887; EOY31680.1; -; Genomic_DNA.
DR EMBL; CM001887; EOY31681.1; -; Genomic_DNA.
DR EnsemblPlants; EOY31680; EOY31680; TCM_038725.
DR EnsemblPlants; EOY31681; EOY31681; TCM_038725.
DR Gramene; EOY31680; EOY31680; TCM_038725.
DR Gramene; EOY31681; EOY31681; TCM_038725.
DR HOGENOM; CLU_001987_0_0_1; -.
DR Proteomes; UP000026915; Chromosome 9.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0004198; F:calcium-dependent cysteine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00214; Calpain_III; 1.
DR CDD; cd00044; CysPc; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.120.380; -; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR033883; C2_III.
DR InterPro; IPR022684; Calpain_cysteine_protease.
DR InterPro; IPR022682; Calpain_domain_III.
DR InterPro; IPR022683; Calpain_III.
DR InterPro; IPR036213; Calpain_III_sf.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR001300; Peptidase_C2_calpain_cat.
DR PANTHER; PTHR10183; CALPAIN; 1.
DR PANTHER; PTHR10183:SF379; CALPAIN-A-RELATED; 1.
DR Pfam; PF01067; Calpain_III; 1.
DR Pfam; PF00648; Peptidase_C2; 1.
DR PRINTS; PR00704; CALPAIN.
DR SMART; SM00720; calpain_III; 1.
DR SMART; SM00230; CysPc; 1.
DR SUPFAM; SSF49758; Calpain large subunit, middle domain (domain III); 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS50203; CALPAIN_CAT; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
PE 3: Inferred from homology;
KW Hydrolase {ECO:0000256|PROSITE-ProRule:PRU00239};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Protease {ECO:0000256|PROSITE-ProRule:PRU00239,
KW ECO:0000313|EMBL:EOY31681.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Thiol protease {ECO:0000256|PROSITE-ProRule:PRU00239};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 41..62
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 91..114
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 121..143
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 159..180
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 192..215
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 221..241
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 253..273
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 285..307
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 328..347
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 353..375
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 977..1279
FT /note="Calpain catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50203"
FT ACT_SITE 1043
FT /evidence="ECO:0000256|PIRSR:PIRSR622684-1,
FT ECO:0000256|PROSITE-ProRule:PRU00239"
FT ACT_SITE 1201
FT /evidence="ECO:0000256|PIRSR:PIRSR622684-1,
FT ECO:0000256|PROSITE-ProRule:PRU00239"
FT ACT_SITE 1221
FT /evidence="ECO:0000256|PIRSR:PIRSR622684-1,
FT ECO:0000256|PROSITE-ProRule:PRU00239"
SQ SEQUENCE 1433 AA; 159334 MW; 6EA4F011CD290F44 CRC64;
MVACLSVAIP KWIHNGYQFW VPQVQCVGHA GNHRPPGTKE VVVLTLCITV FAGSVLALGA
IVSAKPLEDL RYKGWTGEQN NFSSPYASSA YLGWAMASAV ALAVTGVLPI ISWFATYRFS
ASSAVCVGIF SVVLVAFCGA SYLKIVKSRD DQVPTTGDFL AALLPLVCIP ALLALCSGLL
KWKDDDWKLS RGVYVFVTIG LLLLLGAISA VIVVIKPWTI GAAFLLVLLL IVLAIGVIHH
WASNNFYLTR TQMFLVCFLA FLLGLAAFFV GWFQDKPFVG ASVGYFSFLF LLAGRALTVL
LSPPIVVYSP RVLPVYVYDA HADCGKNVSA AFLVLYGIAL ATEGWGVVAS LKIYPPFAGA
AVSAVTLVVA FGFAVSRPCL TLKMMEDAVH FLSKDTVVQA IARSATKTRN ALSGTYSAPQ
RSASSAALLV GDPAATLDKG GNFVLPRDDV MKLRDRLRNE ELVAGSFFHR MRYRRRFHHE
PTSDVDYRRE MCAHARILAL EEAIDTEWVY MWDKFGGYLL LLLGLTAKAE RVQDEVRLNL
FLDSIGFSDL SAKKIKKWMP EDRRQFEIIQ ESYIREKEME EEILMQRREE EGRGKERRKA
LLEKEERKWK EIEASLISSI PNAGGREAAA MAAAVRAVGG DSVLEDSFAR ERVSSIARRI
RTAQLARRAL QTGITGAVCI LDDEPTTSGR HCGQIDPSMC QSQKVSFSIA VMIQPESGPV
CLLGTEFQKK VCWEILVAGS EQGIEAGQVG LRLITKGDRQ TTVAKEWSIS ATSIADGRWH
IVTMTIDADI GEATCYLDGG FDGYQTGLPL CVGSSIWEQE TEVWVGVRPP IDMDAFGRSD
SEGAESKMHV MDVFLWGRCL NEDEIASLHA AISLTEFNLI DFPEDNWHWA DSPPRVDEWD
SDPADVDLYD RDDVDWDGQY SSGRKRRSER EGFVVHVDSF ARRYRKPRIE TQEEINQRML
SVELAVKEAL SARGEMHFTD NEFPPNDQSL FIDPGNPPSK LQVVSEWMRP AEIVKEGRLD
SRPCLFSGTA NPSDVCQGRL GDCWFLSAVA VLTEVSRISE VIITPEYNEE GIYTVRFCIQ
GEWVPVVVDD WIPCESPGKP SFATSRKGNE LWVSILEKAY AKLHGSYEAL EGGLVQDALV
DLTGGAGEEI DMRSPQAQID LASGRLWSQM LRFKQEGFLL GAGSPSGSDV HVSSSGIVQG
HAYSLLQVRE VDGHKLVQIR NPWANEVEWN GPWSDTSSEW TDRMRHKLKH VPQSKDGIFW
MSWQDFQIHF RSIYVCRVYP PEMRYSVHGQ WRGYSAGGCQ DYNSWHQNPQ FRLRASGPDA
SYPIHVFITL TQGVSFSRTA AGFRNYQSSH DSLMFYIGMR ILKTRGRRAA YNIYLHESVG
GTDYVNSREI SCEMVLEPDP KGYTIVPTTI HPGEEAPFVL SVFTKASIIL EPL
//