ID A0A061DJ96_THECC Unreviewed; 792 AA.
AC A0A061DJ96;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 37.
DE SubName: Full=Pentatricopeptide repeat-containing protein, putative isoform 1 {ECO:0000313|EMBL:EOX92407.1};
GN ORFNames=TCM_001361 {ECO:0000313|EMBL:EOX92407.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOX92407.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOX92407.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SIMILARITY: Belongs to the PPR family. P subfamily.
CC {ECO:0000256|ARBA:ARBA00007626}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001879; EOX92407.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061DJ96; -.
DR EnsemblPlants; EOX92407; EOX92407; TCM_001361.
DR Gramene; EOX92407; EOX92407; TCM_001361.
DR Proteomes; UP000026915; Chromosome 1.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 7.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR NCBIfam; TIGR00756; PPR; 9.
DR PANTHER; PTHR45613:SF476; MITOCHONDRIAL GROUP I INTRON SPLICING FACTOR CCM1; 1.
DR PANTHER; PTHR45613; PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN; 1.
DR Pfam; PF01535; PPR; 4.
DR Pfam; PF12854; PPR_1; 2.
DR Pfam; PF13041; PPR_2; 5.
DR PROSITE; PS51375; PPR; 11.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 160..194
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 195..229
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 265..299
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 300..334
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 370..404
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 405..439
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 440..474
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 576..610
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 611..645
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 696..730
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 731..765
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
SQ SEQUENCE 792 AA; 89229 MW; 7D2724B30D9D962E CRC64;
MHYALAQLAS SSSDEQETVS IPNMNHNSDH FFELDRVEVV QTLNNLIKQP NKALSFFNQL
NEDGFFHDLC TYTAIVRILC YWGWDRKLDS VLLEIIRKEK RLGFEIMDLC EALEEGLEGE
DSYLLVRLSN ALVKAYVSVE MFDEVINILF QTRRCGFVPH IFSCNFLMNR LIHCGKIDMA
VATYQQLKRI GLKPNDYTYS ILIKALCKKG SLEEAFNVFR EMEEAEVRPN AFAYTTYIEG
LCMHGRTELG YEVLKVCRKA KVPLDPFAYS VVIRGFSKEM KLKVAEDVLF DAENNGVVPD
VTSYGALIRG YCKCGNILKA LDIHHEMVSK GIKTNCVILT SILQSLCQMG LDFKAVNQFK
EFRDIGIFLD EVCHNVIADA LCKGGQVEEA KKLLDEMKGK QISPDVINYT TLINGYCRQG
KVEDAWNLFK EMKNNGHKPD IVFYSVLAGG LARNGHAQKA VDLLNSMEAQ GLKCDTVIHN
MIIKGLCMGD KVKEAENFLD SLPGKCLENY AALVDGYREA CLTKEAFKLF VKLSEQGFLV
TKASCSKLLS SLCMKGDNDK ALMLLKIMFS LNAEPTKLMY CKLIGAFCQA GNLSIAQLLF
NIMIKKGLTP DLVTYTIMIN GYCKVKLLQK ALDLFNNMKE RGIKPDVITY TVLLNSHMKM
NLRSLSNPDV TQKNGKTIMV ASPFWSEMKH MGVEPDVVCY TVLIDQFCKT NNLQDASRIF
DEMIDRGLEP DTVTYTALIS GYFKGGYIDK AVTLVNELLS KGIQPDTHTM LHHCILIAKR
VVRSKHLCDS SG
//