ID A0A061G9Y9_THECC Unreviewed; 833 AA.
AC A0A061G9Y9;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 40.
DE SubName: Full=Pentatricopeptide repeat superfamily protein isoform 1 {ECO:0000313|EMBL:EOY26401.1};
GN ORFNames=TCM_027991 {ECO:0000313|EMBL:EOY26401.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY26401.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY26401.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SIMILARITY: Belongs to the PPR family. PCMP-H subfamily.
CC {ECO:0000256|ARBA:ARBA00006643}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001884; EOY26401.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061G9Y9; -.
DR EnsemblPlants; EOY26401; EOY26401; TCM_027991.
DR Gramene; EOY26401; EOY26401; TCM_027991.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_002706_15_1_1; -.
DR InParanoid; A0A061G9Y9; -.
DR OMA; TGWSWVE; -.
DR Proteomes; UP000026915; Chromosome 6.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0009451; P:RNA modification; IBA:GO_Central.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 4.
DR InterPro; IPR032867; DYW_dom.
DR InterPro; IPR046848; E_motif.
DR InterPro; IPR046849; Eplus_motif.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR NCBIfam; TIGR00756; PPR; 7.
DR PANTHER; PTHR47928:SF113; DYW DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR47928; REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED; 1.
DR Pfam; PF14432; DYW_deaminase; 1.
DR Pfam; PF20431; E_motif; 1.
DR Pfam; PF20430; Eplus_motif; 1.
DR Pfam; PF01535; PPR; 5.
DR Pfam; PF13041; PPR_2; 3.
DR SUPFAM; SSF48452; TPR-like; 1.
DR PROSITE; PS51375; PPR; 5.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 143..173
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 238..272
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 342..376
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 444..474
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 475..509
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT DOMAIN 685..761
FT /note="DYW"
FT /evidence="ECO:0000259|Pfam:PF14432"
SQ SEQUENCE 833 AA; 94451 MW; 7C657EBAD8D10CC3 CRC64;
MAKGDWNDLT CHVWEHIASK ISNQTQVKQL HAQLIQNSLH HHYSWVALLI NACMRLRAPL
SYTRTILHYS TATPSPDIYA FISALEYYYT LPVCKEQEVA SLCHQLLAST NKPAALYPIL
IKSSVKAGIL FHSHLVKLGH HHDPHTRNAL MDSYAKFGPI EAARKLFDEM PGRMAEDWNS
MISGYWKWGK EAEACCLFNL MPENKRNVVT WTAMVTGSAN MKDLITARRY FDRMPRRNVV
SWNAMLSGYA KNGFAKEALH LFLHMIKAGD GIEPNQITWV AVISSCSSLA DPCLADSVVK
FLDKKKIQLN SYLKTALLDM HAKCGNLETA QKIFDEFGEH RSCTTWNAMI SAYMRFGNLA
LARELFDKMP VRNVVSWNSM IAGFAQNGQP AMAIQLFKEM IATTNLKPDE VTMVSVISVC
GQLGALEMGN WVVNFIVENQ IKLSISGYNT LIFMYSKCGS MKDAERIFQE MKRRDTISYN
ALVSGFGAHG RGIEAVELMS RMRKEGIEPD HITYIGVLTA CSHARLLKEG RRVFESIKFP
AVDHYACMVD LLGRVGELDE AKRLIDHMPM EPHAGIYGSL LNASTIHKRV ELGEFAANKL
FELEPSNSGN YVLLSNIYAS AARWGDVDWV REAMRKLGVK KTTGWSWVEH DGKVHKFIVG
DRSHERSDDI YRLLEELCRK MGRLGYIANK SCVLRDVEDE EKEEMVGTHS EKLAVCFALL
VSEVGAVVRV VKNLRVCQDC HTAMKMISML EGREIIMRDN NSKFYVIVLI DIKHRRNDFD
MLLQPCLYLG EYMVKSGYSC VMDSLIIQGK GEASFGERIQ FRRVYEEQVF PAE
//