ID A0A061DN17_THECC Unreviewed; 891 AA.
AC A0A061DN17;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE SubName: Full=Pentatricopeptide repeat (PPR) superfamily protein, putative {ECO:0000313|EMBL:EOX93817.1};
GN ORFNames=TCM_002757 {ECO:0000313|EMBL:EOX93817.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOX93817.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOX93817.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SIMILARITY: Belongs to the PPR family. P subfamily.
CC {ECO:0000256|ARBA:ARBA00007626}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001879; EOX93817.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061DN17; -.
DR STRING; 3641.A0A061DN17; -.
DR EnsemblPlants; EOX93817; EOX93817; TCM_002757.
DR Gramene; EOX93817; EOX93817; TCM_002757.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_002706_49_2_1; -.
DR InParanoid; A0A061DN17; -.
DR OMA; YVECLME; -.
DR Proteomes; UP000026915; Chromosome 1.
DR GO; GO:0003729; F:mRNA binding; IBA:GO_Central.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 7.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR NCBIfam; TIGR00756; PPR; 14.
DR PANTHER; PTHR45613:SF482; OS08G0300700 PROTEIN; 1.
DR PANTHER; PTHR45613; PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN; 1.
DR Pfam; PF01535; PPR; 4.
DR Pfam; PF12854; PPR_1; 2.
DR Pfam; PF13041; PPR_2; 4.
DR Pfam; PF13812; PPR_3; 1.
DR PROSITE; PS51375; PPR; 11.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 258..292
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 293..327
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 328..362
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 363..397
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 398..432
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 433..467
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 468..502
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 503..537
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 573..607
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 608..642
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 643..677
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
SQ SEQUENCE 891 AA; 99034 MW; 6EAACE16740E8B79 CRC64;
MFLTKRLVVK LPRGFPALIS ASILYSFSAV APKDAAHHAS SLINHPNWKT NQTLKSLVSH
MNPRVAAQVI LLQNDNASLA LQFFRWVCQH STYCYPITGR IHLLNLLIFS HSFQIAHKAI
IDLIKNCSTC ENDLLKLMEA LDEMRKTGFR LNYPCYSILL VSLAKLNMGV LASSVYKRMV
AEGFVLSAID YRTIINALSK IGFVCQAEMF ISKALKLGFG LGTHISTSLV LGYCRQNDLR
EAFRVLDVMS KRDGCGANSV TYSILIHGLC EVGRVEEAFS LKEGMKEKGC QPSTRTYTVL
VKALCDNGLI GKAFDLVGEM SGKGCKPNVY TYTVLIDALC REGKIEDANG MFRQMLKEDV
YPGIVTYNAL INGYCKEGKI ISAFELLSLM EKRNCKPNIR TYNELIEGLC KINRPYKAML
LLGKIVDNGL LPNSITYNIL IDGFCKEGHF YMASKIFELM NSLGVNPDGH SYTAIIDGLC
KQGSLKLANG LWGKMIKKGI NPDEVTFTAL MDGFCKIGNT GDASKLFKMM IVNGCLKTCH
AFNVFLHILS KECKLTEEYA FFGKILKHGL VPSVVTYTIL VGALFQAGKV EQSLSMLKLM
KQVGCPPNVY TYTVVVNGLC QIGRVDDAER ILHLMFDLGV PPNHVTYTIL VKAHVNAGRL
NRALDITSFM VKNGYEPNCH IYSALLAGFV SSNKVTKAGS SSFISPLDFG SPPTAENYDE
CVSRNVLKEM DLDHALKLRA EIEKFGGSVL DFYNFLIVGL CKGGRIVVAE HLTKDILKDG
LYPDKACFSI IDWHSKNSNC NECLEVLDLI LSHGFLPSFA SYCSVIHCMR NKGKIKEAQR
LFSDLMKDNS IGEAKAVLPH IEFLVNCDEP EKCIEHLKLI EQMANRERPV I
//