ID A0A061EXA8_THECC Unreviewed; 626 AA.
AC A0A061EXA8;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 45.
DE SubName: Full=Pentatricopeptide repeat (PPR) superfamily protein {ECO:0000313|EMBL:EOY09680.1};
GN ORFNames=TCM_025073 {ECO:0000313|EMBL:EOY09680.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY09680.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY09680.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SIMILARITY: Belongs to the PPR family. PCMP-H subfamily.
CC {ECO:0000256|ARBA:ARBA00006643}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001883; EOY09680.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061EXA8; -.
DR EnsemblPlants; EOY09680; EOY09680; TCM_025073.
DR Gramene; EOY09680; EOY09680; TCM_025073.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_002706_37_8_1; -.
DR InParanoid; A0A061EXA8; -.
DR OMA; YARANKW; -.
DR Proteomes; UP000026915; Chromosome 5.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0009451; P:RNA modification; IBA:GO_Central.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 3.
DR InterPro; IPR032867; DYW_dom.
DR InterPro; IPR046848; E_motif.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR045215; PPR_prot_At1g15510.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR NCBIfam; TIGR00756; PPR; 4.
DR PANTHER; PTHR24015:SF96; OS01G0848300 PROTEIN; 1.
DR PANTHER; PTHR24015; OS07G0578800 PROTEIN-RELATED; 1.
DR Pfam; PF14432; DYW_deaminase; 1.
DR Pfam; PF20431; E_motif; 1.
DR Pfam; PF01535; PPR; 4.
DR Pfam; PF13041; PPR_2; 1.
DR PROSITE; PS51375; PPR; 4.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 86..120
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 187..217
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 218..252
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 319..353
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT DOMAIN 534..626
FT /note="DYW"
FT /evidence="ECO:0000259|Pfam:PF14432"
SQ SEQUENCE 626 AA; 70468 MW; AA1884BE82F6D534 CRC64;
MNISSNSSSS SKVLNTLRLK NPKLLLLESC KNLSQLKIIH GHMIRTHIIF DIFAASRLIS
LCTDPSFGTA LLDYAFKIFS QIETPNLFIF NALIKGFSAC QNPHQSFHFY TQLLRANILP
DNLSFPFLVR ACAQLESLDM GIQAHGQIIK HGFESNVYVQ NSLVHMYSTC GDIKAANAIF
QRMTFLNVVS WTSMIAGLNK VGDVEMARKL FDTMPEKNLV TWSIMISGYA KNSYFEKAVE
LFQVLQEEGV QANETVMVSV ISSCAHLGAI ELGEKAHEYI FRNNLSLNVI LGTALVDMYA
RCGSIEKAIG VFEELPERDV LSWTALIAGL AMHGYAERAL WFFSEMVKSG LKPRDISFTA
VLSACSHGGL VGKGLELFGS MKRDFGIEPR LEHYGCVVDL LGRAGKLAEA EKFVLEMPVK
PNAPIWGALL GACRIHRNAE IAERVGKILI PLLPEHSGYY VLLSNIYART NRWENVESMR
QMMKEKGVKK PPGYSLIEVD GKVHNFTMGD KSHPEIDMIE RTWEAILKKI RLAGYSGNTS
DALFDIDEEE KESALYRHSE KLAIAFGIMR TKASMPIRIV KNLRVCEDCH TATKLISKVF
ERELIVRDRN RFHHFRHGTC SCMDYW
//