ID A0A061F2V6_THECC Unreviewed; 1072 AA.
AC A0A061F2V6;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 24-JAN-2024, entry version 37.
DE SubName: Full=Pentatricopeptide repeat (PPR) superfamily protein isoform 2 {ECO:0000313|EMBL:EOY11208.1};
GN ORFNames=TCM_026455 {ECO:0000313|EMBL:EOY11208.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY11208.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY11208.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SIMILARITY: Belongs to the PPR family. PCMP-H subfamily.
CC {ECO:0000256|ARBA:ARBA00006643}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001883; EOY11208.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061F2V6; -.
DR EnsemblPlants; EOY11208; EOY11208; TCM_026455.
DR Gramene; EOY11208; EOY11208; TCM_026455.
DR HOGENOM; CLU_002706_15_0_1; -.
DR Proteomes; UP000026915; Chromosome 5.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0009451; P:RNA modification; IEA:InterPro.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 6.
DR InterPro; IPR032867; DYW_dom.
DR InterPro; IPR046848; E_motif.
DR InterPro; IPR046849; Eplus_motif.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR046960; PPR_At4g14850-like_plant.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR NCBIfam; TIGR00756; PPR; 9.
DR PANTHER; PTHR47926; PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR47926:SF339; UMP-CMP KINASE; 1.
DR Pfam; PF14432; DYW_deaminase; 1.
DR Pfam; PF20431; E_motif; 1.
DR Pfam; PF20430; Eplus_motif; 1.
DR Pfam; PF01535; PPR; 4.
DR Pfam; PF13041; PPR_2; 5.
DR Pfam; PF13812; PPR_3; 1.
DR PROSITE; PS51375; PPR; 8.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 158..192
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 260..294
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 361..395
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 462..496
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 532..562
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 563..597
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 664..698
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 765..799
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT DOMAIN 980..1072
FT /note="DYW"
FT /evidence="ECO:0000259|Pfam:PF14432"
SQ SEQUENCE 1072 AA; 120540 MW; DE99126E77B59E4E CRC64;
MFLNQLSLHG TSTYSFFYHF PSLKLKPHQI SFNHSQNFQK YFNRKWSRLR LACFNTNAIS
NSFDELSIEE NEGNSKEVSF LYWMENRGVK ANQQTFLWLL EGCLNSGSIE QGKKLHGKIL
KMGFSKEHVL SEKLMDLHIA SGDLDAAINV FDDMPKRNVF SWNKMISGFI SKKLTNKVLR
FYSRMVVENV NPNERTFAGI LKACSGSNVW FEYVEQIHAR IIRHGFGFSS FVCNPLIDLY
TKNGFIDSAI KVFDKLYVKD SVSWVAMISG LSQNGYEEQA ILLFSEMHIS GICPTPYVFS
SVLSACTKIE FFKLGEQLHS LVFKQGFSSE TYVCNALVTL YSRSGSLVSA EQIFSNMQLR
DGVTYNSLIS GLAQCGYSDR ALELFEKMHH DCLKPDCVTV ASLLGACASL GALYTGKQLH
SYAIKAGFSM DIIVEGSLLD LYLKCSDIET AYEFFSTTET ENVVLWNVML VAYGQLDNLS
ESFHIFRQMQ IEGLVPNQFT YPSILRTCTS LGALDLGEQI HSQVIKTGFQ YNVYVCSVLI
DMYAKLGKLE TALEILRKLP EEDVVSWTAM IAGYTQHDMF YEALELFGEM LNRGIQSDNI
GLSSAISACA GIQALSQGQQ IHAQSFLSGF SDDLSIGNAL VSLYARCSQR QDAYKAFKKI
DNKDNISWNA LISGFTQSGF CEEALQVFSQ MNKAGLEATL YTCISSVSAA ANTANIKQGK
QIHAMIIKKG YDLEIEASNV LITLYAKCGS IDDAKKEFLE IPEKNEVSWN AMITGYSQHG
YGIEAIDLFE KMKQVGVTPN PVTLVGVLSA CSHVGLVDEG LDYFDSMSKE HGLVPKPEHY
ACVVDLLGRA GLLCRARKFV EDMPIEPDAI IWRTLLSACA VHKNVDIGEF AAHHLLKLEP
QDSASYVLLS NLYAVSKKWD SRDQTRQMMK ERGVKKEPAQ SWIEVKNSIH AFFVGDRLHP
LAEKIYEHLE DLNKRAAEIG YVQDRYSRFS DVEQGQKDPT VHIHSEKLAI AFGLLSLPSA
IPVRVIKNLR VCNDCHNWIK FVSKISNQLI IVRDAYRFHH FEGGSCSCRD YW
//