ID A0A061GM92_THECC Unreviewed; 592 AA.
AC A0A061GM92;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 37.
DE SubName: Full=Pentatricopeptide repeat superfamily protein, putative {ECO:0000313|EMBL:EOY30252.1};
GN ORFNames=TCM_037525 {ECO:0000313|EMBL:EOY30252.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY30252.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY30252.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Livingstone D. III,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001887; EOY30252.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061GM92; -.
DR EnsemblPlants; EOY30252; EOY30252; TCM_037525.
DR Gramene; EOY30252; EOY30252; TCM_037525.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_002706_49_0_1; -.
DR InParanoid; A0A061GM92; -.
DR OMA; SFCHMGL; -.
DR Proteomes; UP000026915; Chromosome 9.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 6.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR NCBIfam; TIGR00756; PPR; 9.
DR PANTHER; PTHR47932; ATPASE EXPRESSION PROTEIN 3; 1.
DR PANTHER; PTHR47932:SF92; OS01G0908800 PROTEIN; 1.
DR Pfam; PF01535; PPR; 4.
DR Pfam; PF13041; PPR_2; 4.
DR SUPFAM; SSF81901; HCP-like; 1.
DR PROSITE; PS51375; PPR; 10.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 152..186
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 222..256
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 257..291
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 292..327
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 328..362
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 363..397
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 398..432
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 433..467
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 468..502
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 503..537
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
SQ SEQUENCE 592 AA; 66317 MW; D3B3EEA0E6D5A380 CRC64;
MKSREFNQKL NFESGVPNKK FGHFQVEIFG YLVKPRNGLG LQMTLFSFTT RASRVRAASK
VFIPHFHIQF HGGPHPQGNK EVKAIQKHEA WFVKVVCTLF VYSQPLDDSC LSYLSKNLTP
LIEFEVVKWL NNPALGLKFL EFSRVNFNIA HSFWTYNLLM RSFCHMGLHD SAKLVFDYMR
IDGHLPDTTI LGFMISSFGR AGEFGMAKKL LADVQSDEVV ISIFALNNLL NMMVKQNKLE
EAVSLYKENL GSNFYPDAWT FNILIRGLCR VGKVDQAFEL FNDMGSFGCF PDIVTYNTII
NGLCKVNEVD RGHKLLNQVQ SRDDCSPDVV TYTSVISGYC KLGKMDEASA LFHEMISSGT
VPTVVTFNVL IDGFGKVGDM VSAKSMYEQM ASFGCIADVV TFTSLIDGYC RIGDVNQSLQ
LWNTMKGRDL SPNVYTFAIT INALCKENRL HEARGFLREL QCRNIVPKPF IFNPVIDGFC
KAGNLDEANL IVAEMEEKQC HPDKVTFTIL IIGHCMKGRM FEAISIFNKM LSVGCTPDDV
TVNSLISCLL KAGMPSEASR ITKMASEDMK LGSSLLENNS PLRINRGVPV AA
//