ID A0A061GWC7_THECC Unreviewed; 1159 AA.
AC A0A061GWC7;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 51.
DE SubName: Full=Pentatricopeptide repeat superfamily protein, putative {ECO:0000313|EMBL:EOY31434.1};
GN ORFNames=TCM_038372 {ECO:0000313|EMBL:EOY31434.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY31434.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY31434.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SIMILARITY: Belongs to the PPR family. P subfamily.
CC {ECO:0000256|ARBA:ARBA00007626}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001887; EOY31434.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061GWC7; -.
DR SMR; A0A061GWC7; -.
DR EnsemblPlants; EOY31434; EOY31434; TCM_038372.
DR Gramene; EOY31434; EOY31434; TCM_038372.
DR eggNOG; KOG0919; Eukaryota.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_002706_49_12_1; -.
DR InParanoid; A0A061GWC7; -.
DR OMA; ATTLMHG; -.
DR Proteomes; UP000026915; Chromosome 9.
DR GO; GO:0008168; F:methyltransferase activity; IEA:UniProtKB-KW.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR Gene3D; 3.90.120.10; DNA Methylase, subunit A, domain 2; 1.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 7.
DR Gene3D; 3.40.50.150; Vaccinia Virus protein VP39; 1.
DR InterPro; IPR001525; C5_MeTfrase.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR029063; SAM-dependent_MTases_sf.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR NCBIfam; TIGR00756; PPR; 14.
DR PANTHER; PTHR47934; PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN PET309, MITOCHONDRIAL; 1.
DR PANTHER; PTHR47934:SF6; PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN PET309, MITOCHONDRIAL-RELATED; 1.
DR Pfam; PF00145; DNA_methylase; 2.
DR Pfam; PF01535; PPR; 4.
DR Pfam; PF12854; PPR_1; 2.
DR Pfam; PF13041; PPR_2; 6.
DR SUPFAM; SSF81901; HCP-like; 3.
DR SUPFAM; SSF53335; S-adenosyl-L-methionine-dependent methyltransferases; 1.
DR PROSITE; PS51375; PPR; 16.
PE 3: Inferred from homology;
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT REPEAT 541..575
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 576..610
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 611..645
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 646..680
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 681..715
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 750..784
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 785..819
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 820..854
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 855..889
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 890..924
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 925..959
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 960..994
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 995..1029
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 1030..1064
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 1065..1099
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 1100..1134
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REGION 321..378
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 321..360
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1159 AA; 130323 MW; 9C9882C8B211657F CRC64;
MAESICRGAE EPWRVLEFYS GIGGMRYSLM KAGVNAQVVE AFDINDTAND VYQHNFGHRP
YQGNIQSLTD ADLDSYKAHV WLLSPPCQPY TRQDTRAKMV EILAKSDFVT QELILSPLQF
GVPYSRPRYF CLAKRKPLSF QCQLFNNQLL WSPSPLFGND ENMVIGEYDQ SQENWDKLIE
SCQPIEKFLE FTSSSDQVDV ETSSFGTTDV SANGLETSEE FVGGDAFDFS SIDQFVVPLN
IVYPDSKRCC CFTKSYYRYV KGTGSLLATV QPKRKGKATS LKEQCLRYFT PREVANLHSF
PEDFQFPKHI SLRQRPKYLC SNPQTPLPGN PNPNTNFLEK VTSDSHCPSK SISDSDSPGS
LIPPTPKDPR LTPSLTQDTS LTRTHVINTL LIHRNNPESA LKYFRFVENK RGFVRSIDVF
CVLLHILVGS QQTNKQVKYL LNRFVAGDSG PTPIVFLDHL IDIAKRFDFE LDSRVFNYLL
NSYVRVRIDD AVDCFNGMIE HDIVPMLPFM NILLTALVRG NLIDKARELY DKMVSIGVRG
DRVTVLLMMR AFLKDGKPWE AEEFFKEAKA RGTELDAAVY SIAIQASCQK PDLNMAGGLL
REMRDRGWVP SEGTFTTVIG AFVKQGNLAE ALRLKDEMLS CGKQLNLVVA TSLMKGYCKQ
GDIGSALYLF NKIKEDGLTP NKVTYAVLIE WCCRKQNVKK AYELYTEMKL MDIQPTVFNV
NSLIRGFLEA CSLKEASNLF DEAVESGIAN VFTYNVLLYH FCNDGKVNEA HSLWQRMEDN
GVVPTYASYN NMILAHCRAG NMDMAHTVFS EMLERGIKPT VITYTILMDG HFKKGNAEQA
LDVFDEMVGV NITPSDFTFN IIINGLAKVG RTSEARDMLK KFVDKGFVPI CLTYNSIING
FVKEGAMNSA LAVYREMCES GLSPNVVTYT TLINGFCKSH NIDLALKMQY EMKSKGLRLD
VPAFSALIDG FCKEQDMDRA CELFSELQQV GLSPNVIVYN SMIRGFRNVN NMEAALDLHK
KMINEGILCD LQTYTTLIDG LLREGKLLFA FDLYSEMLAK GIEPDIITYT VLLNGLCNKG
QLENARKILE EMDRKGMTPS VLIYNTLIAG QFKEGNLEEA LRLHNEMLDR GLVPDAATYD
ILINGKAKGQ TSLSGVSCA
//