ID A0A061GNM9_THECC Unreviewed; 801 AA.
AC A0A061GNM9;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 53.
DE SubName: Full=Pentatricopeptide repeat (PPR) superfamily protein {ECO:0000313|EMBL:EOY31480.1};
GN ORFNames=TCM_046928 {ECO:0000313|EMBL:EOY31480.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY31480.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY31480.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SIMILARITY: Belongs to the PPR family. PCMP-H subfamily.
CC {ECO:0000256|ARBA:ARBA00006643}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001887; EOY31480.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061GNM9; -.
DR EnsemblPlants; EOY31480; EOY31480; TCM_046928.
DR Gramene; EOY31480; EOY31480; TCM_046928.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_002706_15_1_1; -.
DR InParanoid; A0A061GNM9; -.
DR OMA; VYCRLNE; -.
DR Proteomes; UP000026915; Chromosome 9.
DR GO; GO:0043231; C:intracellular membrane-bounded organelle; IBA:GO_Central.
DR GO; GO:0003723; F:RNA binding; IBA:GO_Central.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0009451; P:RNA modification; IBA:GO_Central.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 4.
DR InterPro; IPR032867; DYW_dom.
DR InterPro; IPR046848; E_motif.
DR InterPro; IPR046849; Eplus_motif.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR045215; PPR_prot_At1g15510.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR NCBIfam; TIGR00756; PPR; 6.
DR PANTHER; PTHR24015:SF1695; OS06G0185800 PROTEIN; 1.
DR PANTHER; PTHR24015; OS07G0578800 PROTEIN-RELATED; 1.
DR Pfam; PF14432; DYW_deaminase; 1.
DR Pfam; PF20431; E_motif; 1.
DR Pfam; PF20430; Eplus_motif; 1.
DR Pfam; PF01535; PPR; 5.
DR Pfam; PF13041; PPR_2; 2.
DR SUPFAM; SSF48452; TPR-like; 1.
DR PROSITE; PS51375; PPR; 5.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 190..224
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 292..326
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 393..427
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 463..493
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 494..528
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT DOMAIN 709..801
FT /note="DYW"
FT /evidence="ECO:0000259|Pfam:PF14432"
FT REGION 1..20
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..16
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 801 AA; 89198 MW; CE4CEB4E582AD1C6 CRC64;
MALTSGRRRR RRRQMFSKSI ASTYSPTRSR NFFLNLLKKS TTLPQLTQTH AQLILNGFRN
DLSTITKLTH RLFDLNATSY ARDVFLSIPN PDLFLFNVLI KGFSNTHSIS LYTHLRKCTR
LNPDNFTYAF AIASASTLSD EKVGMFLYEH AVVDGYGFDL FVGTAVVDFY FKIWRVELAR
KVFDKMPERD TVLWNSMISG LVKNCCFEDA IRVFRDMLED GGIRLDSTSV AAVLPAFSEL
QELISGMEVQ CLALKLGFHS HVYVLTGLIS LYSKGGEIEA AKLLFGEIGR PDLVSCNAMI
SGYTSNGESE CSVRLFKQLL GSGEKVNSST IVGLIPVLSP FGYLNLTNCI HSFCVKYGFV
SQSSVSTALT TAYSRLNEIE SARQLFDESS EKTPASWNAM ISGYTQNGLT EAAISLFQEM
QMSKVGPNPV TLTSILSACA QLGALSLGKW VHGLVKSKSF DSNIYVSTAL IDMYAKCGSI
REARQLFDLM LGKNVVTWNA MISGYGLHGQ GQDALRLFSE MLHSGVSPNG VTFLSLLYAC
SHAGLVKEGE EIFRSMVHAN QFKPLAEHYA CMVDILGRAG QLEKAFKFIK EMPVEPGPAE
WGALLGACMI HKDKKLAHVA SERLFELDPE NVGYYVLLSN LYSAERNYPL AASVRQNVKK
RMLAKIPGCT LIEIGETPHV FTSGDRSHPQ ATEIYAMLEK LIRKMKEAGF QTETDTALHD
VEEEEKELMV NVHSEKLAIA FGLVVTQPGT EIRIFKNLRV CVDCHTATKF ISKITERVIV
VRDANRFHHF KDGVCSCGDY W
//