ID A0A061FUI6_THECC Unreviewed; 811 AA.
AC A0A061FUI6;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 37.
DE SubName: Full=Plastid transcriptionally active 2 isoform 3 {ECO:0000313|EMBL:EOY20557.1};
GN ORFNames=TCM_011951 {ECO:0000313|EMBL:EOY20557.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY20557.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY20557.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SIMILARITY: Belongs to the PPR family. P subfamily.
CC {ECO:0000256|ARBA:ARBA00007626}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001881; EOY20557.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061FUI6; -.
DR EnsemblPlants; EOY20557; EOY20557; TCM_011951.
DR Gramene; EOY20557; EOY20557; TCM_011951.
DR Proteomes; UP000026915; Chromosome 3.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 4.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR NCBIfam; TIGR00756; PPR; 10.
DR PANTHER; PTHR47936:SF1; PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN GUN1, CHLOROPLASTIC; 1.
DR PANTHER; PTHR47936; PPR_LONG DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01535; PPR; 2.
DR Pfam; PF13041; PPR_2; 2.
DR Pfam; PF13812; PPR_3; 3.
DR SUPFAM; SSF48452; TPR-like; 1.
DR PROSITE; PS51375; PPR; 13.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 135..169
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 170..204
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 205..240
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 241..275
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 276..310
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 311..345
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 346..380
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 381..415
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 416..450
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 486..520
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 521..555
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 556..590
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 591..625
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
SQ SEQUENCE 811 AA; 91526 MW; D9080DD7CD244812 CRC64;
MAISIPNHFL ALTQPSNFAL NRRQLSSNRI FTGGNHSFLS GGAGICRAKP RELVLGNPSV
TVEKGKYSYD VETLINKLSS LPPRGSIARC LDVFRNKLSL NDFALVFKEF AHRGDWQRSL
RLFKYMQRQI WCKPNEHIYT IMISLLGREG LLEKCREVFD EMPSQGVTRS VFAYTALINA
YGRNGAYNIS LELLDKMKKD KVLPSILTYN TVINACARGG LDWEGLLGLF AEMRHEGIQP
DIVTYNTLLS ACANRGLGNE AEMVFRTMNE GGILPDLTTY SYLVESFGKL GKLEKVSELL
KEMESGGNLP DIMSYNVLLE AYAKSGSIKE AMGVFKQMQV AGCAPNATTY SILLNLYGRN
GRYDDVRELF LEMKESNTEP DAATYNILIQ VFGEGGYFKE VVTLFHDMVE ENIEPNVKTY
DGLIFACGKG GLHEDAKKIL LHMNEKCIVP SSRAYTGVIE AYGQAALYEE VLVAFNTMNE
VESNPTIETY NSLLQTFARG GLYKEANAIL SRMNETGVAK NRDSFNALIE AFRQGGQFED
AIKAYVEMEK ARCDPDERTL EAVLSVYCFA GLVDESNEQF QEIKALGVLP SVMCYCMMLA
VYAKCDRWDD AYQLFDEMLT NKVSNIHQVI GKMIRGDYDD DANWQMVEYV FDKLNSEGCG
FGIRFYNALL EALWWLRQKE RAARVLNEAT KRGLFPELFR KNKLVWSVDV HRMWEGGTYT
AVSIWLNSMQ KMFLSGDDLP QLATVVVAWR KAQLPVIFQL QKLPIHSCRI LCRHHFLSLG
GTKGELSVSG LSLSEFCQPQ AHLQMSRKRI I
//