ID A0A061DHQ1_THECC Unreviewed; 897 AA.
AC A0A061DHQ1;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 48.
DE SubName: Full=Tetratricopeptide repeat-like superfamily protein, putative isoform 1 {ECO:0000313|EMBL:EOX91702.1};
GN ORFNames=TCM_000805 {ECO:0000313|EMBL:EOX91702.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOX91702.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOX91702.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001879; EOX91702.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061DHQ1; -.
DR STRING; 3641.A0A061DHQ1; -.
DR EnsemblPlants; EOX91702; EOX91702; TCM_000805.
DR Gramene; EOX91702; EOX91702; TCM_000805.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_009409_1_0_1; -.
DR InParanoid; A0A061DHQ1; -.
DR OMA; FWEEGRV; -.
DR Proteomes; UP000026915; Chromosome 1.
DR GO; GO:0009507; C:chloroplast; IBA:GO_Central.
DR GO; GO:0009658; P:chloroplast organization; IBA:GO_Central.
DR GO; GO:0042793; P:plastid transcription; IEA:EnsemblPlants.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 3.
DR InterPro; IPR044645; DG1/EMB2279-like.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR NCBIfam; TIGR00756; PPR; 2.
DR PANTHER; PTHR46935; OS01G0674700 PROTEIN; 1.
DR PANTHER; PTHR46935:SF2; PPR_LONG DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01535; PPR; 2.
DR Pfam; PF13812; PPR_3; 2.
DR PROSITE; PS51375; PPR; 4.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 341..375
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 376..410
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 411..445
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 516..550
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REGION 1..25
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 897 AA; 102599 MW; 196F83C74A720136 CRC64;
MDASIVPSPQ LPPPQFEPNT ENIKRKLLRK GVYPTPKIIR TLRKREIQKH TRKTKHSQPQ
TPPLTAFQLQ SLAEESHFLT LKREYKRFSK ELNPKKEPRS PSLLGKPWER IERAKLAELV
SKNGEFDGQS LKRENLVELR EMFEKDLRWV LDDDVDVEDD GGLLPREKPA RDRDPSKRWR
NEKEAIRFLV DRLSEREITE RHWKFVRIMK QSGLQFTEWQ LLRIVEGLGK NGKWRQAMAV
VQWLYGNKEH KEFKSRFVYT KLLSVLGKAR KPQEALRVFN LMLGDCHIYP DLAAYHSIAV
TMGQAGLLKE LLNIIERMRQ RPYKRIKNMR RKNWDPVLEP DLVVYNAVLN ACVPVHQWKG
VSWVFEQLRK SGLRPNGATY GLAMEVMLQS GKYDLVHEFF RKMKRSGEAP RALSYRVLVK
AFWEEGKINE AVEAVRDMEQ RGVIGTASVY YELACCLCKN GRWRDAIIEV DKMKKLSQRK
PLEITFTGLI MASLDGGHFN DCISIFQYMK DHCAPNIGTI NAMLKVYGQN DMFSKAKELF
EEINKAKSGP YDSQNGKSTN LIPDGYTYSL MLGASASALQ WEYFEYVYKE MTLSGYHLDQ
TKHAILLVEA SRARKWYLLE HAFDTFLEVG EIPHPLLFTE MIIQATAQSN YEKVVTLVNT
MAHALYQVSE KQWTEAFEEN GDRISHGSLS KLLDALSNCE LSSEITASNL IRSLQYLCGS
AKSEPNSNDG ETYGSERLNI QSISQDMRGE KIIAAMDPPL KATDVSFAVF SANCNGKNEE
GGVDADLIHR LSNYDMDDSA SKTFTCMEDF ANDTASGDPT SMGKQVSLLN LDEYTKDVDE
AEVDLPIDDD EAEMELLINE DGDSSTSKLP SANEILESWK ESSKNDGIFF PIHLGLK
//