ID A0A061F3W5_THECC Unreviewed; 559 AA.
AC A0A061F3W5;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 46.
DE SubName: Full=ATP binding,nucleic acid binding,helicases, putative {ECO:0000313|EMBL:EOY11598.1};
GN ORFNames=TCM_026733 {ECO:0000313|EMBL:EOY11598.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY11598.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY11598.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SIMILARITY: Belongs to the PPR family. P subfamily.
CC {ECO:0000256|ARBA:ARBA00007626}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001883; EOY11598.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061F3W5; -.
DR EnsemblPlants; EOY11598; EOY11598; TCM_026733.
DR Gramene; EOY11598; EOY11598; TCM_026733.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_002706_49_0_1; -.
DR InParanoid; A0A061F3W5; -.
DR OMA; FNMILGS; -.
DR Proteomes; UP000026915; Chromosome 5.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 5.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR NCBIfam; TIGR00756; PPR; 12.
DR PANTHER; PTHR47942:SF16; OS02G0321000 PROTEIN; 1.
DR PANTHER; PTHR47942; TETRATRICOPEPTIDE REPEAT (TPR)-LIKE SUPERFAMILY PROTEIN-RELATED; 1.
DR Pfam; PF01535; PPR; 1.
DR Pfam; PF12854; PPR_1; 1.
DR Pfam; PF13041; PPR_2; 6.
DR SUPFAM; SSF48452; TPR-like; 1.
DR PROSITE; PS51375; PPR; 12.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 97..131
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 132..166
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 167..201
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 202..236
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 237..271
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 272..306
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 307..341
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 347..381
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 382..416
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 417..451
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 452..486
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 487..521
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
SQ SEQUENCE 559 AA; 62457 MW; FE6C31CBC73F4A1E CRC64;
MHFLFCFAIA SPSTWELLFS FSYPFLFFFL FPRFNKSRYA FHKIDDALVS FDHMLRTHPR
PCIVEFTQVL GAIVRMKHYE TAVSLSRQMD FLGIRHDVYT LNILVNCFCS LHRTDFGFSL
LGKMLKLGIQ PDTTTFNTLV NGLCVEGKIA EAVILFDGIV RNGCQPDLIT YGTVMNGLCK
IGYTTGAIRL LRNMKQSGIV PNTVTYNTTI DCLCKDKLVP EALNLLSEMR GKGIPPDVVT
YNSFIHAMCS LGQWNEVMRL LTEMVANNCK PNIVSYSILV DAFCTEGRVS EACDIVEGMI
RRGVDSDTIT YNALMDGYCL QGKMDEARKI LNLMITKGCV PNVYVPNTVT YTALINGMCQ
VGRLGAAREL HKEMSARGLV PNTVTYSTLL HGLCKHGHVH EAAELFHVMQ SNGIEANIVH
YSILIDGLCQ VGQLNVARKL FHSLPGKGLH PNVYTCDIMI KVLCKEGLPN EAYDLFRKME
VNGCLQDSCS YNTMIKGFFQ NNDVSRAVQI LHEMVDKGFS AGSSTATMVV DLLCKNGGDQ
SILELFLRNS EDDQNVNMK
//