ID A0A061G4Y4_THECC Unreviewed; 685 AA.
AC A0A061G4Y4;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 48.
DE SubName: Full=Tetratricopeptide repeat (TPR)-like superfamily protein {ECO:0000313|EMBL:EOY24222.1};
GN ORFNames=TCM_015887 {ECO:0000313|EMBL:EOY24222.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY24222.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY24222.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SIMILARITY: Belongs to the PPR family. PCMP-H subfamily.
CC {ECO:0000256|ARBA:ARBA00006643}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001881; EOY24222.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061G4Y4; -.
DR STRING; 3641.A0A061G4Y4; -.
DR EnsemblPlants; EOY24222; EOY24222; TCM_015887.
DR Gramene; EOY24222; EOY24222; TCM_015887.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_002706_37_2_1; -.
DR InParanoid; A0A061G4Y4; -.
DR OMA; MPFKPHP; -.
DR Proteomes; UP000026915; Chromosome 3.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0009451; P:RNA modification; IBA:GO_Central.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 3.
DR InterPro; IPR032867; DYW_dom.
DR InterPro; IPR046848; E_motif.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR NCBIfam; TIGR00756; PPR; 7.
DR PANTHER; PTHR47924:SF204; DYW_DEAMINASE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR47924; PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN; 1.
DR Pfam; PF14432; DYW_deaminase; 1.
DR Pfam; PF20431; E_motif; 1.
DR Pfam; PF01535; PPR; 6.
DR Pfam; PF13041; PPR_2; 2.
DR SUPFAM; SSF48452; TPR-like; 2.
DR PROSITE; PS51375; PPR; 6.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 121..156
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 184..218
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 246..276
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 277..311
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 347..377
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REPEAT 378..412
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT DOMAIN 593..685
FT /note="DYW"
FT /evidence="ECO:0000259|Pfam:PF14432"
FT REGION 53..84
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 59..84
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 685 AA; 77272 MW; 66CBE9912887C2E6 CRC64;
MFHYLKNTRK QPTLPFLKIF PPTKHPLSPF STFTILNPNV NPNHEAVPAN FPSHSPHEPH
LVPTSNITRK PPTAPSLDKS PTLDQQDHNY IISSNKVITS YIRSGDLDSA LRVFNTMTVK
TTVTWNSILA GYSKKPGKIT QAQKLFDKIP EKDTVSYNIM LACYVHNSDM ETAWSFFNSM
PFKDSASWNT MISGFAQKGL MGKARELFSA TPEKNSVTWS AMISGYVECG ELELAVEFFE
LVDVKSVVAW TAMISGYMKF GKIEKAERLF KEMPVKNLVT WNAMISGYVE NCRAEDGLKL
FRMMLRYGIR PNNSSLSSVL LGCSELSALQ FGKQVHQLVC KSLLRDDTTA DTSLISMYCK
CGALDDAWKL FLEIKKKDVV SWNAMISGYA QHGAGEKALH LFEEMRDEGV RPDWITFVAV
LLACNHAGLV DMGIRYFDSM LKDYGVEARP DHYTCMVDLL GRAGKLVEAV NLIKRMPFKP
HCAIYGTLLG ACRIHKNLEM AEFAAENLLN LDPKNAAGYV QLANIYAAMN KWDHVARVRQ
SMKDNKVVKT PGYSWIEIKS VVHEFRSGDR VHPDLASIHE KLRELEKKLK FAGYVPDLEF
ALHDVGEEQK AQLLLRHSEK LAIAFGLIKV PSGGPIRVFK NLRVCGDCHR AIKYISAIET
REIIVRDTVR FHHFKDGSCS CGDYW
//