ID A0A061DF79_THECC Unreviewed; 683 AA.
AC A0A061DF79;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 51.
DE SubName: Full=Ankyrin repeat protein {ECO:0000313|EMBL:EOX90735.1};
GN ORFNames=TCM_000124 {ECO:0000313|EMBL:EOX90735.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOX90735.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOX90735.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001879; EOX90735.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061DF79; -.
DR STRING; 3641.A0A061DF79; -.
DR EnsemblPlants; EOX90735; EOX90735; TCM_000124.
DR Gramene; EOX90735; EOX90735; TCM_000124.
DR eggNOG; KOG0504; Eukaryota.
DR eggNOG; KOG4661; Eukaryota.
DR HOGENOM; CLU_403041_0_0_1; -.
DR InParanoid; A0A061DF79; -.
DR OMA; PRYSPHA; -.
DR Proteomes; UP000026915; Chromosome 1.
DR GO; GO:0030941; F:chloroplast targeting sequence binding; IBA:GO_Central.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0045036; P:protein targeting to chloroplast; IBA:GO_Central.
DR CDD; cd00590; RRM_SF; 1.
DR Gene3D; 3.30.70.330; -; 1.
DR Gene3D; 1.25.40.20; Ankyrin repeat-containing domain; 2.
DR InterPro; IPR002110; Ankyrin_rpt.
DR InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR000504; RRM_dom.
DR PANTHER; PTHR24203:SF53; ANKYRIN REPEAT DOMAIN-CONTAINING PROTEIN, CHLOROPLASTIC; 1.
DR PANTHER; PTHR24203; ANKYRIN REPEAT FAMILY PROTEIN; 1.
DR Pfam; PF12796; Ank_2; 1.
DR Pfam; PF13637; Ank_4; 1.
DR Pfam; PF00076; RRM_1; 1.
DR SMART; SM00248; ANK; 5.
DR SMART; SM00360; RRM; 1.
DR SUPFAM; SSF48403; Ankyrin repeat; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 1.
DR PROSITE; PS50297; ANK_REP_REGION; 2.
DR PROSITE; PS50088; ANK_REPEAT; 4.
DR PROSITE; PS50102; RRM; 1.
PE 4: Predicted;
KW ANK repeat {ECO:0000256|ARBA:ARBA00023043, ECO:0000256|PROSITE-
KW ProRule:PRU00023}; Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW RNA-binding {ECO:0000256|PROSITE-ProRule:PRU00176}.
FT DOMAIN 69..147
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT REPEAT 524..552
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 553..585
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 586..618
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 619..651
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REGION 1..77
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 144..220
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 319..353
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 428..460
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..25
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 26..56
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 164..218
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 321..336
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 432..447
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 683 AA; 78380 MW; CF76CA096AF15FAA CRC64;
MADSPRKRNS KSPSPWREQS RSRSRSRSRS RSRSRSRSWS RPRHGSRSHS RGRSRSRSRG
RVDDGNPGNT LYVTGLSQRV SERDVEEHFS KEGKVVSCFL VVEPRTRISR GFAFVTMDNV
EDANRCIKYL NQSVLEGRYI TVERSRRKRP RTPTPGHYLG LKNTRDYGRG DRGRYRGGRD
DNGYRRSPRR SPYRGRDYSP RYSPHAGRSR RERSHSPYSR GSRQCVKCLT LDLEHGPWFY
LKLEKERAMS LLNPSTFLPL STPAKLSLPQ NLYFPRKLHS LSPSFLSSYI TQNDDVDDEE
HVIGDCLVFE DGAFEDPHLQ ADSTSFGNPN TSTGIGKTKP KKKKKKNKEK QSVDIEAENL
VPEQWRHVQE EINVTKKERR KMAQLLEFGS RVEKKRQGLV PIRNVNMQEY LKYKEAKLAQ
LNPVVLDNPS SFPVKEEEKE NEEGSKEAEM NLPSSSERVA PKNPRWAVYG RGFEDVAELF
NSENYQPGEN KSEGRRKLFT KEEKFMLNRR VPDLATATSG KWLPLHTLAA SGEFYLVDAL
LKHNVDINAV DKNGLTAVHK AIIGKKQAIT NYLLRESANP FVRDEDGATL MHYAVRTAST
PTIKLLLLYN VDINLQDNDG WSPLHLAVQA RRTDIVKLLL IRGADKMLKN KDGLTPLDLC
LYSGRDTRTY ELIKLLKQLP KPR
//