Database: UniProt
Entry: A0A061DI43_THECC
ID   A0A061DI43_THECC        Unreviewed;       641 AA.
AC   A0A061DI43;
DT   03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT   03-SEP-2014, sequence version 1.
DT   24-JAN-2024, entry version 33.
DE   SubName: Full=RNA-binding KH domain-containing protein, putative isoform 3 {ECO:0000313|EMBL:EOX91912.1};
GN   ORFNames=TCM_000966 {ECO:0000313|EMBL:EOX91912.1};
OS   Theobroma cacao (Cacao) (Cocoa).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX   NCBI_TaxID=3641 {ECO:0000313|EMBL:EOX91912.1, ECO:0000313|Proteomes:UP000026915};
RN   [1] {ECO:0000313|EMBL:EOX91912.1, ECO:0000313|Proteomes:UP000026915}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX   PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA   Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA   Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA   Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA   Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA   Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA   Kuhn D.N.;
RT   "The genome sequence of the most widely cultivated cacao type and its use
RT   to identify candidate genes regulating pod color.";
RL   Genome Biol. 14:R53.1-R53.24(2013).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM001879; EOX91912.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A061DI43; -.
DR   EnsemblPlants; EOX91912; EOX91912; TCM_000966.
DR   Gramene; EOX91912; EOX91912; TCM_000966.
DR   Proteomes; UP000026915; Chromosome 1.
DR   GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR   CDD; cd22459; KH-I_PEPPER_rpt1_like; 1.
DR   Gene3D; 3.30.1370.10; K Homology domain, type 1; 4.
DR   InterPro; IPR004087; KH_dom.
DR   InterPro; IPR004088; KH_dom_type_1.
DR   InterPro; IPR036612; KH_dom_type_1_sf.
DR   PANTHER; PTHR10288; KH DOMAIN CONTAINING RNA BINDING PROTEIN; 1.
DR   PANTHER; PTHR10288:SF269; RNA-BINDING KH DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF00013; KH_1; 4.
DR   SMART; SM00322; KH; 4.
DR   SUPFAM; SSF54791; Eukaryotic type KH-domain (KH-domain type I); 4.
DR   PROSITE; PS50084; KH_TYPE_1; 4.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   RNA-binding {ECO:0000256|PROSITE-ProRule:PRU00117}.
FT   DOMAIN          23..106
FT                   /note="K Homology"
FT                   /evidence="ECO:0000259|SMART:SM00322"
FT   DOMAIN          126..198
FT                   /note="K Homology"
FT                   /evidence="ECO:0000259|SMART:SM00322"
FT   DOMAIN          271..347
FT                   /note="K Homology"
FT                   /evidence="ECO:0000259|SMART:SM00322"
FT   DOMAIN          363..436
FT                   /note="K Homology"
FT                   /evidence="ECO:0000259|SMART:SM00322"
FT   REGION          459..493
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        462..493
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   641 AA;  67888 MW;  1142A465EEF4758F CRC64;
     MQNPYNRRRG RQGPPVTIVP LPGEVSFRII CHVSSIGGVI GNSGAVVSQL RRETSSRIHC
     EEPVRGSAHR VILIVGSGSV ERRFSLGEGE ECDVSCAQEA MVRVFQRVWE VEAEREWGNA
     CDGEDEEAYC GVLADTTQIG AVVGRGGNNI VRMRTETGAK IRILPPPPCG RKNDELIQIT
     GGTLAVKKAL VAVSGCLQAC PPLDRESTPM SVPTEKPSRG TSPEPHIEFF PHLSSLLPPM
     SANSVSASSN ATFSSMDADG DSNLDSNGTQ KEVVFRMLCS NGAAGAIIGK KGAIVRALQN
     QTGASIMFAS PVTESGERVV TISALENLES WYSPAQNAVV LVFARSVEAD IGKGLPSGLS
     KGSAVTVRLL VAKNLVSCLN DKGGRVLSEI VEVTGADVQI LDGDLTLDHS PEDVVQITGE
     YKSVQNAIFQ VTSRLRHNLL PPEVLNEMRV RNCYGKVSDT GVPQAYQPTS LSSDTDQGPN
     LAQRTQPGLS DNTAGPLPFK LQPQQLQTTG NGCTVATQDA ERGSTTFGGS LDLERSLDFL
     LPSEVLNEVG GRSPCKGGSE TTSGLLQSLG LSLDSDQENA LTRAVGNLGL SNNVGCPPKS
     PLLETVRRGH GLANAEGTGG LELERSVLHK NKIKAVSFCC C
//
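
The FT lines in the entry above follow a fixed layout: a feature key with a `start..end` residue range, followed by `/note` and `/evidence` qualifier lines. As a minimal sketch of how those ranges could be read programmatically, the snippet below parses an excerpt copied from this entry (the ID line and the FT lines, with the `/evidence` qualifiers left out for brevity). The names `ENTRY_EXCERPT`, `parse_features`, `FEATURE_RE`, and `LENGTH_RE` are illustrative assumptions, not part of any UniProt tooling, and the regular expressions only cover the simple line shapes seen here rather than the full flat-file format.

```python
import re

# Excerpt copied from the entry above; only the ID line and the FT lines
# (without /evidence qualifiers) are kept so the sketch stays short.
ENTRY_EXCERPT = """\
ID   A0A061DI43_THECC        Unreviewed;       641 AA.
FT   DOMAIN          23..106
FT                   /note="K Homology"
FT   DOMAIN          126..198
FT                   /note="K Homology"
FT   DOMAIN          271..347
FT                   /note="K Homology"
FT   DOMAIN          363..436
FT                   /note="K Homology"
FT   REGION          459..493
FT                   /note="Disordered"
FT   COMPBIAS        462..493
FT                   /note="Polar residues"
//
"""

# Hypothetical patterns: they match only the simple "key start..end" form
# used in this entry, not every location syntax the format allows.
FEATURE_RE = re.compile(r"^FT   (\S+)\s+(\d+)\.\.(\d+)\s*$")
NOTE_RE = re.compile(r'^FT\s+/note="(.*)"\s*$')
LENGTH_RE = re.compile(r"^ID   \S+\s+\S+;\s+(\d+) AA\.")


def parse_features(text):
    """Return (declared_length, [(key, start, end, note), ...])."""
    declared_length = None
    features = []
    for line in text.splitlines():
        if line.startswith("//"):  # record terminator
            break
        m = LENGTH_RE.match(line)
        if m:
            declared_length = int(m.group(1))
            continue
        m = FEATURE_RE.match(line)
        if m:
            features.append([m.group(1), int(m.group(2)), int(m.group(3)), None])
            continue
        m = NOTE_RE.match(line)
        if m and features:
            # Attach the note to the most recently opened feature.
            features[-1][3] = m.group(1)
    return declared_length, [tuple(f) for f in features]


length, features = parse_features(ENTRY_EXCERPT)
kh_domains = [(s, e) for key, s, e, note in features if key == "DOMAIN"]
for start, end in kh_domains:
    print(f"KH domain {start}..{end} spans {end - start + 1} residues")
```

Keeping the declared length from the ID line alongside the features makes it easy to check that every annotated range falls inside the 641-residue sequence.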