ID A0A061F521_THECC Unreviewed; 1824 AA.
AC A0A061F521;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 24-JAN-2024, entry version 57.
DE SubName: Full=RNA binding,RNA binding isoform 1 {ECO:0000313|EMBL:EOY09614.1};
GN ORFNames=TCM_025027 {ECO:0000313|EMBL:EOY09614.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY09614.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY09614.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001883; EOY09614.1; -; Genomic_DNA.
DR STRING; 3641.A0A061F521; -.
DR EnsemblPlants; EOY09614; EOY09614; TCM_025027.
DR Gramene; EOY09614; EOY09614; TCM_025027.
DR eggNOG; KOG1070; Eukaryota.
DR HOGENOM; CLU_000845_1_1_1; -.
DR InParanoid; A0A061F521; -.
DR OMA; GQYLRAY; -.
DR Proteomes; UP000026915; Chromosome 5.
DR GO; GO:0005730; C:nucleolus; IBA:GO_Central.
DR GO; GO:0032040; C:small-subunit processome; IBA:GO_Central.
DR GO; GO:0003723; F:RNA binding; IBA:GO_Central.
DR GO; GO:0006364; P:rRNA processing; IEA:UniProtKB-KW.
DR CDD; cd05703; S1_Rrp5_repeat_hs12_sc9; 1.
DR CDD; cd05693; S1_Rrp5_repeat_hs1_sc1; 1.
DR CDD; cd05694; S1_Rrp5_repeat_hs2_sc2; 1.
DR CDD; cd05695; S1_Rrp5_repeat_hs3; 1.
DR CDD; cd04461; S1_Rrp5_repeat_hs8_sc7; 1.
DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 10.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 2.
DR InterPro; IPR003107; HAT.
DR InterPro; IPR012340; NA-bd_OB-fold.
DR InterPro; IPR045209; Rrp5.
DR InterPro; IPR048059; Rrp5_S1_rpt_hs1_sc1.
DR InterPro; IPR003029; S1_domain.
DR InterPro; IPR008847; Suf.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR PANTHER; PTHR23270; PROGRAMMED CELL DEATH PROTEIN 11 PRE-RRNA PROCESSING PROTEIN RRP5; 1.
DR PANTHER; PTHR23270:SF10; PROTEIN RRP5 HOMOLOG; 1.
DR Pfam; PF00575; S1; 5.
DR Pfam; PF05843; Suf; 1.
DR SMART; SM00386; HAT; 6.
DR SMART; SM00316; S1; 15.
DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 12.
DR SUPFAM; SSF48452; TPR-like; 2.
DR PROSITE; PS50126; S1; 14.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW rRNA processing {ECO:0000256|ARBA:ARBA00022552}.
FT DOMAIN 32..115
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 131..196
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 219..289
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 305..364
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 396..463
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 483..552
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 567..639
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 659..728
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 772..836
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 962..1033
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1057..1128
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1164..1238
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1272..1341
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1362..1432
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT REGION 1515..1541
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1524..1541
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1824 AA; 203700 MW; B2FB1BC971B7C68C CRC64;
MLDDLGSLFG DGITGKLPRY ANKITLKNIS PGMKLWGVVA EVNEKDLVIS LPGGLRGLVR
AADALDSVLS NEVENNEGNF LTNIFCTGQL VSCIVLQLDD DKKETGKRKI WLSLRLSLLH
KSFTLDAVQE GMVLTAYVKS IEDHGYILHF GLSSFMGFLP KDDEESRDIK VRTGQFLQGV
VRRIDKTRKV VYLSSNPDTV SKCVTKDLKG ISIDLLIPGM LVNTSVRSIL ENGVMLSFLT
YFTGTVDMFH LQNQFPTKDW KDDYNQNKKI NARILFIDPS TRAVGLTLNP HLVHNKAPPS
HVNIGEIYDQ SKVIRVDRGL GLLLDIPSKP VSTPAYVYIS DVAEEEVRKL EKKFKEGSQV
RVRIHGFRHL EGLATGILKA SAFEGQVFTH SDVKPGMVIR AKVIALDSFS AIVQFPGGVK
ALCPIRHMSE FEIAKPGKKF KVGAELVFRV LGCKSKRITV THKKTLVKSK LGIISSYADA
TEGFITHGWI TKIEKHGCFV RFYNGVQGFA PRSELGLGPG YDPSSMYHVG QVIKCRVTSS
NPASRRINLS FQMKPVRVSE DDLVKLGSIV SGLIDRLTPS AVVIQVNSKA HLKGTISNEH
LADNHESAAL LKSVLKPGYK FDQLLVLDIE GNNILLSAKY SLTSLAEQLP SDISQIHPNS
VVHGYVCNLI ETGCFVRFLG RLTGFSPRSK STDDYKADLS GAFYVGQSVR SNILDVNSET
ARITLSLKQS SCSSTDASFI QEFFLLEEKI AKLQSSDSDG SELKWVEGFN VGSVIEGKIG
EAKDIGVVVS FDKYNDVLGF VTHYQLGGLT LETGSIVQAA VLDVAKAERL VDLSLKPEFV
DKSQEESSKG QIQKKKRKRE ASKDLEVHQT VNAVVEIVKE HYLVLAIPEY NYAIGYASKA
DYNTQKFPQK QFVNGQRVIA TVMALPSPTT SGRLLLLLNS ISEVTETSSS KRAKKKSSYS
VGSLVSAEVT EIMPLELRLK FGIGFRGRVH VTEVNDDNVL ENPFGNFKIG QTITARVVGK
ANQKGYLWDL SIKPTMLAGT GETGVNSTND ECNFSAGQLV TGYVYKMDTE WAWLTISRHV
KAQLYILDSA REPNELQQFQ ERFKVGKAVS GHVLNVNKDK KLLRLVRHPL GALSIRNVHG
EDKRTGESDN NISGESVTTH IHEGDILGGR ISKILPGVGG LLVQIGPHIF GRVHFTELKD
TWESDPLSGY YEGQFVKCKV LEISHSVKGT IHIDLSLRLS LDGMLPNNPS ELGSDEDSTS
KRVEKIEDLY PNMAIQGYVK NTIPKGCFIL LSRKLDAKIL LSNLSDGYID DPKKEFPIGK
LVAGRVLAVE PLSKRVEVTL KKSNTNGTSK SEINDFSSLH VGDIVSGRIR RVESYGLFVT
LDHTNMVGLC HVSELSDDHV DNIQTKYRAG EKVTAKILKL DEERHRISLG MKNSYLTDDI
DIQIPSNEES DEDVEETDDT RSRMLTDSTL GMAIEYENGA SSICAQAESR ASIPPLEVTL
DDIEHSDMDI LVSQNQANSN EAVTGDEKNK RRAKKKAKED REREIRAAEE RQLEMDVPRT
ADEFEKLVRN SPNSSFVWIK YMAFMLNSAD IEKARAIAER ALRTINIREE NEKLNIWVAY
FNLENQYGNP PEEAVQKIFQ RALQYCDPKK VHLALLGMYE RTEQHKLADE LLDKMTRKFK
HSCKVWLRRV QMLLMQQQDG VQSVVNRALL CLPRHKHIKF ISQTAILEFK SGVPDRGRSM
FEGILREYPK RTDLWSIYLD XEIRLGDEDV IRALFERAIS LSLPPKKMKF LFKKYLDYEK
SLGDEERIKS VKQKAMDYVE STLT
//