GenomeNet

Database: UniProt
Entry: A0A061F521_THECC
ID   A0A061F521_THECC        Unreviewed;      1824 AA.
AC   A0A061F521;
DT   03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT   03-SEP-2014, sequence version 1.
DT   24-JAN-2024, entry version 57.
DE   SubName: Full=RNA binding,RNA binding isoform 1 {ECO:0000313|EMBL:EOY09614.1};
GN   ORFNames=TCM_025027 {ECO:0000313|EMBL:EOY09614.1};
OS   Theobroma cacao (Cacao) (Cocoa).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX   NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY09614.1, ECO:0000313|Proteomes:UP000026915};
RN   [1] {ECO:0000313|EMBL:EOY09614.1, ECO:0000313|Proteomes:UP000026915}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX   PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA   Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA   Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA   Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA   Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA   Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA   Kuhn D.N.;
RT   "The genome sequence of the most widely cultivated cacao type and its use
RT   to identify candidate genes regulating pod color.";
RL   Genome Biol. 14:R53.1-R53.24(2013).
CC   -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC       {ECO:0000256|ARBA:ARBA00004604}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM001883; EOY09614.1; -; Genomic_DNA.
DR   STRING; 3641.A0A061F521; -.
DR   EnsemblPlants; EOY09614; EOY09614; TCM_025027.
DR   Gramene; EOY09614; EOY09614; TCM_025027.
DR   eggNOG; KOG1070; Eukaryota.
DR   HOGENOM; CLU_000845_1_1_1; -.
DR   InParanoid; A0A061F521; -.
DR   OMA; GQYLRAY; -.
DR   Proteomes; UP000026915; Chromosome 5.
DR   GO; GO:0005730; C:nucleolus; IBA:GO_Central.
DR   GO; GO:0032040; C:small-subunit processome; IBA:GO_Central.
DR   GO; GO:0003723; F:RNA binding; IBA:GO_Central.
DR   GO; GO:0006364; P:rRNA processing; IEA:UniProtKB-KW.
DR   CDD; cd05703; S1_Rrp5_repeat_hs12_sc9; 1.
DR   CDD; cd05693; S1_Rrp5_repeat_hs1_sc1; 1.
DR   CDD; cd05694; S1_Rrp5_repeat_hs2_sc2; 1.
DR   CDD; cd05695; S1_Rrp5_repeat_hs3; 1.
DR   CDD; cd04461; S1_Rrp5_repeat_hs8_sc7; 1.
DR   Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 10.
DR   Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 2.
DR   InterPro; IPR003107; HAT.
DR   InterPro; IPR012340; NA-bd_OB-fold.
DR   InterPro; IPR045209; Rrp5.
DR   InterPro; IPR048059; Rrp5_S1_rpt_hs1_sc1.
DR   InterPro; IPR003029; S1_domain.
DR   InterPro; IPR008847; Suf.
DR   InterPro; IPR011990; TPR-like_helical_dom_sf.
DR   PANTHER; PTHR23270; PROGRAMMED CELL DEATH PROTEIN 11 PRE-RRNA PROCESSING PROTEIN RRP5; 1.
DR   PANTHER; PTHR23270:SF10; PROTEIN RRP5 HOMOLOG; 1.
DR   Pfam; PF00575; S1; 5.
DR   Pfam; PF05843; Suf; 1.
DR   SMART; SM00386; HAT; 6.
DR   SMART; SM00316; S1; 15.
DR   SUPFAM; SSF50249; Nucleic acid-binding proteins; 12.
DR   SUPFAM; SSF48452; TPR-like; 2.
DR   PROSITE; PS50126; S1; 14.
PE   4: Predicted;
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW   rRNA processing {ECO:0000256|ARBA:ARBA00022552}.
FT   DOMAIN          32..115
FT                   /note="S1 motif"
FT                   /evidence="ECO:0000259|PROSITE:PS50126"
FT   DOMAIN          131..196
FT                   /note="S1 motif"
FT                   /evidence="ECO:0000259|PROSITE:PS50126"
FT   DOMAIN          219..289
FT                   /note="S1 motif"
FT                   /evidence="ECO:0000259|PROSITE:PS50126"
FT   DOMAIN          305..364
FT                   /note="S1 motif"
FT                   /evidence="ECO:0000259|PROSITE:PS50126"
FT   DOMAIN          396..463
FT                   /note="S1 motif"
FT                   /evidence="ECO:0000259|PROSITE:PS50126"
FT   DOMAIN          483..552
FT                   /note="S1 motif"
FT                   /evidence="ECO:0000259|PROSITE:PS50126"
FT   DOMAIN          567..639
FT                   /note="S1 motif"
FT                   /evidence="ECO:0000259|PROSITE:PS50126"
FT   DOMAIN          659..728
FT                   /note="S1 motif"
FT                   /evidence="ECO:0000259|PROSITE:PS50126"
FT   DOMAIN          772..836
FT                   /note="S1 motif"
FT                   /evidence="ECO:0000259|PROSITE:PS50126"
FT   DOMAIN          962..1033
FT                   /note="S1 motif"
FT                   /evidence="ECO:0000259|PROSITE:PS50126"
FT   DOMAIN          1057..1128
FT                   /note="S1 motif"
FT                   /evidence="ECO:0000259|PROSITE:PS50126"
FT   DOMAIN          1164..1238
FT                   /note="S1 motif"
FT                   /evidence="ECO:0000259|PROSITE:PS50126"
FT   DOMAIN          1272..1341
FT                   /note="S1 motif"
FT                   /evidence="ECO:0000259|PROSITE:PS50126"
FT   DOMAIN          1362..1432
FT                   /note="S1 motif"
FT                   /evidence="ECO:0000259|PROSITE:PS50126"
FT   REGION          1515..1541
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1524..1541
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1824 AA;  203700 MW;  B2FB1BC971B7C68C CRC64;
     MLDDLGSLFG DGITGKLPRY ANKITLKNIS PGMKLWGVVA EVNEKDLVIS LPGGLRGLVR
     AADALDSVLS NEVENNEGNF LTNIFCTGQL VSCIVLQLDD DKKETGKRKI WLSLRLSLLH
     KSFTLDAVQE GMVLTAYVKS IEDHGYILHF GLSSFMGFLP KDDEESRDIK VRTGQFLQGV
     VRRIDKTRKV VYLSSNPDTV SKCVTKDLKG ISIDLLIPGM LVNTSVRSIL ENGVMLSFLT
     YFTGTVDMFH LQNQFPTKDW KDDYNQNKKI NARILFIDPS TRAVGLTLNP HLVHNKAPPS
     HVNIGEIYDQ SKVIRVDRGL GLLLDIPSKP VSTPAYVYIS DVAEEEVRKL EKKFKEGSQV
     RVRIHGFRHL EGLATGILKA SAFEGQVFTH SDVKPGMVIR AKVIALDSFS AIVQFPGGVK
     ALCPIRHMSE FEIAKPGKKF KVGAELVFRV LGCKSKRITV THKKTLVKSK LGIISSYADA
     TEGFITHGWI TKIEKHGCFV RFYNGVQGFA PRSELGLGPG YDPSSMYHVG QVIKCRVTSS
     NPASRRINLS FQMKPVRVSE DDLVKLGSIV SGLIDRLTPS AVVIQVNSKA HLKGTISNEH
     LADNHESAAL LKSVLKPGYK FDQLLVLDIE GNNILLSAKY SLTSLAEQLP SDISQIHPNS
     VVHGYVCNLI ETGCFVRFLG RLTGFSPRSK STDDYKADLS GAFYVGQSVR SNILDVNSET
     ARITLSLKQS SCSSTDASFI QEFFLLEEKI AKLQSSDSDG SELKWVEGFN VGSVIEGKIG
     EAKDIGVVVS FDKYNDVLGF VTHYQLGGLT LETGSIVQAA VLDVAKAERL VDLSLKPEFV
     DKSQEESSKG QIQKKKRKRE ASKDLEVHQT VNAVVEIVKE HYLVLAIPEY NYAIGYASKA
     DYNTQKFPQK QFVNGQRVIA TVMALPSPTT SGRLLLLLNS ISEVTETSSS KRAKKKSSYS
     VGSLVSAEVT EIMPLELRLK FGIGFRGRVH VTEVNDDNVL ENPFGNFKIG QTITARVVGK
     ANQKGYLWDL SIKPTMLAGT GETGVNSTND ECNFSAGQLV TGYVYKMDTE WAWLTISRHV
     KAQLYILDSA REPNELQQFQ ERFKVGKAVS GHVLNVNKDK KLLRLVRHPL GALSIRNVHG
     EDKRTGESDN NISGESVTTH IHEGDILGGR ISKILPGVGG LLVQIGPHIF GRVHFTELKD
     TWESDPLSGY YEGQFVKCKV LEISHSVKGT IHIDLSLRLS LDGMLPNNPS ELGSDEDSTS
     KRVEKIEDLY PNMAIQGYVK NTIPKGCFIL LSRKLDAKIL LSNLSDGYID DPKKEFPIGK
     LVAGRVLAVE PLSKRVEVTL KKSNTNGTSK SEINDFSSLH VGDIVSGRIR RVESYGLFVT
     LDHTNMVGLC HVSELSDDHV DNIQTKYRAG EKVTAKILKL DEERHRISLG MKNSYLTDDI
     DIQIPSNEES DEDVEETDDT RSRMLTDSTL GMAIEYENGA SSICAQAESR ASIPPLEVTL
     DDIEHSDMDI LVSQNQANSN EAVTGDEKNK RRAKKKAKED REREIRAAEE RQLEMDVPRT
     ADEFEKLVRN SPNSSFVWIK YMAFMLNSAD IEKARAIAER ALRTINIREE NEKLNIWVAY
     FNLENQYGNP PEEAVQKIFQ RALQYCDPKK VHLALLGMYE RTEQHKLADE LLDKMTRKFK
     HSCKVWLRRV QMLLMQQQDG VQSVVNRALL CLPRHKHIKF ISQTAILEFK SGVPDRGRSM
     FEGILREYPK RTDLWSIYLD XEIRLGDEDV IRALFERAIS LSLPPKKMKF LFKKYLDYEK
     SLGDEERIKS VKQKAMDYVE STLT
//