ID A0A061FR14_THECC Unreviewed; 2062 AA.
AC A0A061FR14;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 47.
DE SubName: Full=U5 small nuclear ribonucleoprotein helicase, putative isoform 2 {ECO:0000313|EMBL:EOY19725.1};
GN ORFNames=TCM_045030 {ECO:0000313|EMBL:EOY19725.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY19725.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY19725.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001888; EOY19725.1; -; Genomic_DNA.
DR EnsemblPlants; EOY19725; EOY19725; TCM_045030.
DR Gramene; EOY19725; EOY19725; TCM_045030.
DR HOGENOM; CLU_000335_1_0_1; -.
DR Proteomes; UP000026915; Chromosome 10.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR GO; GO:0016887; F:ATP hydrolysis activity; IEA:InterPro.
DR GO; GO:0004386; F:helicase activity; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0006396; P:RNA processing; IEA:UniProt.
DR CDD; cd18019; DEXHc_Brr2_1; 1.
DR CDD; cd18021; DEXHc_Brr2_2; 1.
DR CDD; cd18795; SF2_C_Ski2; 2.
DR Gene3D; 1.10.150.20; 5' to 3' exonuclease, C-terminal subdomain; 2.
DR Gene3D; 2.60.40.150; C2 domain; 2.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 4.
DR Gene3D; 1.10.3380.10; Sec63 N-terminal domain-like domain; 2.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 2.
DR InterPro; IPR003593; AAA+_ATPase.
DR InterPro; IPR048863; BRR2_plug.
DR InterPro; IPR035892; C2_domain_sf.
DR InterPro; IPR011545; DEAD/DEAH_box_helicase_dom.
DR InterPro; IPR014001; Helicase_ATP-bd.
DR InterPro; IPR001650; Helicase_C.
DR InterPro; IPR014756; Ig_E-set.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR004179; Sec63-dom.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR InterPro; IPR036390; WH_DNA-bd_sf.
DR PANTHER; PTHR47961; DNA POLYMERASE THETA, PUTATIVE (AFU_ORTHOLOGUE AFUA_1G05260)-RELATED; 1.
DR PANTHER; PTHR47961:SF4; U5 SMALL NUCLEAR RIBONUCLEOPROTEIN HELICASE; 1.
DR Pfam; PF21188; BRR2_plug; 1.
DR Pfam; PF00270; DEAD; 2.
DR Pfam; PF00271; Helicase_C; 1.
DR Pfam; PF02889; Sec63; 2.
DR PIRSF; PIRSF039073; BRR2; 2.
DR SMART; SM00382; AAA; 2.
DR SMART; SM00487; DEXDc; 2.
DR SMART; SM00490; HELICc; 2.
DR SMART; SM00973; Sec63; 2.
DR SUPFAM; SSF81296; E set domains; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 4.
DR SUPFAM; SSF158702; Sec63 N-terminal domain-like; 2.
DR SUPFAM; SSF46785; Winged helix' DNA-binding domain; 2.
DR PROSITE; PS51192; HELICASE_ATP_BIND_1; 2.
DR PROSITE; PS51194; HELICASE_CTER; 1.
PE 4: Predicted;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840};
KW Helicase {ECO:0000256|ARBA:ARBA00022806, ECO:0000313|EMBL:EOY19725.1};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW mRNA processing {ECO:0000256|ARBA:ARBA00022728};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Ribonucleoprotein {ECO:0000313|EMBL:EOY19725.1};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT DOMAIN 404..587
FT /note="Helicase ATP-binding"
FT /evidence="ECO:0000259|PROSITE:PS51192"
FT DOMAIN 621..815
FT /note="Helicase C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51194"
FT DOMAIN 1250..1427
FT /note="Helicase ATP-binding"
FT /evidence="ECO:0000259|PROSITE:PS51192"
FT REGION 24..86
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 192..246
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 291..313
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 54..86
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 211..245
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2062 AA; 234248 MW; 2669485DE60CDFD1 CRC64;
MAHLGGGAEA HARFKQYEYR ANSSLVLTTD SRPRDTHEPT GEPESLWGKI DPRSFGDRVY
KGRPLELDEK LKKSKKKKER DPLAEPVPVR KTKRRRLHEE SVLSVTEEGV YQPKTKETRA
AYEAMLSLIQ QQLGGQPLNI VSGAADEILA VLKNEGIKNP DKKKEIEKLL NPIPSQVFDQ
LVSIGKLITD YQDGGEGGGG SMGNGDDGLD DDVGVAVEFE ENEDEEEESD LDMVQEDEDE
DQEERKKIEE EMMSLGPDLA AILEQLHATR ATAKERQKNL EKSIREEARR LKDESVGDGD
RDRRGLADRD TDGGWLKGQR QLLDLDSLAF EQGGLLMANK KCELPMGSYK HHAKGYEEVH
VPAPKSKPLE SDERLVKISE MPEWAQPAFK GMQQLNRVQS KVYETALFAA DNILLCAPTG
AGKTNVAVLT ILQQLALNMD SDGSINHSNY KIVYVAPMKA LVAEVVGNLS HRLEAYGVTV
RELSGDQTLT RQQIDETQII VTTPEKWDII TRKSGDRTYT QLVKLLIIDE IHLLHDNRGP
VLESIVARTV RQIETTKEHI RLVGLSATLP NYEDVALFLR VDLKEGLFHF DNSYRPVPLS
QQYIGITVKK PLQRFQLMND ICYEKVMAVA GKHQVLIFVH SRKETTKTAR AVRDTALAND
TLSRFLKEDA ASREILQSHT DMVKSNDLKD LLPYGFAIHH AGLARTDRQI VEELFADGHV
QVLVSTATLA WGVNLPAHTV IIKGTQIYSP EKGAWTELSP LDVMQMLGRA GRPQYDSYGE
GIIITGHSEL QYYLSLMNQQ LPIESQFVSK LADQLNAEIV LGTVQNAREA CNWITYTYLY
VRMLRNPTLY GLPADVLSRD LTLDERRADL IHSAATILDK NNLVKYDRKS GYFQVTDLGR
IASYYYITHG TISTYNEHLK PTMGDIELYR LFSLSEEFKY VTVRQDEKME LAKLLDRVPI
PIKESLEEPS AKINVLLQAY ISQLKLEGLS LTSDMVYITQ SAGRLLRALF EIVLKRGWAQ
LAEKALNLCK MVTKRMWNVQ TPLRQFHGIP NEILMKLEKK DLAWDRYYDL SSQEIGELIR
FQKMGRTLHR FIHQFPKLNL AAHVQPITRT VLRVELTITP DFQWEDKVHG YVEPFWVIVE
DNDGEYVLHH EYFLLKKQYI DEDHTLNFTV PIYEPLPPQY FIRVVSDKWL GSQTILPVSF
RHLILPEKYP PPTELLDLQP LPVTALRNPS YEALYQDFKH FNPVQTQVFT VLYNTDDNVL
VAAPTGSGKT ICAEFAILRN HQKGPDSIMR VVYIAPLEAI AKERYRDWEK KFGRGLGMRV
VELTGETSMD LKLLEKGQIV ISTPEKWDAL SRRWKQRKYV QQVSVFIVDE LHLIGGQGGP
VLEVIVSRMR YIASQVENKI RIVALSTSLA NAKDLGEWIG ATSHGLFNFP PGVRPVPLEI
HIQGVDIANF EARMQAMTKP TYTAVVQHAK NGKPAIVFVP TRKHVRLTAV DLMSYSKVDN
EEPAFRLRSA EELKPFVDKI SEETLRTTLE HGVGYLHEGL NSLDQEVVSQ LFEAGWIQVC
VMSSSLCWGV PLSAHLVVVM GTQYYDGREN AHTDYPVTDL LQMMGHASRP LLDNSGKCVI
LCHAPRKEYY KKFLYEAFPV ESHLHHFLHD NFNAEIVALV IENKQDAVDY LTWTFMYRRL
TQNPNYYNLQ GVSHRHLSDH LSELVENTLT DLEASKCITI EDDMDLSPLN LGMIASYYYI
SYTTIERFSS SLTSKTKMKG LLEILASASE YAQLPIRPGE EDVLRRLINH QRFSFENPRC
TDPHVKANAL LQAHFTRQHV GGNLALDQRE VLLYATRLLQ AMVDVISSNG WLSLALLAME
VSQMVTQGMW ERDSMLLQLP HFTKDLAKRC QENPGKNIET IFDLVEMEDD ERRELLQMSD
LQLLDIAKFC NRFPNIDLSY DVLEGENVRA GENVTLQVTL ERDLEGRTEV GPVDAPRYPK
AKEEGWWLVV GETRSNQLLA IKRVSLQRKA KVKLEFAAPT EAAKKAYTLY FMCDSYLGCD
QEYNFTVDAK EAAGPDEDSG KE
//