ID A0A061G5Z9_THECC Unreviewed; 1484 AA.
AC A0A061G5Z9;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 57.
DE RecName: Full=DNA-directed RNA polymerase subunit {ECO:0000256|RuleBase:RU004279};
DE EC=2.7.7.6 {ECO:0000256|RuleBase:RU004279};
GN ORFNames=TCM_016168 {ECO:0000313|EMBL:EOY24612.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY24612.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY24612.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- FUNCTION: DNA-dependent RNA polymerase catalyzes the transcription of
CC DNA into RNA using the four ribonucleoside triphosphates as substrates.
CC {ECO:0000256|RuleBase:RU004279}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=a ribonucleoside 5'-triphosphate + RNA(n) = diphosphate +
CC RNA(n+1); Xref=Rhea:RHEA:21248, Rhea:RHEA-COMP:14527, Rhea:RHEA-
CC COMP:17342, ChEBI:CHEBI:33019, ChEBI:CHEBI:61557, ChEBI:CHEBI:140395;
CC EC=2.7.7.6; Evidence={ECO:0000256|RuleBase:RU004279};
CC -!- SIMILARITY: Belongs to the RNA polymerase beta' chain family.
CC {ECO:0000256|RuleBase:RU004279}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001881; EOY24612.1; -; Genomic_DNA.
DR STRING; 3641.A0A061G5Z9; -.
DR EnsemblPlants; EOY24612; EOY24612; TCM_016168.
DR Gramene; EOY24612; EOY24612; TCM_016168.
DR eggNOG; KOG0260; Eukaryota.
DR HOGENOM; CLU_002449_0_1_1; -.
DR InParanoid; A0A061G5Z9; -.
DR OMA; FPFTILH; -.
DR Proteomes; UP000026915; Chromosome 3.
DR GO; GO:0005665; C:RNA polymerase II, core complex; IBA:GO_Central.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0003899; F:DNA-directed 5'-3' RNA polymerase activity; IEA:UniProtKB-EC.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0006366; P:transcription by RNA polymerase II; IEA:GOC.
DR CDD; cd10506; RNAP_IV_RPD1_N; 1.
DR Gene3D; 1.10.132.30; -; 1.
DR Gene3D; 2.40.40.20; -; 1.
DR Gene3D; 3.10.450.40; -; 1.
DR Gene3D; 6.20.50.80; -; 1.
DR Gene3D; 3.30.1490.180; RNA polymerase ii; 1.
DR Gene3D; 4.10.860.120; RNA polymerase II, clamp domain; 1.
DR Gene3D; 1.10.274.100; RNA polymerase Rpb1, domain 3; 1.
DR InterPro; IPR045867; DNA-dir_RpoC_beta_prime.
DR InterPro; IPR040403; NRPD1_N.
DR InterPro; IPR000722; RNA_pol_asu.
DR InterPro; IPR006592; RNA_pol_N.
DR InterPro; IPR007080; RNA_pol_Rpb1_1.
DR InterPro; IPR007066; RNA_pol_Rpb1_3.
DR InterPro; IPR042102; RNA_pol_Rpb1_3_sf.
DR InterPro; IPR007083; RNA_pol_Rpb1_4.
DR InterPro; IPR007081; RNA_pol_Rpb1_5.
DR InterPro; IPR044893; RNA_pol_Rpb1_clamp_domain.
DR InterPro; IPR038120; Rpb1_funnel_sf.
DR PANTHER; PTHR19376; DNA-DIRECTED RNA POLYMERASE; 1.
DR PANTHER; PTHR19376:SF36; DNA-DIRECTED RNA POLYMERASE IV SUBUNIT 1; 1.
DR Pfam; PF11523; DUF3223; 1.
DR Pfam; PF04997; RNA_pol_Rpb1_1; 1.
DR Pfam; PF00623; RNA_pol_Rpb1_2; 1.
DR Pfam; PF04983; RNA_pol_Rpb1_3; 1.
DR Pfam; PF05000; RNA_pol_Rpb1_4; 1.
DR Pfam; PF04998; RNA_pol_Rpb1_5; 1.
DR SMART; SM00663; RPOLA_N; 1.
DR SUPFAM; SSF64484; beta and beta-prime subunits of DNA dependent RNA-polymerase; 1.
PE 3: Inferred from homology;
KW DNA-directed RNA polymerase {ECO:0000256|ARBA:ARBA00022478,
KW ECO:0000256|RuleBase:RU004279}; Magnesium {ECO:0000256|ARBA:ARBA00022842};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695,
KW ECO:0000256|RuleBase:RU004279};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Transcription {ECO:0000256|ARBA:ARBA00023163,
KW ECO:0000256|RuleBase:RU004279};
KW Transferase {ECO:0000256|ARBA:ARBA00022679, ECO:0000256|RuleBase:RU004279};
KW Zinc {ECO:0000256|ARBA:ARBA00022833}.
FT DOMAIN 228..517
FT /note="RNA polymerase N-terminal"
FT /evidence="ECO:0000259|SMART:SM00663"
SQ SEQUENCE 1484 AA; 167428 MW; 32100A003A5E7E13 CRC64;
MTSMENDLYE AEQLPAAFVT GIRFNVSNDR DNEKMSVMEI AAPSEVSDPK LGFPNFSNHC
TTCGAADMKH CEGHFGVINF PYAILHPYFL SEVVQILNKI CPGCKSVRKD LRIKGANSVS
KVNQRKGCKY CVGNSIDWYP PMNFKISSKD LFRKSAIIVE VSEKSSMKVR KRGKQALPSD
YWDFIPKDEQ QEESLIRPNR RVLSHSQVRY LLKDVDPEFI KKFVLSMDSI FLNCFPVTPN
SHRVTEIMHA SSNGQRLIFD QRTRVYKKLA DFRGIANELS SHVLECLKIS KLHLEKPSNE
ESALVLAQKR NKDSASNMSG LRYMKDVILG KRNDHCFRMV LTGNPNLKLS EISIPCHVAE
RLQIAEQLNN WNEERLKACC DLRLLEKGEI HVRREGRLVR IRHNEKLQVG DTIFRPLNNG
DIILINRPPS IHQHSLIALS VKVLPVSSVV SINPLICSPF RGDFDGDCLH GYVPQSIKAR
VELIELVSLN RQLINGQSGR NLLSLSHDSL TAAYLVKEDG VLLNLFQMQQ LEMFCPNHSP
FPAIVKAPLL RSSVWTGKQL LSMLFPLEFD YDFTPNDVVI RNGELISSSE GSTWLRDADG
NLFQSLIKHY QGKVLDFLYA AQEVLCEWLS MRGLSVSLSD LYLSSDSNSQ KNMMDEIFCG
LQEIEQTCNF KQLMVDSNHD FLVGHDEEVD SFMALDVEQM CYEKQRSAAL SQASVDSFKQ
VFRDIQNLLY KYANKDNSLL TMFKAGSKGN LLKLVQHSLC LGLQHSLVPL SFRFPHQLSC
AAWNNQKSHG LTQKVDDTAE SAKNYIPYAV VESSFMTGLN PLESFVHSVT SRDSSFSDNA
DLPGTLSRRL MFFMRDLYTA YDGTVRNSYG DLVVQFCYDI DKDASSPTSC AHGLISESST
IPEGIGGQPV GSLSACAISE AAYSALDQPV SLLETSPLLN LKRVVECGSK RSNADQTMTL
FLSNKLGRKR HGSEYAALEV KNHLERLTFS DIVTTVSIIF SPQMYRENHF TPWVCHFHVC
KDTMKRRQLK VQSIIDSLHM HCTTAKTMWK ISLPDMQITS NGRACSHIDM PNEDDTFCIT
VTIVEYSKSS HMELDVIRDM VIPYLLEAVI KGFPEIKKVD ILWKDRLKVS KPHKSSCGEL
YLRVFVSGDF GITKLWGVLM NDCLQIMDMI DWTRSHPDNI NQLCLAYGID AGWKFFLNNL
KSAISDTGKT ILNEHLLVVA NCLSATGEFV GLNSKGLRQQ REHAYVSSPF MQACFSNPSA
SFVKAAKTGA SDDLQGTIDA LAWGRIPHIG TGGQFDIIYS VKDQRLAEPV DVYKLLGSSI
SSQKQDVEFE VPKALNFKSE KYGSLLIDAL GDSASEELKK IETKRRSIWR ELLTLDDIQR
LSRALRNILH KYPIDHRLSE ADWNTLMMAL YFHPRRDEKI GSGAQEIKVG YHPEHANARC
FSLVRTDGTI VDFSYHKCVL GALEIIAPHR AKSYKSKWLQ SGSL
//