ID C5Y385_SORBI Unreviewed; 1878 AA.
AC C5Y385;
DT 01-SEP-2009, integrated into UniProtKB/TrEMBL.
DT 02-NOV-2016, sequence version 2.
DT 27-MAR-2024, entry version 79.
DE RecName: Full=DNA-directed RNA polymerase subunit {ECO:0000256|RuleBase:RU004279};
DE EC=2.7.7.6 {ECO:0000256|RuleBase:RU004279};
GN ORFNames=SORBI_3005G131800 {ECO:0000313|EMBL:EES08565.2};
OS Sorghum bicolor (Sorghum) (Sorghum vulgare).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; PACMAD clade;
OC Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum.
OX NCBI_TaxID=4558 {ECO:0000313|EMBL:EES08565.2, ECO:0000313|Proteomes:UP000000768};
RN [1] {ECO:0000313|EMBL:EES08565.2, ECO:0000313|Proteomes:UP000000768}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. BTx623 {ECO:0000313|Proteomes:UP000000768};
RX PubMed=19189423; DOI=10.1038/nature07723;
RA Paterson A.H., Bowers J.E., Bruggmann R., Dubchak I., Grimwood J.,
RA Gundlach H., Haberer G., Hellsten U., Mitros T., Poliakov A., Schmutz J.,
RA Spannagl M., Tang H., Wang X., Wicker T., Bharti A.K., Chapman J.,
RA Feltus F.A., Gowik U., Grigoriev I.V., Lyons E., Maher C.A., Martis M.,
RA Narechania A., Otillar R.P., Penning B.W., Salamov A.A., Wang Y., Zhang L.,
RA Carpita N.C., Freeling M., Gingle A.R., Hash C.T., Keller B., Klein P.,
RA Kresovich S., McCann M.C., Ming R., Peterson D.G., Mehboob-ur-Rahman,
RA Ware D., Westhoff P., Mayer K.F., Messing J., Rokhsar D.S.;
RT "The Sorghum bicolor genome and the diversification of grasses.";
RL Nature 457:551-556(2009).
RN [2] {ECO:0000313|Proteomes:UP000000768}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. BTx623 {ECO:0000313|Proteomes:UP000000768};
RX PubMed=29161754; DOI=10.1111/tpj.13781;
RA McCormick R.F., Truong S.K., Sreedasyam A., Jenkins J., Shu S., Sims D.,
RA Kennedy M., Amirebrahimi M., Weers B.D., McKinley B., Mattison A.,
RA Morishige D.T., Grimwood J., Schmutz J., Mullet J.E.;
RT "The Sorghum bicolor reference genome: improved assembly, gene annotations,
RT a transcriptome atlas, and signatures of genome organization.";
RL Plant J. 93:338-354(2018).
CC -!- FUNCTION: DNA-dependent RNA polymerase catalyzes the transcription of
CC DNA into RNA using the four ribonucleoside triphosphates as substrates.
CC {ECO:0000256|RuleBase:RU004279}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=a ribonucleoside 5'-triphosphate + RNA(n) = diphosphate +
CC RNA(n+1); Xref=Rhea:RHEA:21248, Rhea:RHEA-COMP:14527, Rhea:RHEA-
CC COMP:17342, ChEBI:CHEBI:33019, ChEBI:CHEBI:61557, ChEBI:CHEBI:140395;
CC EC=2.7.7.6; Evidence={ECO:0000256|RuleBase:RU004279};
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the RNA polymerase beta' chain family.
CC {ECO:0000256|ARBA:ARBA00006460, ECO:0000256|RuleBase:RU004279}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM000764; EES08565.2; -; Genomic_DNA.
DR STRING; 4558.C5Y385; -.
DR EnsemblPlants; EES08565; EES08565; SORBI_3005G131800.
DR Gramene; EES08565; EES08565; SORBI_3005G131800.
DR HOGENOM; CLU_083452_0_0_1; -.
DR InParanoid; C5Y385; -.
DR OrthoDB; 7998at2759; -.
DR Proteomes; UP000000768; Chromosome 5.
DR ExpressionAtlas; C5Y385; baseline and differential.
DR GO; GO:0009536; C:plastid; IEA:UniProtKB-KW.
DR GO; GO:0005665; C:RNA polymerase II, core complex; IBA:GO_Central.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003899; F:DNA-directed 5'-3' RNA polymerase activity; IEA:UniProtKB-EC.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0006366; P:transcription by RNA polymerase II; IEA:InterPro.
DR CDD; cd02584; RNAP_II_Rpb1_C; 1.
DR CDD; cd02733; RNAP_II_RPB1_N; 1.
DR Gene3D; 1.10.132.30; -; 1.
DR Gene3D; 1.10.150.390; -; 1.
DR Gene3D; 2.40.40.20; -; 1.
DR Gene3D; 3.30.1360.140; -; 1.
DR Gene3D; 6.10.250.2940; -; 1.
DR Gene3D; 6.20.50.80; -; 1.
DR Gene3D; 3.30.1490.180; RNA polymerase ii; 1.
DR Gene3D; 4.10.860.120; RNA polymerase II, clamp domain; 2.
DR Gene3D; 1.10.274.100; RNA polymerase Rpb1, domain 3; 1.
DR InterPro; IPR045867; DNA-dir_RpoC_beta_prime.
DR InterPro; IPR000722; RNA_pol_asu.
DR InterPro; IPR000684; RNA_pol_II_repeat_euk.
DR InterPro; IPR006592; RNA_pol_N.
DR InterPro; IPR007080; RNA_pol_Rpb1_1.
DR InterPro; IPR007066; RNA_pol_Rpb1_3.
DR InterPro; IPR042102; RNA_pol_Rpb1_3_sf.
DR InterPro; IPR007083; RNA_pol_Rpb1_4.
DR InterPro; IPR007081; RNA_pol_Rpb1_5.
DR InterPro; IPR007075; RNA_pol_Rpb1_6.
DR InterPro; IPR007073; RNA_pol_Rpb1_7.
DR InterPro; IPR038593; RNA_pol_Rpb1_7_sf.
DR InterPro; IPR044893; RNA_pol_Rpb1_clamp_domain.
DR InterPro; IPR038120; Rpb1_funnel_sf.
DR PANTHER; PTHR19376; DNA-DIRECTED RNA POLYMERASE; 1.
DR PANTHER; PTHR19376:SF68; DNA-DIRECTED RNA POLYMERASE SUBUNIT; 1.
DR Pfam; PF04997; RNA_pol_Rpb1_1; 1.
DR Pfam; PF00623; RNA_pol_Rpb1_2; 1.
DR Pfam; PF04983; RNA_pol_Rpb1_3; 1.
DR Pfam; PF05000; RNA_pol_Rpb1_4; 1.
DR Pfam; PF04998; RNA_pol_Rpb1_5; 1.
DR Pfam; PF04992; RNA_pol_Rpb1_6; 1.
DR Pfam; PF04990; RNA_pol_Rpb1_7; 1.
DR PRINTS; PR01217; PRICHEXTENSN.
DR SMART; SM00663; RPOLA_N; 1.
DR SUPFAM; SSF64484; beta and beta-prime subunits of DNA dependent RNA-polymerase; 1.
DR PROSITE; PS00115; RNA_POL_II_REPEAT; 8.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW DNA-directed RNA polymerase {ECO:0000256|ARBA:ARBA00022478,
KW ECO:0000256|RuleBase:RU004279};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695,
KW ECO:0000256|RuleBase:RU004279}; Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000000768};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transcription {ECO:0000256|ARBA:ARBA00023163,
KW ECO:0000256|RuleBase:RU004279};
KW Transferase {ECO:0000256|ARBA:ARBA00022679, ECO:0000256|RuleBase:RU004279};
KW Zinc {ECO:0000256|ARBA:ARBA00022833}.
FT DOMAIN 242..548
FT /note="RNA polymerase N-terminal"
FT /evidence="ECO:0000259|SMART:SM00663"
FT REGION 1717..1878
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1878 AA; 209557 MW; 4B0622B87F789A4E CRC64;
MDVRFPHSPA EVAKVELVQF GILSPDEIRQ MSVAQIEYAE TMEKGKPKAG GLSDPRMGTV
DRKIKCQTCM SGMAECPGHF GHLELAKPMF HIGFMKTVLS IMRCVCFNCS KILVDEDNTK
FKQALKIRNP KNRLKRIYDA CKTKKVCGEG DDLDVQQQQD TDEHVNRRGG CGAQQPIITV
DGVKMLVEFK APKKKNDDQE QLSELVEPKQ ILSAERVLNV LKNISDKESL LLGLHPKLAR
PDWMILQVLP IPPPPVRPSV MMGTSSKSED DLTHQLAMII RHNENLRRQE RNGAPAHIIE
EFSQLLQFHI ATYFDNELPG QPRATQRSGR PIKSICSRLK AKEGRVRGNL MGKRVDFSAR
TVITPDPNIN IDEVGVPWSI ALNLTYPETV TPYNIERLKE LVLNGPHPPP GKTGAKYIIR
EDGQRLDLRY VKKSSDQHLE LGYKVERHLN DGDFVLFNRQ PSLHKMSIMG HRIKIMPYST
FRLNLSVTSP YNADFDGDEM NMHVPQSFET RAEVLELMMV PKCIVSPQSN KPVMGIVQDT
LLGCRKITKR DTLIEKDVFM NILMWWEDFD GKIPAPAILK PRPIWTGKQV FNLIIPRQIN
LIRFCSWHSD EEKGFVTPSD TVVRIEKGEL LSGTLCKKTL GTGSGSLIHI IWEEVGPDAA
RKFLGYTQWL VNYWLLQNGF SIGIGDMIAD AATMADINKA ISGAKDCVKK LIKQARDEQL
KAEPGHTMME SFENKVNEAL NKARDNVGSN VQNNLSESNN LKAMVSAGSK GSFINISQMA
ACVGQQNIEG KRIPFGFSDR TLPHFTKDDY GPESRGFVEN SYLQGLTPQE FFFHAMGGRE
GLIDTAVKTS ETGYIQRRLV KAMEDIMVKY DGTVRNSLGD VVQFIYGEDG MDAVWIEEQN
LDSLKMKKEE FDTVFRYELD DENWRPNYLL PVHFDDLKTI SEFRSVLEAE VHKLEADRFQ
LGTEIATNGS HTWPLPVNLK RLIWNAQKTF KIDIRTCSDM HPIEIVEAID KLQERLKVVP
GDDAISIEAQ KNATLFFNIH LRATFASKRV LREYRLTREA FEWIIDEIVW RFSQSLVAPS
EMIGCVAAQS IGEPATQMTL NTFHFAGVSA KNVTLGVPRL REIINVAKKI KTPSLSVYLK
PEVNKTKELA KSVQCALEYT TLRSVTHATE VWYDPDPMGT IIEEDLEFVL SYYEMPDEDI
DPDKISPWLL RIELNREMMV DKKLSMADIA NEINREFDGD LSCIFSDDNA DKLILRLRII
NDEASRGEIL DDSAGDEVFL KKIESNMLAE MSLRGIPDIN KVFIKEVKVN KFDVSDGFKE
HKEWMLDTDG VNLLAVMCHE DVDATRTTSN HLTEVIEVLG IEAVRRALLD ELRSVISFDG
SYVNYRHLAV LCDTMTYRGH LMAITRHGIN RNDTGPLMRC SFEETVDILL DAAVHAESDY
LRGVTENIML GQLAPIGTGG CSLFLNDQML QQAIELQLPS YVDGLDFGIT SAPSPLSSPL
SGSSYHEGLM SSSHLLSPDF RHSPIDTYAS FSPYVRDMSL SPVPSQGGYS PIPRSESGYS
PIPRSVRYMA VSPIYTPYNA ASPVNASTST SYTPVSPVYS PTTPVYTPMS PVYSPTTPVS
SQIQSYSSAS PIYNPISPNS SRYNPSSPSF SPTSTSPSYS PASASYSPTS PFYSPTSPSY
SPASLSYSPT SPFYSPTSPS YSPASLSYSP TSPFYSPTSP SYSPSSPSYS PTSPTYSPTS
LGYSPTSPGY SPTSPTYSPT SPNYSPTSPS YNLSYTEYNS SNTYSPTRPR ITLDSQAPDY
SPTSPTYSPT SPSYSLPSLS YSPNSNLCRP TDDSGRPSPD YSPTSPNFSP NRSDSPTAAD
YSPSSTGQAI EKDGETSH
//