ID A0A061DF84_THECC Unreviewed; 1512 AA.
AC A0A061DF84;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 42.
DE RecName: Full=DNA-directed RNA polymerase II subunit RPB1 {ECO:0000256|ARBA:ARBA00016625};
DE EC=2.7.7.6 {ECO:0000256|ARBA:ARBA00012418};
GN ORFNames=TCM_000127 {ECO:0000313|EMBL:EOX90745.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOX90745.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOX90745.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- CATALYTIC ACTIVITY:
CC Reaction=a ribonucleoside 5'-triphosphate + RNA(n) = diphosphate +
CC RNA(n+1); Xref=Rhea:RHEA:21248, Rhea:RHEA-COMP:14527, Rhea:RHEA-
CC COMP:17342, ChEBI:CHEBI:33019, ChEBI:CHEBI:61557, ChEBI:CHEBI:140395;
CC EC=2.7.7.6; Evidence={ECO:0000256|ARBA:ARBA00024550};
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the RNA polymerase beta' chain family.
CC {ECO:0000256|ARBA:ARBA00006460}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001879; EOX90745.1; -; Genomic_DNA.
DR EnsemblPlants; EOX90745; EOX90745; TCM_000127.
DR Gramene; EOX90745; EOX90745; TCM_000127.
DR Proteomes; UP000026915; Chromosome 1.
DR GO; GO:0000428; C:DNA-directed RNA polymerase complex; IEA:UniProtKB-KW.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003899; F:DNA-directed 5'-3' RNA polymerase activity; IEA:UniProtKB-EC.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0006366; P:transcription by RNA polymerase II; IEA:InterPro.
DR CDD; cd02584; RNAP_II_Rpb1_C; 1.
DR CDD; cd02733; RNAP_II_RPB1_N; 1.
DR Gene3D; 1.10.132.30; -; 1.
DR Gene3D; 1.10.150.390; -; 1.
DR Gene3D; 2.40.40.20; -; 1.
DR Gene3D; 3.30.1360.140; -; 1.
DR Gene3D; 6.10.250.2940; -; 1.
DR Gene3D; 6.20.50.80; -; 1.
DR Gene3D; 3.30.1490.180; RNA polymerase ii; 1.
DR Gene3D; 1.10.274.100; RNA polymerase Rpb1, domain 3; 1.
DR InterPro; IPR045867; DNA-dir_RpoC_beta_prime.
DR InterPro; IPR000722; RNA_pol_asu.
DR InterPro; IPR000684; RNA_pol_II_repeat_euk.
DR InterPro; IPR006592; RNA_pol_N.
DR InterPro; IPR007066; RNA_pol_Rpb1_3.
DR InterPro; IPR042102; RNA_pol_Rpb1_3_sf.
DR InterPro; IPR007083; RNA_pol_Rpb1_4.
DR InterPro; IPR007081; RNA_pol_Rpb1_5.
DR InterPro; IPR007075; RNA_pol_Rpb1_6.
DR InterPro; IPR007073; RNA_pol_Rpb1_7.
DR InterPro; IPR038593; RNA_pol_Rpb1_7_sf.
DR InterPro; IPR038120; Rpb1_funnel_sf.
DR PANTHER; PTHR19376; DNA-DIRECTED RNA POLYMERASE; 1.
DR PANTHER; PTHR19376:SF37; DNA-DIRECTED RNA POLYMERASE II SUBUNIT RPB1; 1.
DR Pfam; PF00623; RNA_pol_Rpb1_2; 1.
DR Pfam; PF04983; RNA_pol_Rpb1_3; 1.
DR Pfam; PF05000; RNA_pol_Rpb1_4; 1.
DR Pfam; PF04998; RNA_pol_Rpb1_5; 1.
DR Pfam; PF04992; RNA_pol_Rpb1_6; 1.
DR Pfam; PF04990; RNA_pol_Rpb1_7; 1.
DR Pfam; PF05001; RNA_pol_Rpb1_R; 17.
DR PRINTS; PR01217; PRICHEXTENSN.
DR SMART; SM00663; RPOLA_N; 1.
DR SUPFAM; SSF64484; beta and beta-prime subunits of DNA dependent RNA-polymerase; 1.
DR PROSITE; PS00115; RNA_POL_II_REPEAT; 14.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW DNA-directed RNA polymerase {ECO:0000256|ARBA:ARBA00022478,
KW ECO:0000313|EMBL:EOX90745.1};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Zinc {ECO:0000256|ARBA:ARBA00022833}.
FT DOMAIN 1..198
FT /note="RNA polymerase N-terminal"
FT /evidence="ECO:0000259|SMART:SM00663"
FT REGION 1189..1512
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 344..371
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 1189..1410
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1423..1493
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1494..1512
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1512 AA; 167661 MW; 74DFE9EE20974F57 CRC64;
MGKRVDFSAR TVITPDPNIN IDELGVPWSI ALNLTYPETV TPYNIERLKE LVEYGPHPPP
GKTGAKYIIR DDGQRLDLRY LKKSSDHHLE LGYKVERHLN DGDFVLFNRQ PSLHKMSIMG
HRIRIMPYST FRLNLSVTSP YNADFDGDEM NMHVPQSFET RAEVLELMMV PKCIVSPQSN
RPVMGIVQDT LLGCRKITKR DTFIEKDVFM NILMWWEDFD GKVPAPAILK PRPLWTGKQV
FNLIIPKQIN LLRNSAWHSE TETGFITPGD TQVRIEKGEL LSGTLCKKAL GTSSGSLIHV
IWEEVGPDAA RKFLGHTQWL VNYWLLQNAF SIGIGDTIAD AATMEKINET ISKAKEEVKN
LIVKAQNKDL EPEPGRTMME SFENKVNQVL NKARDDAGNS AQKSLSESNN LKAMVTAGSK
GSFINISQMT ACVGQQNVEG KRIPFGFIDR TLPHFTKDDY GPESRGFVEN SYLRGLTPQE
FFFHAMGGRE GLIDTAVKTS ETGYIQRRLV KAMEDIMVKY DGTVRNSLGD VIQFLYGEDG
MDSVWIESQK LDSLKMKKSE FDRVFRYNID DESWNPTSYM LPEHIEDLRT IQELRDVFEA
EVQKLDADRY QLGTEIAVTG DSNWPLPVNL KRLIWNAQKT FKVDFRRVSD LHPVEIVDSV
DKLQERLKVV PGTDPLSVEA QKNATLFFSI LLRSTLASKR VLQEYRLTKE AFEWVIGEIE
SRFLQSLVAP GEMIGCVAAQ SIGEPATQMT LNTFHYAGVS AKNVTLGVPR LREIINVAKK
IKTPSLSVYL SPEASKTKEK AKNVQCALEY TTLRSVTHAT EVWYDPDPTS TIIEEDIDFV
KSYYEMPDEE VAPEKISPWL LRIELNREMM VDKKLSMADI AEKINLEFDD DLTCIFNDDN
AEKLILRIRI MNDEGPKGEL NDESAEDDVF LKKIESNMLT EMALRGIPDI NKVFIKHSKA
SKFDEADGYK TGEEWVLDTE GVNLLAVMCH EDVDARRTTS NHLIEVIEVL GIEAVRRSLL
DELRVVISFD GSYVNYRHLA ILCDTMTYRG HLMAITRHGI NRNDTGPMMR CSFEETVDIL
LDAAVYAESD YLRGVTENIM LGQLAPIGTG DCALYLNDEM LKNAIELQLP SYMEGLEFGM
TPARSPVSGT PYHEGMMSPS YLLSPNLRLS PITDAQFSPY VGGMAFSPTS SPGYSPSSPG
YSPSSPGYSP TSPGYSPTSP GYSPTSPGYS PTSPTYSPSS PGYSPTSPAY SPTSPSYSPT
SPSYSPTSPS YSPTSPSYSP TSPSYSPTSP SYSPTSPSYS PTSPVYSPTS PAYSPTSPAY
SPTSPSYSPT SPSYSPTSPS YSPTSPSYSP TSPSYSPTSP AYSPTSPGYS PTSPSYSPTS
PSYSPTSPSY NPQSAKYSPS LAYSPSSPRL SPSSPYSPTS PNYSPTSPSY SPTSPSYSPS
SPTYSPSSSP YNSGVSPDYS PSSPQYSPSA GYSPSAPGYS PSSTSQYTPQ TSNKDDRATK
DDRSSKDDRS KR
//