ID A0A068VDP0_COFCA Unreviewed; 1946 AA.
AC A0A068VDP0;
DT 01-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 01-OCT-2014, sequence version 1.
DT 27-MAR-2024, entry version 35.
DE RecName: Full=DNA-directed RNA polymerase {ECO:0000256|ARBA:ARBA00012418};
DE EC=2.7.7.6 {ECO:0000256|ARBA:ARBA00012418};
GN ORFNames=GSCOC_T00006409001 {ECO:0000313|EMBL:CDP18669.1};
OS Coffea canephora (Robusta coffee).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; lamiids; Gentianales; Rubiaceae; Ixoroideae; Gardenieae complex;
OC Bertiereae - Coffeeae clade; Coffeeae; Coffea.
OX NCBI_TaxID=49390 {ECO:0000313|EMBL:CDP18669.1, ECO:0000313|Proteomes:UP000295252};
RN [1] {ECO:0000313|Proteomes:UP000295252}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. DH200-94 {ECO:0000313|Proteomes:UP000295252};
RX PubMed=25190796; DOI=10.1126/science.1255274;
RA Denoeud F., Carretero-Paulet L., Dereeper A., Droc G., Guyot R.,
RA Pietrella M., Zheng C., Alberti A., Anthony F., Aprea G., Aury J.M.,
RA Bento P., Bernard M., Bocs S., Campa C., Cenci A., Combes M.C.,
RA Crouzillat D., Da Silva C., Daddiego L., De Bellis F., Dussert S.,
RA Garsmeur O., Gayraud T., Guignon V., Jahn K., Jamilloux V., Joet T.,
RA Labadie K., Lan T., Leclercq J., Lepelley M., Leroy T., Li L.T.,
RA Librado P., Lopez L., Munoz A., Noel B., Pallavicini A., Perrotta G.,
RA Poncet V., Pot D., Priyono X., Rigoreau M., Rouard M., Rozas J.,
RA Tranchant-Dubreuil C., VanBuren R., Zhang Q., Andrade A.C., Argout X.,
RA Bertrand B., de Kochko A., Graziosi G., Henry R.J., Jayarama X., Ming R.,
RA Nagai C., Rounsley S., Sankoff D., Giuliano G., Albert V.A., Wincker P.,
RA Lashermes P.;
RT "The coffee genome provides insight into the convergent evolution of
RT caffeine biosynthesis.";
RL Science 345:1181-1184(2014).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; HG739348; CDP18669.1; -; Genomic_DNA.
DR STRING; 49390.A0A068VDP0; -.
DR EnsemblPlants; CDP18669; CDP18669; GSCOC_T00006409001.
DR Gramene; CDP18669; CDP18669; GSCOC_T00006409001.
DR InParanoid; A0A068VDP0; -.
DR OMA; WGSQQKS; -.
DR PhylomeDB; A0A068VDP0; -.
DR Proteomes; UP000295252; Chromosome 2.
DR GO; GO:0000428; C:DNA-directed RNA polymerase complex; IEA:UniProtKB-KW.
DR GO; GO:0043229; C:intracellular organelle; IEA:UniProt.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0003899; F:DNA-directed 5'-3' RNA polymerase activity; IEA:UniProtKB-EC.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0006351; P:DNA-templated transcription; IEA:InterPro.
DR Gene3D; 2.40.40.20; -; 1.
DR Gene3D; 3.10.450.40; -; 1.
DR Gene3D; 6.20.50.80; -; 1.
DR Gene3D; 3.30.1490.180; RNA polymerase ii; 1.
DR Gene3D; 4.10.860.120; RNA polymerase II, clamp domain; 1.
DR Gene3D; 1.10.274.100; RNA polymerase Rpb1, domain 3; 1.
DR InterPro; IPR045867; DNA-dir_RpoC_beta_prime.
DR InterPro; IPR000722; RNA_pol_asu.
DR InterPro; IPR006592; RNA_pol_N.
DR InterPro; IPR042102; RNA_pol_Rpb1_3_sf.
DR InterPro; IPR007081; RNA_pol_Rpb1_5.
DR InterPro; IPR044893; RNA_pol_Rpb1_clamp_domain.
DR PANTHER; PTHR19376; DNA-DIRECTED RNA POLYMERASE; 1.
DR PANTHER; PTHR19376:SF51; DNA-DIRECTED RNA POLYMERASE V SUBUNIT 1; 1.
DR Pfam; PF11523; DUF3223; 1.
DR Pfam; PF00623; RNA_pol_Rpb1_2; 1.
DR Pfam; PF04998; RNA_pol_Rpb1_5; 1.
DR SMART; SM00663; RPOLA_N; 1.
DR SUPFAM; SSF64484; beta and beta-prime subunits of DNA dependent RNA-polymerase; 1.
PE 4: Predicted;
KW DNA-directed RNA polymerase {ECO:0000256|ARBA:ARBA00022478};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW Reference proteome {ECO:0000313|Proteomes:UP000295252};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Zinc {ECO:0000256|ARBA:ARBA00022833}.
FT DOMAIN 204..503
FT /note="RNA polymerase N-terminal"
FT /evidence="ECO:0000259|SMART:SM00663"
FT REGION 1289..1343
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1381..1463
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1514..1594
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1636..1785
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1921..1946
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1293..1309
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1381..1406
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1417..1452
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1523..1538
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1539..1561
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1656..1673
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1684..1717
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1731..1752
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1762..1778
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1925..1946
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1946 AA; 216425 MW; C32A774DCED259B9 CRC64;
MEESPTSTSF GGKITRISFS LATQQEICKS SISDCAITHA SQLSNPFLGL PLEAGKCESC
GASEPGQCHF GYIELPIPIY HPDHVRELKR LLSLLCLKCL KIRNRKFQVK NVGVLERMLS
SCCEEASQVA INEARNPDGA LYLELKVPSK IRLQGNVWSF LEKYGYRYDK NPRPLLASEV
MAMLRRLSSD TKKKLSAKGY FPQDGYILQY LPVPPNCLSV PDISDGTNVM SKDHSLSLLK
RALKQIEVIK NSRSGMPNFE SHQIEANDLQ ISVAQYFEFR GTGKASRDVD PRFGVSKESN
TSSTKAWLEK MKTLFIRKGS GFSSRSVITG DPYKGVNEIG LPFEIAQRIT FEERVSQHNM
NYLQKLVDEK LCLTYRDGMS TYSLREGSKG HTFLRPGQVV HRRIMDGDMV FINRPPTTHK
HSLQALSVYI HDDHTVKINP LICGPLSADF DGDCIHLFYP QSLAARSEVL ELFSVEKQLL
SSHTGNFNLQ LATDSLLSLK LMFKKYFFDR VAAEQLAMFV PAALPMPAVV KYRSSGPFWT
VLQLLQTALP ASFECSGERY LTHSSELVKL DFNRDLLQST FIDVITSIFF SKGPKEVLRF
FNFLTPLLME NLYSEGFSVC LEDFYIPKAI IEAVQQSLQD ISPLLYHMRS TQSESIKLQL
ENFLRGVKSP VSNFVLKSSA MGYLIDSKSE SALNKVVQQI GFLGMQISDK GKFYSSTLVN
DLAQLFKKKY PSSGHYPSEE YGLVRSCLFY GLDPYQEMVH SISSREVIVR STRGLTEPGT
LFKNLMAILR DVIICYDGTV RNMCSNSIIQ FEYGMNHGIS FQSEFGAGEP VGVLAATAMS
NPAYKAVLDS SPSSNSAWEM MKEILLCGVN FKNEVSDRRV ILYLNDCGCG RKYCRENAAY
VVKNQLRKVS LKDVAFELLI EYRQQYSVYE SSETDTGLVG HIHLNEAMMK SSNITMNEIL
SKCEERIISY QKRKKVGFKF KGVLLGVSDD CSFRQSSARK LAETPCLKFI CRDASDYQLE
QRSHVLAETI CPALLETVIK GDPRVSSVNI IWISPDTSTW ISSQCKSQRG ELALDVVLEK
DAVKQTGDAW RVVMDACLPV TQLIDTNRSI PYAIKQVQEL LGISCAFEQA VRRLSTSVML
VTKGVLKDHL VLLANSMTCA GNLIGFNIGG IKALSRSLDV QVPFTEATLS APRKCFERAA
EKCHVDSLSS VVGSCSWGKH VAVGTGSPFD ILLDTKKVEL NQPAGIDVYD FLQLVRGSSG
GDETNTTCLG AEIENLDLED EAMTFDLSPV RDSDQPTFED RHELENNLAN PRSKESIQRE
LGWERDSPQT AELGGGWEKA SKAQNTSANV LVSDSAWASW GGGTVGKEDN FSTMAKEDSR
SFTDWNSTQP GSLKQSGSSS VWGKMVDNER DSSFAAEPRS SWEQAADKSG NVWTGKKVSD
SAWSSWGSSP VDKEARFSNG VQKNSPKYGE WGAKELRSTG KQSESSPAWK KIDSLGNLPL
TAKASGGWDQ KFDKDQRHAA QTTALDPGWS SWNNCEPVER DSFSKRVQER SSSDGEWGKK
SQDTAKQSGS SFGWGKKFEA GSNSPLTTNG SASCGSGGWE LALDKAQRLV SQATVSDPTW
SSWGSGETNK EEIILNSGQG DTSNDHKWGA KESESTGKQL GFSSGWGTKV SSNENKTDEN
KDPVTVTTEN YSDWSKMNTD AVQGERSLPT NSEEGSWRSG GAVGIDTDGE RNKSTGTHAW
ENKKDAHSQR GPRKWFKGNG NESSRGWGSP SNGDWRNQRN RPAKAVDNVG ASGTFTLTKQ
RLDSFTAEEQ DILSDFEQMM QNIRRIIHQT GYNDGDPLSA DDQSYVVDNV LNYHPEKVLK
IGAGIKYIMV SKHASFQESR CFYVVSTDDH KQDFSYRKSL ENFARKKYPD KADAFLAKYF
SRKPPRPGWS RDHASTPDEA GSRQEQ
//