ID R6B327_9CLOT Unreviewed; 919 AA.
AC R6B327;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 32.
DE SubName: Full=RNA polymerase subunit alpha {ECO:0000313|EMBL:CDA53361.1};
GN ORFNames=BN491_01513 {ECO:0000313|EMBL:CDA53361.1};
OS Clostridium sp. CAG:138.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC Clostridium.
OX NCBI_TaxID=1262775 {ECO:0000313|EMBL:CDA53361.1, ECO:0000313|Proteomes:UP000017905};
RN [1] {ECO:0000313|EMBL:CDA53361.1, ECO:0000313|Proteomes:UP000017905}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:138 {ECO:0000313|Proteomes:UP000017905};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDA53361.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBBZ010000106; CDA53361.1; -; Genomic_DNA.
DR AlphaFoldDB; R6B327; -.
DR STRING; 1262775.BN491_01513; -.
DR Proteomes; UP000017905; Unassembled WGS sequence.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0003899; F:DNA-directed 5'-3' RNA polymerase activity; IEA:InterPro.
DR GO; GO:0006352; P:DNA-templated transcription initiation; IEA:InterPro.
DR CDD; cd06171; Sigma70_r4; 1.
DR Gene3D; 1.10.150.20; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 1.
DR InterPro; IPR007630; RNA_pol_sigma70_r4.
DR InterPro; IPR013324; RNA_pol_sigma_r3/r4-like.
DR InterPro; IPR011260; RNAP_asu_C.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR Pfam; PF03118; RNA_pol_A_CTD; 1.
DR Pfam; PF04545; Sigma70_r4; 1.
DR SUPFAM; SSF47789; C-terminal domain of RNA polymerase alpha subunit; 2.
DR SUPFAM; SSF88659; Sigma3 and sigma4 domains of RNA polymerase sigma factors; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000017905}.
FT DOMAIN 127..180
FT /note="RNA polymerase alpha subunit C-terminal"
FT /evidence="ECO:0000259|Pfam:PF03118"
FT DOMAIN 337..368
FT /note="RNA polymerase sigma-70 region 4"
FT /evidence="ECO:0000259|Pfam:PF04545"
SQ SEQUENCE 919 AA; 104092 MW; 21BC2E489C6B2A6B CRC64;
MMNRTNKTEI ITKDDIRNTE ATLSDLLNVS PDVCGDDETT VLDLPVRSEK RIAGIGIRYV
KELLSIKLGR LLAIGGMGLG SLNDIAGAIA KRISEAPGDD GDGSKDIPDA PFTSNAQAEC
NELGRPVYIE ELGLSVRSYN CLKRSGITVV SDLEKLSRDE LLSIKNLGAS SADEILNVLK
DFDPKEKLEK RLEAEQEERE RQEECSALAS ELFTAYGCVP SEWSSMIRKC LIKEPGTGIG
GAVIRLYEQT ELFDHLRNLL LNLVEAHQDK LDVDELTTLL PKHLQGTDLA DKAVDSLKAD
GAVAIEDGII IRIYPSISDY VKQIEDERMR SVWSLKLEGL TLEQIGERIG GVTRERVRQI
LNKALKQLPR LREDRYLYMF GNYSFSYEDL NIAFGEPEYV FNYLDLAAGT KAAERRPLSE
ALEDENIKEE DRRNIERAVF KNSVMLDGVR VEKVRPELVK RFIKSRCKED THIDDLAGSF
NAAMAELGIT DERLFLPETN TVENHLQIAD YILWKNGRSF RAYDIESKDI DRLWSELDLE
QYEGLDISAL MLLRSEPELM REFDIRDEYE LHNLLKKTAA GRIRGLSFGR MPMLSVGSAS
RTEQMRELLV ENAPIEADTL CGLYEELYGM KAEVIKAYYL KEFDRYYHNG LFTFDAVRMP
PYVLSRMKEL LSNDSYTIAE IKSIFRREFP EESPEEINAY NLKELGFAVY SGYVIVNSYS
NAREYFRAVL TENEITDLTG EWERFGRFPT FSQEVYALED LRTITEYEPN RFIGIGKLNE
MGVTKEDLDD FCRSVRDHAG RGEYFTVKSL LGSGFDSPLV HSGLGEYFLS SVLASGEEGF
PYQRMGGVKL FCSSGERPSL PAFLRSIVRG HGSILIADLE NLLRDEYGIE LPPYKLTELI
KESGTGYDPI TRTAFDASR
//