RNA polymerase

Search
PMC	articles
PubMed	articles
NCBI	proteins

DNA-directed RNA polymerase
DNA-directed RNA polymerase
ExPASy	NiceZyme view
KEGG	KEGG entry
MetaCyc	metabolic pathway
PRIAM	profile
PDB structures	RCSB PDB PDBe PDBsum
Gene Ontology	AmiGO / QuickGO
PMC
Search
PMC	articles
PubMed	articles
NCBI	proteins

In molecular biology, RNA polymerase (abbreviated RNAP or RNApol), or more specifically DNA-directed/dependent RNA polymerase (DdRP), is an enzyme that catalyzes the chemical reactions that synthesize RNA from a DNA template.

Using the enzyme

promoter region before RNAP can initiate the DNA unwinding at that position. RNAP not only initiates RNA transcription, it also guides the nucleotides into position, facilitates attachment and elongation, has intrinsic proofreading and replacement capabilities, and termination recognition capability. In eukaryotes

, RNAP can build chains as long as 2.4 million nucleotides.

RNAP produces RNA that, functionally, is either for protein

non-coding

(so-called "RNA genes"). Examples of four functional types of RNA genes are:

Transfer RNA (tRNA): Transfers specific
polypeptide chains at the ribosomal site of protein synthesis during translation
;
Ribosomal RNA (rRNA): Incorporates into ribosomes;
Micro RNA (miRNA): Regulates gene activity; and, RNA silencing
Catalytic RNA (ribozyme ): Functions as an enzymatically active RNA molecule.

RNA polymerase is essential to life, and is found in all living

mitochondria, and is related to modern DNA polymerases.^[2]

Eukaryotic and archaeal RNAPs have more subunits than bacterial ones do, and are controlled differently.

Bacteria and archaea only have one RNA polymerase. Eukaryotes have multiple types of nuclear RNAP, each responsible for synthesis of a distinct subset of RNA:

RNA polymerase I synthesizes a pre-rRNA 45S (35S in yeast), which matures and will form the major RNA sections of the ribosome.
RNA polymerase II synthesizes precursors of mRNAs and most sRNA and microRNAs.
RNA polymerase III synthesizes tRNAs, rRNA 5S and other small RNAs found in the nucleus and cytosol.
RNA polymerase IV and V found in plants are less understood; they make siRNA. In addition to the ssRNAPs, chloroplasts also encode and use a bacteria-like RNAP.

Structure

T. aquaticus RNA polymerase core (PDB: 1HQM).

Yeast RNA polymerase II core (PDB: 1WCM).

Homologous subunits are colored the same:^[1]

orange: α1/RPB3,

yellow: α2/RPB11,

wheat: β/RPB2,

red: β′/RPB1,

pink: ω/RPB6.

The 2006 Nobel Prize in Chemistry was awarded to Roger D. Kornberg for creating detailed molecular images of RNA polymerase during various stages of the transcription process.^[3]^[4]

In most

kDa, a beta (β) subunit of 150 kDa, a beta prime subunit (β′) of 155 kDa, and a small omega (ω) subunit. A sigma (σ) factor binds to the core, forming the holoenzyme. After transcription starts, the factor can unbind and let the core enzyme proceed with its work.^[5]^[6] The core RNA polymerase complex forms a "crab claw" or "clamp-jaw" structure with an internal channel running along the full length.^[7] Eukaryotic and archaeal RNA polymerases have a similar core structure and work in a similar manner, although they have many extra subunits.^[8]

All RNAPs contain metal cofactors, in particular zinc and magnesium cations which aid in the transcription process.^[9]^[10]

Function

5′ end

, where the longer RNA molecules are completely transcribed.

Control of the process of gene transcription affects patterns of gene expression and, thereby, allows a cell to adapt to a changing environment, perform specialized roles within an organism, and maintain basic metabolic processes necessary for survival. Therefore, it is hardly surprising that the activity of RNAP is long, complex, and highly regulated. In Escherichia coli bacteria, more than 100 transcription factors have been identified, which modify the activity of RNAP.^[11]

RNAP can initiate transcription at specific DNA sequences known as

nucleotides (the full length of the dystrophin gene). RNAP will preferentially release its RNA transcript at specific DNA sequences encoded at the end of genes, which are known as terminators

.

Products of RNAP include:

Messenger RNA (mRNA)—template for the synthesis of proteins by ribosomes.
translation
. However, since the late 1990s, many new RNA genes have been found, and thus RNA genes may play a much more significant role than previously thought.
- polypeptide chains at the ribosomal site of protein synthesis during translation
- Ribosomal RNA (rRNA)—a component of ribosomes
- Micro RNA
  —regulates gene activity
- Catalytic RNA (Ribozyme)—enzymatically active RNA molecules

RNAP accomplishes de novo synthesis. It is able to do this because specific interactions with the initiating nucleotide hold RNAP rigidly in place, facilitating chemical attack on the incoming nucleotide. Such specific interactions explain why RNAP prefers to start transcripts with ATP (followed by GTP, UTP, and then CTP). In contrast to DNA polymerase, RNAP includes helicase activity, therefore no separate enzyme is needed to unwind DNA.

Action

Initiation

RNA polymerase binding in bacteria involves the sigma factor recognizing the core promoter region containing the −35 and −10 elements (located before the beginning of sequence to be transcribed) and also, at some promoters, the α subunit C-terminal domain recognizing promoter upstream elements.^[12] There are multiple interchangeable sigma factors, each of which recognizes a distinct set of promoters. For example, in E. coli, σ⁷⁰ is expressed under normal conditions and recognizes promoters for genes required under normal conditions ("housekeeping genes"), while σ³² recognizes promoters for genes required at high temperatures ("heat-shock genes"). In archaea and eukaryotes, the functions of the bacterial general transcription factor sigma are performed by multiple general transcription factors that work together. The RNA polymerase-promoter closed complex is usually referred to as the "transcription preinitiation complex."^[13]^[14]

After binding to the DNA, the RNA polymerase switches from a closed complex to an open complex. This change involves the separation of the DNA strands to form an unwound section of DNA of approximately 13 bp, referred to as the "

Supercoiling plays an important part in polymerase activity because of the unwinding and rewinding of DNA. Because regions of DNA in front of RNAP are unwound, there are compensatory positive supercoils. Regions behind RNAP are rewound and negative supercoils are present.^[14]

Promoter escape

RNA polymerase then starts to synthesize the initial DNA-RNA heteroduplex, with ribonucleotides base-paired to the template DNA strand according to Watson-Crick base-pairing interactions. As noted above, RNA polymerase makes contacts with the promoter region. However these stabilizing contacts inhibit the enzyme's ability to access DNA further downstream and thus the synthesis of the full-length product. In order to continue RNA synthesis, RNA polymerase must escape the promoter. It must maintain promoter contacts while unwinding more downstream DNA for synthesis,

"scrunching" more downstream DNA into the initiation complex.^[15]

During the promoter escape transition, RNA polymerase is considered a "stressed intermediate." Thermodynamically the stress accumulates from the DNA-unwinding and DNA-compaction activities. Once the DNA-RNA heteroduplex is long enough (~10 bp), RNA polymerase releases its upstream contacts and effectively achieves the promoter escape transition into the elongation phase. The heteroduplex at the active center stabilizes the elongation complex.

However, promoter escape is not the only outcome. RNA polymerase can also relieve the stress by releasing its downstream contacts, arresting transcription. The paused transcribing complex has two options: (1) release the nascent transcript and begin anew at the promoter or (2) reestablish a new 3′-OH on the nascent transcript at the active site via RNA polymerase's catalytic activity and recommence DNA scrunching to achieve promoter escape. Abortive initiation, the unproductive cycling of RNA polymerase before the promoter escape transition, results in short RNA fragments of around 9 bp in a process known as abortive transcription. The extent of abortive initiation depends on the presence of transcription factors and the strength of the promoter contacts.^[16]

Elongation

The 17-bp transcriptional complex has an 8-bp DNA-RNA hybrid, that is, 8 base-pairs involve the RNA transcript bound to the DNA template strand.^[17] As transcription progresses, ribonucleotides are added to the 3′ end of the RNA transcript and the RNAP complex moves along the DNA. The characteristic elongation rates in prokaryotes and eukaryotes are about 10–100 nts/sec.^[18]

Aspartyl (asp) residues in the RNAP will hold on to Mg²⁺ ions, which will, in turn, coordinate the phosphates of the ribonucleotides. The first Mg²⁺ will hold on to the α-phosphate of the NTP to be added. This allows the nucleophilic attack of the 3′-OH from the RNA transcript, adding another NTP to the chain. The second Mg²⁺ will hold on to the pyrophosphate of the NTP.^[19] The overall reaction equation is:

(NMP)_n + NTP → (NMP)_n+1 + PP_i

Fidelity

Unlike the proofreading mechanisms of DNA polymerase those of RNAP have only recently been investigated. Proofreading begins with separation of the mis-incorporated nucleotide from the DNA template. This pauses transcription. The polymerase then backtracks by one position and cleaves the dinucleotide that contains the mismatched nucleotide. In the RNA polymerase this occurs at the same active site used for polymerization and is therefore markedly different from the DNA polymerase where proofreading occurs at a distinct nuclease active site.^[20]

The overall error rate is around 10⁻⁴ to 10⁻⁶.^[21]

Termination

In bacteria, termination of RNA transcription can be rho-dependent or rho-independent. The former relies on the rho factor, which destabilizes the DNA-RNA heteroduplex and causes RNA release.^[22] The latter, also known as intrinsic termination, relies on a palindromic region of DNA. Transcribing the region causes the formation of a "hairpin" structure from the RNA transcription looping and binding upon itself. This hairpin structure is often rich in G-C base-pairs, making it more stable than the DNA-RNA hybrid itself. As a result, the 8 bp DNA-RNA hybrid in the transcription complex shifts to a 4 bp hybrid. These last 4 base pairs are weak A-U base pairs, and the entire RNA transcript will fall off the DNA.^[23]

Transcription termination in eukaryotes is less well understood than in bacteria, but involves cleavage of the new transcript followed by template-independent addition of adenines at its new 3′ end, in a process called polyadenylation.^[24]

Other organisms

Given that DNA and RNA polymerases both carry out template-dependent nucleotide polymerization, it might be expected that the two types of enzymes would be structurally related. However, x-ray crystallographic studies of both types of enzymes reveal that, other than containing a critical Mg²⁺ ion at the catalytic site, they are virtually unrelated to each other; indeed template-dependent nucleotide polymerizing enzymes seem to have arisen independently twice during the early evolution of cells. One lineage led to the modern DNA polymerases and reverse transcriptases, as well as to a few single-subunit RNA polymerases (ssRNAP) from phages and organelles.^[2] The other multi-subunit RNAP lineage formed all of the modern cellular RNA polymerases.^[25]^[1]

Bacteria

In

mRNA and non-coding RNA (ncRNA)

.

RNAP is a large molecule. The core enzyme has five subunits (~400

kDa):^[26]

β′

In order to bind promoters, RNAP core associates with the transcription initiation factor sigma (σ) to form RNA polymerase holoenzyme. Sigma reduces the affinity of RNAP for nonspecific DNA while increasing specificity for promoters, allowing transcription to initiate at correct sites. The complete holoenzyme therefore has 6 subunits: β′βα^I and α^IIωσ (~450 kDa).

Eukaryotes

α-amanitin (red), a strong poison found in death cap mushrooms

that targets this vital enzyme

Eukaryotes have multiple types of nuclear RNAP, each responsible for synthesis of a distinct subset of RNA. All are structurally and mechanistically related to each other and to bacterial RNAP:

siRNA-directed heterochromatin formation in plants.^[34]

Eukaryotic chloroplasts contain an RNAP very highly similar to bacterial RNAP ("plastid-encoded polymerase, PEP"). They use sigma factors encoded in the nuclear genome.^[35]

Chloroplast also contain a second, structurally and mechanistically unrelated, single-subunit RNAP ("nucleus-encoded polymerase, NEP"). Eukaryotic

mitochondria use POLRMT (human), a nucleus-encoded single-subunit RNAP.^[2] Such phage-like polymerases are referred to as RpoT in plants.^[35]

Archaea

Archaea have a single type of RNAP, responsible for the synthesis of all RNA. Archaeal RNAP is structurally and mechanistically similar to bacterial RNAP and eukaryotic nuclear RNAP I-V, and is especially closely structurally and mechanistically related to eukaryotic nuclear RNAP II.^[8]^[36] The history of the discovery of the archaeal RNA polymerase is quite recent. The first analysis of the RNAP of an archaeon was performed in 1971, when the RNAP from the extreme

Sulfolobus shibatae set the total number of identified archaeal subunits at thirteen.^[8]^[38]

Archaea has the subunit corresponding to Eukaryotic Rpb1 split into two. There is no homolog to eukaryotic Rpb9 (

iron–sulfur protein. RNAP I/III subunit AC40 found in some eukaryotes share similar sequences,^[38] but does not bind iron.^[39] This domain, in either case, serves a structural function.^[40]

Archaeal RNAP subunit previously used an "RpoX" nomenclature where each subunit is assigned a letter in a way unrelated to any other systems.[1] In 2009, a new nomenclature based on Eukaryotic Pol II subunit "Rpb" numbering was proposed.^[8]

Viruses

nucleocytoplasmic large DNA viruses synthesize RNA using a virally encoded multi-subunit RNAP. They are most similar to eukaryotic RNAPs, with some subunits minified or removed.^[41] Exactly which RNAP they are most similar to is a topic of debate.^[42]

Most other viruses that synthesize RNA use unrelated mechanics.

Many viruses use a single-subunit DNA-dependent RNAP (ssRNAP) that is structurally and mechanistically related to the single-subunit RNAP of eukaryotic chloroplasts (RpoT) and mitochondria (POLRMT) and, more distantly, to DNA polymerases and reverse transcriptases. Perhaps the most widely studied such single-subunit RNAP is bacteriophage T7 RNA polymerase. ssRNAPs cannot proofread.^[2]

B. subtilis prophage SPβ uses YonO, a homolog of the β+β′ subunits of msRNAPs to form a monomeric (both barrels on the same chain) RNAP distinct from the usual "right hand" ssRNAP. It probably diverged very long ago from the canonical five-unit msRNAP, before the time of the last universal common ancestor.^[43]^[44]

Other viruses use an

positive strand RNA viruses, such as poliovirus, also contain RNA-dependent RNAP.^[45]

History

RNAP was discovered independently by Charles Loe, Audrey Stevens, and Jerard Hurwitz in 1960.^[46] By this time, one half of the 1959 Nobel Prize in Medicine had been awarded to Severo Ochoa for the discovery of what was believed to be RNAP,^[47] but instead turned out to be polynucleotide phosphorylase.

Purification

RNA polymerase can be isolated in the following ways:

By a phosphocellulose column.^[48]
By glycerol gradient centrifugation.^[49]
By a DNA column.
By an ion chromatography column.^[50]

And also combinations of the above techniques.

References

^
PMID 11839495
.

^
S2CID 1624391
.

^ Nobel Prize in Chemistry 2006
doi:10.1146/knowable-022822-1
. Retrieved 25 March 2022.

^ Griffiths AJF, Miller JH, Suzuki DT, et al. An Introduction to Genetic Analysis. 7th edition. New York: W. H. Freeman; 2000. Chapter 10.

PMID 11118218
.

PMID 10499798
.

^
PMID 19419240
.

OCLC 887605755.{{cite book}}: CS1 maint: location missing publisher (link
)

PMID 10500100
.

PMID 11018136
.

^ InterPro: IPR011260

PMID 1776168
.

^ ^a ^b Watson JD, Baker TA, Bell SP, Gann AA, Levine M, Losick RM (2013). Molecular Biology of the Gene (7th ed.). Pearson.

PMID 17110577
.

PMID 19443781
.

PMID 15610738
.

^ Milo R, Philips R. "Cell Biology by the Numbers: What is faster, transcription or translation?". book.bionumbers.org. Archived from the original on 20 April 2017. Retrieved 8 March 2017.

PMID 22982365
.

PMID 19914059
.

^ Philips R, Milo R. "What is the error rate in transcription and translation?". Retrieved 26 March 2019.

PMID 12213656
.

doi:10.1016/j.tig.2016.05.007
.

PMID 17629387
.

PMID 9751740
.

PMID 11124018
.

PMID 6287430
.

PMID 1904436
.

PMID 16908155
.

PMID 9932453
.

PMID 15372072
.

PMID 8444147
.

S2CID 206507767
.

PMID 19377477
.

^
PMID 20701995
.

PMID 17697097
.

PMID 4940048
.

^
PMID 18235446
.

S2CID 205235881
.

PMID 27557794
.

PMID 28701329
.

PMID 31506349
.

PMID 28585540
.

PMID 31103775
.

S2CID 42526536
.

PMID 16230341
.

^ Nobel Prize 1959

PMID 3525543
.

PMID 2358436
.

PMID 2261443
.

External links

Wikimedia Commons has media related to RNA polymerase.

DNAi – DNA Interactive, including information and Flash clips on RNA Polymerase.

RNA+Polymerase at the U.S. National Library of Medicine Medical Subject Headings (MeSH)

EC 2.7.7.6

RNA Polymerase – Synthesis RNA from DNA Template

(Wayback Machine copy)

3D macromolecular structures of RNA Polymerase from the EM Data Bank(EMDB)

This article incorporates text from the public domain Pfam and InterPro: IPR011773

v
t
e
Gene expression
Introduction
to genetics

Genetic code

Central dogma
DNA → RNA → Protein

Special transfers
RNA→RNA

RNA→DNA

Protein→Protein

Transcription
Types

Bacterial

Archaeal

Eukaryotic

Key elements

Transcription factor

RNA polymerase

Promoter

Post-transcription

Precursor mRNA (pre-mRNA / hnRNA)

5' capping

Splicing

Polyadenylation

Histone acetylation and deacetylation

Translation
Types

Bacterial

Archaeal

Eukaryotic

Key elements

Ribosome

Transfer RNA (tRNA)

Ribosome-nascent chain complex (RNC)

Post-translational modification

Regulation

Epigenetic
imprinting

Transcriptional
Gene regulatory network

cis-regulatory element

lac operon

Post-transcriptional
sequestration (P-bodies)

alternative splicing

microRNA

Translational

Post-translational
reversible

irreversible

Influential people

François Jacob

Jacques Monod

v
t
e
Transferases: phosphorus-containing groups (EC 2.7)
2.7.1-2.7.4:
phosphotransferase/kinase
(PO₄)
2.7.1: OH acceptor

Hexo-

Gluco-

Fructo-
Hepatic

Galacto-

Phosphofructo-
1

Liver

Muscle

Platelet

2

Riboflavin

Shikimate

Thymidine
ADP-thymidine

NAD⁺

Glycerol

Pantothenate

Mevalonate

Pyruvate

Deoxycytidine

PFP

Diacylglycerol

Phosphoinositide 3
Class I PI 3

Class II PI 3

Sphingosine

Glucose-1,6-bisphosphate synthase

COOH
acceptor

Phosphoglycerate

Aspartate kinase

2.7.3: N acceptor

Creatine

2.7.4: PO₄ acceptor

Phosphomevalonate

Adenylate

Nucleoside-diphosphate

Uridylate

Guanylate

Thiamine-diphosphate

P₂O₇
)

Ribose-phosphate diphosphokinase

Thiamine diphosphokinase

PO₄-nucleoside)
Polymerase
DNA polymerase

DNA-directed DNA polymerase

I/A
γ

θ

ν

T7

Taq

II/B
α

δ

ε

ζ

Pfu

III/C

IV/X
β

λ

μ

TDT

V/Y
η

ι

κ

RNA-directed DNA polymerase

Reverse transcriptase
Telomerase

RNA polymerase

Template-directed

RNA polymerase I

II

III

IV

V

ssRNAP
POLRMT

Primase
1

2

PrimPol

RNA-dependent RNA polymerase

Polyadenylation

PAP

PNPase

Phosphorolytic
3' to 5' exoribonuclease

RNase PH

PNPase

Nucleotidyltransferase

UTP—glucose-1-phosphate uridylyltransferase

Galactose-1-phosphate uridylyltransferase

Guanylyltransferase

mRNA capping enzyme

Other

Recombinase (Integrase)

Transposase

2.7.8: miscellaneous
Phosphatidyltransferases

CDP-diacylglycerol—glycerol-3-phosphate 3-phosphatidyltransferase

CDP-diacylglycerol—serine O-phosphatidyltransferase

CDP-diacylglycerol—inositol 3-phosphatidyltransferase

CDP-diacylglycerol—choline O-phosphatidyltransferase

Glycosyl-1-phosphotransferase

N-acetylglucosamine-1-phosphate transferase

2.7.10-2.7.13: protein kinase
(PO₄; protein acceptor)
2.7.10: protein-tyrosine

see tyrosine kinases

2.7.11: protein-serine/threonine

see serine/threonine-specific protein kinases

2.7.12: protein-dual-specificity

see serine/threonine-specific protein kinases

2.7.13: protein-histidine

Protein-histidine pros-kinase

Protein-histidine tele-kinase

Histidine kinase

v
t
e
Enzymes
Activity

Active site

Binding site

Catalytic triad

Oxyanion hole

Enzyme promiscuity

Diffusion-limited enzyme

Cofactor

Enzyme catalysis

Regulation

Allosteric regulation

Cooperativity

Enzyme inhibitor

Enzyme activator

Classification

EC number

Enzyme superfamily

Enzyme family

List of enzymes

Kinetics

Enzyme kinetics

Eadie–Hofstee diagram

Hanes–Woolf plot

Lineweaver–Burk plot

Michaelis–Menten kinetics

Types

EC1 Oxidoreductases (list)

EC2 Transferases (list)

EC3 Hydrolases (list)

EC4 Lyases (list)

EC5 Isomerases (list)

EC6 Ligases (list)

EC7 Translocases (list)

Portal:
Biology

Authority control databases: National

Israel

United States

Japan

Retrieved from "https://en.wikipedia.org/w/index.php?title=RNA_polymerase&oldid=1219440686"

[pmid21233849-1] 
PMID 11839495
.

[pmid9419244-2] 
S2CID 1624391
.

[3] Nobel Prize in Chemistry 2006

[Stoddart-4] doi:10.1146/knowable-022822-1
. Retrieved 25 March 2022.

[5] Griffiths AJF, Miller JH, Suzuki DT, et al. An Introduction to Genetic Analysis. 7th edition. New York: W. H. Freeman; 2000. Chapter 10.

[6] PMID 11118218
.

[pmid10499798-7] PMID 10499798
.

[pmid19419240-8] 
PMID 19419240
.

[9] OCLC 887605755.{{cite book}}: CS1 maint: location missing publisher (link
)

[10] PMID 10500100
.

[11] PMID 11018136
.

[12] InterPro: IPR011260

[Roeder1991-13] PMID 1776168
.

[MBOG-14] Watson JD, Baker TA, Bell SP, Gann AA, Levine M, Losick RM (2013). Molecular Biology of the Gene (7th ed.). Pearson.

[15] PMID 17110577
.

[16] PMID 19443781
.

[17] PMID 15610738
.

[18] Milo R, Philips R. "Cell Biology by the Numbers: What is faster, transcription or translation?". book.bionumbers.org. Archived from the original on 20 April 2017. Retrieved 8 March 2017.

[pmid22982365-19] PMID 22982365
.

[20] PMID 19914059
.

[21] Philips R, Milo R. "What is the error rate in transcription and translation?". Retrieved 26 March 2019.

[22] PMID 12213656
.

[23] :10.1016/j.tig.2016.05.007
.

[24] PMID 17629387
.

[25] PMID 9751740
.

[26] PMID 11124018
.

[27] PMID 6287430
.

[28] PMID 1904436
.

[omid16908155-29] PMID 16908155
.

[30] PMID 9932453
.

[31] PMID 15372072
.

[32] PMID 8444147
.

[33] S2CID 206507767
.

[34] PMID 19377477
.

[pmid20701995-35] 
PMID 20701995
.

[36] PMID 17697097
.

[37] PMID 4940048
.

[pmid18235446-38] 
PMID 18235446
.

[39] S2CID 205235881
.

[40] PMID 27557794
.

[41] PMID 28701329
.

[predate-42] PMID 31506349
.

[43] PMID 28585540
.

[qde1-mono-44] PMID 31103775
.

[45] S2CID 42526536
.

[46] PMID 16230341
.

[47] Nobel Prize 1959

[48] PMID 3525543
.

[49] PMID 2358436
.

[50] PMID 2261443
.

[2]

[1]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[34]

[35]

[36]

[38]

[39]

[40]

[41]

[42]

[43]

[44]

[45]

[46]

[47]

[48]

[49]

[50]