Analyses of mRNA Cleavage by RelE and the Role of tRNA ...

Brigham Young UniversityBYU ScholarsArchive

All Theses and Dissertations

2016-06-01

Analyses of mRNA Cleavage by RelE and the Roleof tRNA Methyltransferase TrmD Using BacterialRibosome ProfilingJae Yeon HwangBrigham Young University

Follow this and additional works at: https://scholarsarchive.byu.edu/etd

Part of the Chemistry Commons

This Dissertation is brought to you for free and open access by BYU ScholarsArchive. It has been accepted for inclusion in All Theses and Dissertationsby an authorized administrator of BYU ScholarsArchive. For more information, please contact [email protected], [email protected].

BYU ScholarsArchive CitationHwang, Jae Yeon, "Analyses of mRNA Cleavage by RelE and the Role of tRNA Methyltransferase TrmD Using Bacterial RibosomeProfiling" (2016). All Theses and Dissertations. 6443.https://scholarsarchive.byu.edu/etd/6443

http://home.byu.edu/home/?utm_source=scholarsarchive.byu.edu%2Fetd%2F6443&utm_medium=PDF&utm_campaign=PDFCoverPages

http://home.byu.edu/home/?utm_source=scholarsarchive.byu.edu%2Fetd%2F6443&utm_medium=PDF&utm_campaign=PDFCoverPages

https://scholarsarchive.byu.edu?utm_source=scholarsarchive.byu.edu%2Fetd%2F6443&utm_medium=PDF&utm_campaign=PDFCoverPages

https://scholarsarchive.byu.edu/etd?utm_source=scholarsarchive.byu.edu%2Fetd%2F6443&utm_medium=PDF&utm_campaign=PDFCoverPages

https://scholarsarchive.byu.edu/etd?utm_source=scholarsarchive.byu.edu%2Fetd%2F6443&utm_medium=PDF&utm_campaign=PDFCoverPages

http://network.bepress.com/hgg/discipline/131?utm_source=scholarsarchive.byu.edu%2Fetd%2F6443&utm_medium=PDF&utm_campaign=PDFCoverPages

https://scholarsarchive.byu.edu/etd/6443?utm_source=scholarsarchive.byu.edu%2Fetd%2F6443&utm_medium=PDF&utm_campaign=PDFCoverPages

mailto:[email protected],%[email protected]

Analyses of mRNA Cleavage by RelE and the Role of tRNA Methyltransferase TrmD

Using Bacterial Ribosome Profiling

Jae Yeon Hwang

A dissertation submitted to the faculty of

Brigham Young University

in partial fulfillment of the requirements for the degree of

Doctor of Philosophy

Allen R. Buskirk, Chair

Barry M. Willardson

William R. McCleary

Joel S. Griffitts

Joshua L. Andersen

Department of Chemistry and Biochemistry

Brigham Young University

June 2016

Copyright © 2016 Jae Yeon Hwang

All Rights Reserved

ABSTRACT

Analyses of mRNA Cleavage by RelE and the Role of tRNA Methyltransferase TrmD

Using Bacterial Ribosome Profiling

Jae Yeon Hwang

Department of Chemistry and Biochemistry, BYU

Doctor of Philosophy

Protein synthesis is a fundamental and ultimate process in living cells. Cells possess

sophisticated machineries and continuously carry out complex processes. Monitoring protein

synthesis in living cells not only inform us about the mechanism of translation but also deepen

our insights about all aspects of life. Understanding the structure and mechanism of the ribosome

and its associated factors helped us enlarge our knowledge on protein synthesis.

Recently, with the dramatic advances of high-throughput sequencing and bioinformatics,

a new technique called ribosome profiling emerged. By retrieving mRNA fragments protected by

translating ribosomes, ribosome profiling reveals global ribosome occupancy along mRNAs in

living cells, which can inform us with the identity and quantity of proteins being made. Easily

adapted to other organisms, ribosome profiling technique is expanding its application in

revealing various cellular activities as well as the knowledge on protein synthesis.

Here, we report the mechanism of translating mRNA cleavage by endoribonuclease RelE

in vivo. RelE is an endoribonuclease that is induced during nutrient deficiency stress and

specifically cleaves translating mRNAs upon binding to the ribosomal A site. Overexpression of

RelE in living cells causes growth arrest by inhibiting global translation. We monitored RelE

activity in vivo upon overexpression using ribosome profiling. The data show that RelE actively

cuts translating mRNAs whenever the ribosomal A site is accessible, resulting in truncated

mRNAs. RelE causes the ribosome complexes to accumulate near the 5’ end of genes as the

process of ribosome rescue, translation, and cleavage by RelE repeats. RelE cleavage specific

sub-codon level ribosome profiling data also represent reading frame in Escherichia coli and

sequence specificity of RelE cleavage in vivo.

We report another ribosome profiling study on a methyltransferase TrmD in E. coli.

TrmD is known to methylate G37 (the residue at 3’ side of anticodon) of some tRNAs and be

responsible for codon-anticodon interaction. We constructed a TrmD depletion E. coli strain,

whose deletion results in lethality of cells. Resulting depletion of m1G37 in the strain leads to

growth arrest. Lack of m1G37 of some tRNAs whose codons start with C showed frequent

frameshift when translating the gene message in vitro. By using ribosome profiling, we

successfully observed significant difference on translation process when codons interact with

anticodons of tRNAs lacking m1G37. The data reveal slow translation rate or pauses on the

tRNAs when missing the appropriate methylation, which corresponds to the previous

biochemical data in vitro.

Keywords: RelE, endoribonuclease, TrmD, methyltransferase, ribosome profiling, m1G37

ACKNOWLEDGEMENTS

I had opportunities to experience three different laboratories across the U.S, which is not

common for a graduate student. After working two years at BYU, I worked at Scott Strobel’s

laboratory at Yale for about 10 months, then back to BYU for a year, and again to Rachel

Green’s laboratory at Johns Hopkins for two years so far. Moving from one place to another with

my family slowed down the progress of my research but, with the help from many people I could

continue studying.

I would like to first thank my former lab members at BYU. I still remember when I talked

to Chris Woolstenhulme for the first time while searching labs to join. He helped me to have an

idea what our lab was doing for research and helped me to learn many lab skills later on. Mickey

Miller was always nice and helped me to understand better what I was doing in the lab. He was

always patient in answering many questions I had.

I express thank you to the lab members at Yale University, who helped me to understand

kinetic assays for RelE activity and shared their findings from experiments. I am grateful for

their encouragement for me while I was struggling with continuous failing on my experiments.

They helped me to realized that I was not the only one who was discouraged with repeated

failures by sharing their own experiences. I also appreciate their talents and creativities in

approaching problems and carrying experiments.

I would like to thank current lab members at Johns Hopkins for the discussions and for

sharing their talents. With their mature citizenship and efficient setup at the lab, I had an

excellent environment to focus on the research. I would like to also express thank you to Isao

Masuda for the collaboration on TrmD study and for his friendship during the work at Ya-Ming

Hou’s lab at Thomas Jefferson University. I also thank Ya-Ming Hou for sharing her knowledge

on tRNAs and methylations.

I am grateful for the generosity of Scott Strobel and Rachel Green. They both allowed me

to work at their labs. I am honored to be part of their labs, being inspired by their insights and

superb attitudes towards science in proximity. Their enthusiasm and knowledge on the field

stimulate me to have passion on science as well. Rachel always talks about science without

ceasing and her energy and confidence remind me of what I want be.

I also thank for my committee members at BYU for their guidance. I especially

appreciate Barry Willardson for his attitude towards science and for teaching ethics in science

through a class. He told a group of students in a class, including me, that science is built on the

foundations of previous researchers so that we are obligated to share what we obtain. That lesson

was one of the most valuable lessons for me during graduate school years. I thank Joel Griffitts

for demonstrating his critical thinking in solving problems. I am also thankful for BYU and

funding sources that supported me to study during the program. I have been impressed by many

great people at BYU for their teachings and care for the students.

Most of all, I deeply thank my advisor and mentor, Allen Buskirk, who provided me with

many opportunities to learn. He taught me that science is a really fun thing that I should be

excited about. I like his sense of humor and the way he put things together in science. His talent

and knowledge on science make him special and that, somewhat, separates me from him, but I

always aimed to overcome the gap between mine and his and to catch up the level of his

proficiency, which still looks a long way. He has guided and helped me on the course with great

patience. I feel like I am just ready to start real science now.

Lastly, I would like to thank my family and friends for their support. I would like to

thank Kyu Tae Han and Keum Hee Song, who helped me broaden my views and shape my mind.

I extend special thank you to my eldest brother, Jang Yeon for his support since I was young.

Without his help, it would not be possible to come to the U.S. and continue to study what I

always wanted to. I am so grateful for my parents for their constant love and trust towards me. I

love my family, my beautiful wife, Jee Yeon Lee, my daughters, Yiwon and Jiwon, who always

give me motivation and strength to continue. There were hard times during the course and they

were always with me to encourage me. Without the love and trust from my wife, I would not be

able to accomplish this and will not be able to go further.

vi

TABLE OF CONTENTS

List of Tables ............................................................................................................................... viii

List of Figures ................................................................................................................................ ix

1 Introduction .................................................................................................................................. 1

1.1 Bacterial Translation .............................................................................................................. 1

1.1.1 Initiation ......................................................................................................................... 2

1.1.2 Elongation Cycle ............................................................................................................ 3

1.1.3 Termination .................................................................................................................... 4

1.1.4 Ribosome Recycling ....................................................................................................... 5

1.1.5 Ribosome Rescue ........................................................................................................... 5

1.2 Ribosome Profiling ................................................................................................................ 8

1.2.1 The Ribosome Profiling Procedure ................................................................................ 9

1.2.2 Analysis of Ribosome Profiling Data ........................................................................... 10

1.2.3 Power of Ribosome Profiling to Study Protein Synthesis ............................................ 11

1.2.4 Bacterial Ribosome Profiling: Promises and Obstacles ............................................... 14

1.3 The Endoribonuclease RelE ................................................................................................. 17

1.4 tRNA Methylations and TrmD ............................................................................................ 19

2 mRNA Cleavage by RelE........................................................................................................... 20

2.1 Introduction .......................................................................................................................... 20

2.2 Results and Discussion ........................................................................................................ 23

2.2.1 Ribosomes are Enriched at the 5’-end of Genes in Cells Overexpressing RelE .......... 23

2.2.2 RelE Predominantly Cleaves mRNA after the Second Nucleotide in Empty A sites .. 28

2.2.3 in vitro Digestion with RelE Reveals the Reading Frame ............................................ 31

2.2.4 RelE Prefers to Cleave after C and before G ................................................................ 34

2.2.5 The Specificity of RelE Interferes with Analyses of Ribosome Pausing ..................... 37

2.2.6 Refined RelE-Derived Ribosome Density Better Reflects Reading Frame ................. 39

2.2.7 Concluding Thoughts ................................................................................................... 41

vii

2.3 Materials and Methods ......................................................................................................... 42

2.3.1 Bacterial Strains and Growth Conditions ..................................................................... 42

2.3.2 Preparation of Ribo-seq and RNAseq Libraries ........................................................... 42

2.3.3 Data Analysis ............................................................................................................... 43

3 The Role of tRNA Methyltransferase TrmD ............................................................................. 44

3.1 Introduction .......................................................................................................................... 44

3.1.1 tRNA Synthesis and Post-Transcriptional Modifications ............................................ 45

3.2 The TrmD Methyltransferase ............................................................................................... 47

3.3 Results 49

3.3.1 Achieving Regulation of TrmD, an Essential Protein .................................................. 49

3.3.2 Depletion of m1G37 of tRNA Causes Pauses on CCG, CCA, and CGG Codons........ 53

3.4 Materials and Methods ......................................................................................................... 57

3.4.1 Strains and Construction .............................................................................................. 57

3.4.2 Cell Growth and Preparation of Ribo-seq Libraries ..................................................... 58

3.4.3 Data Analysis ............................................................................................................... 58

References ..................................................................................................................................... 59

viii

LIST OF TABLES

Table 1 Codons and their anti-codons with modifications ........................................................... 56

ix

LIST OF FIGURES

Figure 1 Average ribosome occupancy......................................................................................... 24

Figure 2 Ribosome enrichment at 5’-end genes of operon ........................................................... 26

Figure 3 Average ribosome density at start- and stop-codon ....................................................... 28

Figure 4 Ribo-seq data at nascent peptide-mediated stalling sites ............................................... 30

Figure 5 Average ribosome occupancy for a wild-type sample ................................................... 32

Figure 6 Ribo-seq and RNAseq density at each position ............................................................. 33

Figure 7 Sequence bias at the 3’-end of ribosome footprints ....................................................... 35

Figure 8 Sequence bias at the 3’-end of RNA fragments ............................................................. 36

Figure 9 The ribosome occupancy on codons............................................................................... 37

Figure 10 Correction for better reading frame .............................................................................. 39

Figure 11 Programmed frameshifting at the prfB gene ................................................................ 40

Figure 12 1-methyl-Guanosine ..................................................................................................... 47

Figure 13 TrmD depletion strain construction .............................................................................. 50

Figure 14 TrmD depletion ............................................................................................................ 51

Figure 15 Growth measurements upon TrmD depletion .............................................................. 52

Figure 16 Effect of TrmD depletion on tRNA .............................................................................. 53

Figure 17 Ribosome occupancy on codons in control strain ........................................................ 54

Figure 18 Ribosome occupancy on codons in degron strain ........................................................ 55

1

1 INTRODUCTION

1.1 Bacterial Translation

Ribosomes read the genetic information in mRNA and facilitate peptide-bond formation

to make proteins from single amino acids. Escherichia coli ribosomes are composed of two

subunits termed 50S and 30S because of their sedimentation coefficients. The larger 50S subunit

is itself composed of two ribosomal RNAs (23S and 5S) and 34 ribosomal proteins; the active

site that links amino acids together is found within the large subunit. The smaller 30S subunit

contains the 16S ribosomal RNA (rRNA) and 21 proteins and plays the dominant role in

decoding the genetic information in mRNA. The two subunits combine to make 70S ribosome

complexes that are active in translation.

Peptide synthesis occurs in the ribosome in a very sophisticated way with the help of key

adaptor molecules known as transfer RNAs (tRNAs). tRNAs are first charged with activated

amino acids to form aminoacyl-tRNAs (aa-tRNAs). There are 20 different aminoacyl-tRNA

synthetases (aaRSs) in E. coli1. Each aaRS activates the correct amino acid with ATP and then

links the amino acid to its cognate tRNA by an ester bond between the carboxyl group of the

amino acid and the 3’-terminal adenosine of the tRNA. The aa-tRNAs then enter the ribosome,

and when bound to their cognate codons, deliver the appropriate amino acid for the ribosome to

incorporate into the growing polypeptide chain. The ribosome has three sites for tRNA binding:

the aminoacyl-tRNA or A site, where the correct aa-tRNA is selected, the peptidyl-tRNA or P

2

site, where the growing nascent chain is attached, and the exit or E site, through which

uncharged tRNAs pass as they are released from the ribosome. The protein synthesis process is

complex and continuous but it can be conveniently broken down into four discrete steps:

initiation, elongation, termination and ribosome recycling2.

1.1.1 Initiation

At least in most cases, free 50S and 30S ribosomal subunits assemble into 70S complexes

at start codons to initiate protein synthesis. Initiation factor 1 (IF1) binds to the A site of the 30S

subunit and promotes dissociation of the 30S and 50S subunits. Initiation factor 3 (IF3) binds to

the E site of the 30S subunit and further prevents premature association with the 50S subunit3.

The 30S subunit bound with IFs interacts with the purine-rich region known as the Shine-

Dalgarno (SD) sequence on mRNA that is complementary to the 3’-end of the 16S rRNA4. The

pyrimidine-rich region of 16S rRNA, or the anti-Shine-Dalgarno (aSD) sequence, binds to the

SD sequence roughly 7 nt upstream from the start codon5. This interaction positions the start

codon of the mRNA at the P site of the 30S subunit. In bacteria, start codons are predominantly

AUG although GUG is also sometimes used. The fact that ribosomes can be recruited to many

internal sites within a polycistronic mRNA differentiates bacterial translation from eukaryotic

translation, where ribosomes are recruited by proteins that bind the 5’-cap and then scan and

initiate at the first start codon in mRNAs that are, as a rule, monocistronic6.

Guanosine 5’-triphosphate (GTP)-bound initiation factor 2 (IF2) recognizes the initiator

tRNA, N-formyl-methionyl-tRNAfMet (fMet-tRNAfMet)3, 7. IF2 binds in the A site of the 30S

subunit and places the initiator tRNA in the P site of the 30S subunit. Binding of initiator tRNA

to the start codon at the P site causes conformational changes of the 30S subunit that lead to

release of IF3. With the release of IF3, the 50S subunit can bind to the 30S subunit, inducing

3

GTP hydrolysis by IF2. At that point, IF1 and GDP-bound IF2 are released from the 30S subunit,

leaving behind an active 70S initiation complex. With initiator tRNA at its P site and an empty A

site, the complex is ready for elongation.

1.1.2 Elongation Cycle

At the start of the elongation cycle, the 70S initiation complex with initiator tRNA bound

to its P site accepts the next aa-tRNA into its empty A site8. aa-tRNAs are delivered to the

ribosome by the abundant GTPase, elongation factor-Tu (EF-Tu)9. Upon GTP binding to EF-Tu,

EF-Tu arranges its three domains in a compact structure allowing them to bind aa-tRNA10. When

the ternary complex (EF-Tu, GTP, and aa-tRNA) binds to the A site, it is rejected if the

anticodon of the tRNA does not match the codon in the A site8, 11. This process repeats until the

cognate tRNA comes in8, 11-12. Upon binding of the cognate tRNA in the A site, the correct

codon-anticodon base pairing causes conformational changes in the ribosome8. rRNA residues

surrounding the base pairs stabilize tRNA binding to the A site8, 12.

Conformational changes in the A site upon binding of the correct aa-tRNA result in a

distortion of the tRNA, triggering GTP hydrolysis by EF-Tu10-11. As the GTP of EF-Tu is

hydrolyzed and the γ-phosphate is released, the structure of the EF-Tu is rearranged and releases

the tRNA, and EF-Tu dissociates from the A site10, 13.

GTP hydrolysis occurs faster with cognate tRNAs than with near- or non-cognate tRNAs;

this kinetic difference is one basis for selectivity for the correct tRNA14. Following GTP

hydrolysis, the aminoacyl end of the A-site tRNA swings into the peptidyl-transferase site of the

50S subunit, a movement known as accommodation15. Cognate tRNAs are accommodated more

quickly than near- or non-cognate tRNAs, as second kinetic selection or proofreading step14-15.

4

RNA nucleotides in the peptidyl-transferase center bring the two amino acids together to

form a peptide bond15. There are two RNA elements in the 50S subunit called the A loop and P

loop that have highly conserved sequences and interact with the 3’-ends of aa-tRNAs positioned

at A site and P site15. Once the C-terminus of growing peptide and the amino group of the A-site

bound aa-tRNA are positioned correctly, peptide bond formation occurs intrinsically by a

nucleophilic displacement, perhaps involving side chains on the tRNAs themselves but not the

rRNA nucleotides in the active site2, 15.

For translation to continue, the ribosome must move to the next codon on the mRNA.

Following peptide-bond formation, the deacylated tRNA is at the ‘P/E’ hybrid state and the

peptidyl-tRNA with two amino acids is at the ‘A/P’ hybrid state15-16. The tRNAs need to be

resolved to a new classical state with the deacylated tRNA completely in the E site and the

peptidyl-tRNA positioned at the P site15. This process is called translocation and is catalyzed by

elongation factor G (EF-G), a ribosome-activated GTPase. Hydrolysis of GTP by EF-G helps the

30S subunit move to the right position for the next codon at A site, maintaining reading frame by

moving exactly three nucleotides on mRNA17. The uncharged tRNA at the E site is released from

the ribosome and another aa-tRNA enters the A site2, 15. This cycle repeats until a stop codon

(UAG, UAA, or UGA) appears at the A site. Translocation is thought to be relatively fast

compared to decoding, and bacterial peptides are formed in the ribosome at a rate of about 20

amino acid residues per second2, 15.

1.1.3 Termination

When the ribosome reaches a stop codon, protein synthesis is terminated. Stop codons are

recognized by two release factors, RF1 and RF218. By mimicking tRNA structurally, these

factors bind in the A site and interact directly with mRNA19. RF1 recognizes the UAA and UAG

5

stop codons and RF2 recognizes the UAA and UGA stop codons20. Release factors hydrolyze the

ester bond on peptidyl-tRNA and release the newly synthesized peptide chain from the ribosome,

leaving uncharged tRNA in the P site2, 20. Both RF1 and RF2 contain a highly conserved

tripeptide motif (GGQ motif) that is responsible for the catalysis of peptide release from the

peptidyl-tRNA at P site20-21. Following hydrolysis of peptidyl tRNA, RF3, a GTPase found in

some but not all bacteria, catalyzes the release of RF1 or RF2 at the end of the termination

process. GTP hydrolysis then promotes the dissociation of RF320-22.

1.1.4 Ribosome Recycling

After termination, ribosomes need to be dissociated for efficient initiation at another start

codon to begin. Ribosome recycling factor (RRF) and EF-G dissociate the ribosome into

subunits, releasing mRNA and tRNAs from the ribosome upon GTP hydrolysis2. The structure of

RRF mimics tRNA in the A site2, 23. When the ribosome dissociates, IF1 replaces the deacylated

tRNA and the mRNA is released. All translational components are now free for another round of

translation23.

1.1.5 Ribosome Rescue

In bacteria, the translation and transcription of genes are coupled. Since genomic DNA is

not separated from the cytoplasm by a nuclear membrane like it is in eukaryotic cells, the whole

process of gene expression happens simultaneously and is faster than it is in eukaryotic cells. As

soon as an mRNA begins to be synthesized, translation can begin. This can lead to problems

when ribosomes translate messages that are defective as when RNA polymerase prematurely

terminates prior to reaching the stop codon in the message. Other defective RNAs come from

6

mutations, chemical damage, and the endonucleolytic cleavage that is part of normal mRNA

decay pathways24.

Truncated mRNAs are deleterious to the cell for at least two reasons: they cause

ribosomes to stall at the 3’-end of the message and produce incomplete or premature proteins

which might be harmful to the cell. When ribosomes reach the 3’-end of a truncated mRNA, RFs

cannot be recruited, since there is no stop codon, and the ribosome-mRNA complex is stably

maintained25. In the cell, stalled ribosomes can affect the global protein synthesis if not rescued.

Bacterial cells have three mechanisms that can resolve stalled ribosomes. One involves a

small, stable RNA known as tmRNA and the other two involve proteins that are alternative

rescue factors, ArfA and ArfB25. tmRNA rescues the stalled ribosome by acting both as a tRNA

and an mRNA25-26. tmRNA-mediated ribosome rescue not only dissociates the stalled ribosome

for recycling but also tags the incomplete nascent peptide for degradation. ArfA is a small

protein that can bind to the A site and recruit RF2, hydrolyzing the nascent peptide and leading

to recycling27. ArfB has a GGQ domain in its structure as RFs do and facilitates the termination

of protein synthesis. Among these three mechanisms, tmRNA is the best characterized28.

The sophisticated structure of tmRNA makes it act as both tRNA and mRNA. In E. coli,

tmRNA is 363 nt long. It contains an tRNA-like domain (TLD) and an mRNA-like domain with

its own small open reading frame (ORF)29. The TLD contains 3’-terminal CCA residues like

tRNAs do and has a G3-U357 wobble base pair, which alanyl-tRNA synthetase recognizes and

charges with alanine at its 3’-end. With several helices and pseudoknots in its structure, tmRNA

is relatively stable against degradation by nucleases. Its partner protein, SmpB, further stabilizes

tmRNA29-30. The structure of the TLD allows EF-Tu to bind Ala-tmRNA just like other tRNAs

do. The GTP-bound EF-Tu-tmRNA-SmpB complex enters the A site of stalled ribosomes and

7

undergoes peptidyl transfer following GTP hydrolysis by EF-Tu30. After the growing peptide is

attached to tmRNA, the mRNA-like domain of tmRNA serves as a transcript for the ribosome to

continue translation. The ORF encodes the ANDENYALAA peptide and a stop codon at the

end31. Because the ribosome switches templates, ribosome rescue by tmRNA is referred to as

trans-translation32. The 11 amino acids added by tmRNA target the aborted nascent peptide for

degradation. The ClpXP protease system recognizes the hydrophobic C-terminal residues of the

tagged peptide and degrades it31.

While tmRNA and SmpB are found in all bacterial genomes, the ArfA backup system

found in some bacteria also facilitate ribosome rescue. It is proposed that ArfA, previously

known as YhdL, binds to the ribosomal A site and recruits RF2. Then, RF2 hydrolyzes the

peptidyl-tRNA and releases translation complex. It is not known, however, how ArfA recognizes

the 3’-end of non-stop mRNA and how it recruits RF2. The deletion of tmRNA and ArfA

together are lethal whereas a single deletion of either is viable, which suggests a mutual role of

the two mechanisms in ribosome rescue26.

ArfB, previously known as YaeJ, also rescues stalled ribosomes. In E. coli ArfB is a

small basic protein with 140 amino acids. As a homolog of RFs, ArfB contains the conserved

GGQ motif for hydrolysis of peptidyl-tRNA28. Instead of having a specific sequence for

interacting with stop codons, ArfB has a positively charged C-terminal tail that interacts with the

mRNA channel, selectively binding ribosomes with little or no mRNA downstream of the A-site

codon. This is reminiscent of the C-terminal tail of the SmpB protein that provides selectivity for

the tmRNA rescue system28.

8

1.2 Ribosome Profiling

Insight into protein synthesis can shed light on many aspects of biology; in particular, it is

a powerful way to monitor gene expression levels in vivo. Previously, researchers have achieved

the goal by monitoring transcriptional changes with microarrays or RNAseq. Another approach,

proteomics based on mass spectrometry (MS), reveals the identity and quantity of proteins in the

cell. However, it is difficult to accurately quantify proteins with low abundance or instability. In

addition, MS focuses on the steady-state levels of proteins in the cell rather than the rate of

protein synthesis. While steady-state levels are very informative in determining the level of

protein activity, they are less helpful in understanding the mechanism of protein synthesis or the

translational control of gene expression33.

Polysome profiling is another method used to monitor protein synthesis in vivo34. This

technique is based on the fact that the number of translating ribosomes bound to mRNAs can tell

us the translational status of that transcript. Ribosome-bound mRNAs are collected by

ultracentrifugation of cell lysates over sucrose gradients. The identity of heavily translated

mRNAs is determined and compared to the background of total mRNA using microarrays or

RNAseq34. Polysome profiling allowed researchers to move from monitoring transcription levels

to monitoring translation levels34. However, the approach has some difficulties in separating

messages depending on the number of ribosome bound to them. Sucrose gradient fraction

collection does not resolve clearly complexes heavier than disomes. The many fractions per

sample (based on the number of ribosomes bound to mRNAs) makes it a lot of work to do the

following analyses. More importantly, the data generated from polysome profiling cannot give

the exact position of ribosomes on mRNAs; rather, they only give some idea of ribosome

abundance on certain mRNAs.

9

To get better information on translation in vivo, Ingolia and Weissman developed a

powerful new technique called ribosome profiling or Ribo-seq35. Simply stated, ribosome

profiling improved polysome profiling by isolating short fragments of mRNA corresponding to

ribosome footprints, revealing the exact positions of ribosomes on mRNAs. Ribosome profiling

provides a genome-wide snapshot of ribosome occupancy on mRNA templates, telling us which

proteins are being made and to what extent35.

The development of ribosome profiling was made possible by advances in sequencing

technology. Ribosome profiling involves deep sequencing millions of ribosome-protected short

mRNA fragments. With the completion of the human genome project, significant improvements

were made in high-throughput DNA sequencing, making the sequencing of hundreds of millions

of short reads economically viable. Methods for preparing cDNA libraries have been well

developed, including RNA ligation methods and techniques for multiplexing a number of

libraries for more efficient and cost effective sample preparation. Another improvement came

from easily accessible databases from bioinformatics. Annotated whole genomes provide the

standard against which sequencing reads can be mapped or aligned. In addition, bioinformatics

has provided many algorithms for processing and analysis of the data generated from high-

throughput sequencing. Together, these technologies and methods make possible the ribosome

profiling technique that is increasing our understanding of gene regulation and particularly the

protein synthesis process.

1.2.1 The Ribosome Profiling Procedure

To collect ribosome-protected mRNA fragments for deep sequencing, grown cells are

effectively captured by flash freezing using liquid nitrogen. For lysis, cells are pulverized

mechanically in liquid nitrogen in the presence of lysis buffer containing drugs that arrest

10

ribosomes on mRNA complexes. The pulverized lysate is thawed and centrifuged to pellet cell

debris. The lysate is then subject to digestion using a nuclease to degrade naked mRNA, leaving

only ribosome-protected mRNA fragments. Ribosome complexes are isolated on sucrose

gradients by ultracentrifugation; digestion with the nuclease has reduced the polysomes to single

monosomes. Ribosome-protected mRNA fragments are released from the monosomes by

treatment with phenol and chloroform followed by RNA precipitation. At this point, the purified

RNAs contains rRNA and tRNA fragments as well as mRNA fragments.

To purify ribosome footprints away from other RNA fragments, the total RNA is run on a

denaturing acrylamide gel for size selection. RNA corresponding to the size that best represents

the footprint of the ribosome are collected. Generally, we isolate mRNA fragments on the gel

that are between 10 to 40 nt in length. The strategy to sequence these mRNA fragments is to

make a cDNA library. First, a defined RNA linker that contains a primer binding sequence is

attached at the 3’-end of each mRNA fragment by ligation with T4 RNA ligase. Linker-ligated

fragments are then gel-purified. rRNA fragments are selectively subtracted by hybridization with

oligonucleotides complementary to rRNA. Then, the purified mRNA fragments are reverse

transcribed to yield cDNAs and these single-stranded cDNAs are circularized by ligation,

amplified by a few cycles of PCR, and submitted for 50 bp, single end, Illumina sequencing.

1.2.2 Analysis of Ribosome Profiling Data

Data analysis involves processing the raw sequencing data into genome-wide maps of

ribosome occupancy and then further analysis for specific needs. High-throughput sequencing is

performed on several samples per lane in a single experiment (a strategy known as

‘multiplexing’). Each library is synthesized with a characteristic sequence (usually 6 nt) termed a

‘barcode’ that distinguishes that library from the others during data analysis. In the first step of

11

data processing, raw sequences are de-multiplexed by sorting them out according to their

barcode sequences. In addition, reads are filtered based on the quality of data as quantified in

Phred quality scores, which is a measure of the quality of the identification of the nucleobases,

and ambiguous or suspicious reads are excluded. After filtering, the reads are trimmed to remove

the constant 3’-linker sequence, leaving only the ribosome footprints. Next, any reads that map

to tRNA and rRNA genes are discarded before mapping reads to annotated genomic DNA. Final

ribosome occupancy data are represented as densities along the genome sequence; these maps of

ribosome occupancy are used for additional analysis using our own computing scripts, which are

written mainly using a computer programming language Python.

1.2.3 Power of Ribosome Profiling to Study Protein Synthesis

Although ribosome profiling has mainly been used to monitor gene expression under two

or more conditions, using ribosome occupancy to report on protein levels, the method has

tremendous potential to shed light on the mechanism of protein synthesis in vivo. There have

been many insights obtained using ribosome profiling since the method was first introduced by

Ingolia and Weissman in 2009. In that first report, several millions of sequencing reads of

ribosome-protected mRNAs were obtained in yeast Saccharomyces cerevisiae. 28-nt long reads

yielded the best information about the position of the ribosome, suggesting that the nuclease

digested completely to the 5’- and 3’-boundaries of the ribosome35. Sequencing reads that

aligned to ORFs revealed ribosome positions on mRNAs with nearly single-nucleotide precision

and showed a strong 3-nt periodicity, providing a means of monitoring the ribosome’s reading

frame35. Genome-wide measurements of ribosome occupancy also generated estimates of the

levels of protein synthesis, which were found to be better predictors of protein abundance by MS

than were measurements of mRNA levels.

https://en.wikipedia.org/wiki/Nucleobase

12

In a later paper by Ingolia and Weissman, the ribosome profiling technique was adapted

to mammalian cells36. Using the drug harringtonine, which specifically inhibits elongating

ribosomes near the start codon (but not at later sites in the ORF), new translation was inhibited in

cells36. Following the addition of harringtonine, ribosomes were allowed to run off for various

time periods to generate a series of snapshots and reveal a moving picture of translation in vivo36.

Ribosome footprints successfully revealed a progressive depletion of ribosomes from the 5’- to

the 3’-end along mRNAs as the run-off time increased after harringtonine treatment36. The data

showed that ribosomes translate at a rate of 5.6 amino acids per second, consistent with previous

biochemical observations.

In addition to their work on measuring rates of translation with harringtonine, Ingolia and

Weissman also detected sites where ribosome occupancy was strongly enriched due to pausing

during elongation. Programmed nascent chain-mediated pauses were observed on the Sec61b and

Xbp1 genes36. More generally, the PPE/D motif was shown to be associated with internal pauses,

consistent with the slow translation of polyproline stretches. Finally, they also found that

ribosome footprints revealed many alternate ORFs, upstream of canonical ORFs on the same

mRNA, many non-canonical translational start sites (especially at the CUG codon), and evidence

for translation of putative long, non-coding RNAs (lncRNA)36.

High-precision ribosome footprints obtained by ribosome profiling make it possible to

detect programmed frameshift sites. Coupled with a computational method for detecting

transitions between reading frames, ribosome profiling data obtained from human cells revealed

where the same genomic segment is translated in more than one reading frame37. The 3-nt

periodicity of ribosome occupancy shifts if the ribosome has moved into a different reading

frame during translation. Ribosome profiling data is a powerful tool for searching for alternative

13

coding regions in endogenous genes and perhaps in the future in viruses and other mobile genetic

elements rich in such phenomena.

Ribosome profiling in Drosophila melanogaster expanded our understanding of ribosome

readthrough, in which the ribosome continues to translate past a stop codon. Previous studies

have shown that this can occur in a regulated fashion. However, it was very difficult to detect

readthrough; only a small fraction of ribosomes keeps translating, extending proteins past their

predicted C-termini. The presence of ribosome occupancy after stop codons successfully showed

that readthrough is far more than previously predicted in D. melanogaster38.

Ribosome profiling helped to clarify the enzymatic activity of Dom34, a yeast protein

essential for releasing stalled ribosomes39. Dom34 is a homolog of eukaryotic release factor 1

(eRF1) though it lacks a canonical GGQ motif required for peptide release as well as the codon

recognition motif required for interacting with stop codons. Given these differences and its role

in rescuing stalled ribosomes, Guydosh and Green used ribosome profiling to identify specific

targets of Dom34, comparing wild type and dom34 knock-out strains of yeast39. Stalled

ribosomes are stabilized by the lack of Dom34; there are higher levels of ribosome occupancy on

certain mRNAs at stalling sites in the dom34 knock-out strain. The fragments protected by

stalled ribosomes tend to be shorter (15-18 nt) than normal reads. This is because Dom34

preferentially rescues stalled ribosome at the end of truncated mRNAs, meaning that the mRNA

bound to the ribosome is roughly 10 nt shorter at the 3’-end. Furthermore, they found that

Dom34 rescues ribosomes that are found in 3’ untranslated regions (UTRs) of mRNAs39. The

codon-independent rescue mechanism of Dom34 nicely supports the previously known

biochemistry of Dom34, which lacks conserved codon recognition motif39.

14

Ribosome profiling has become a powerful way to understand complex cellular processes

in living cells including protein synthesis. By monitoring protein synthesis in real time, ribosome

profiling has clarified previous biochemical observations and given ample insights on many

aspects. Since 2009, the technique is being more broadly adapted to study various organisms and

the data analysis methods are improving immensely.

1.2.4 Bacterial Ribosome Profiling: Promises and Obstacles

Ribosome profiling has also employed to study bacterial translation. The relatively small

genome of bacteria and less complex gene regulation would suggest that it should be equally

powerful in bacteria as in yeast, where it was first developed. However, there have been several

hurdles to its application in bacteria. The first is that RNase I, the nuclease used to digest naked

mRNA in the yeast protocol, is inhibited by E. coli ribosomes, preventing its use. RNase I

exhibits very little sequence specificity in cleavage and degrades RNA cleanly to the boundaries

of the ribosome, meaning that the 28 nt fragments observed in yeast yield single-nucleotide

resolution and information about the reading frame of the ribosome40. In contrast, all published

bacterial ribosome profiling studies use micrococcal nuclease (MNase), an enzyme with

pronounced sequence specificity40-41. The distribution of fragments generated by MNase is very

broad, with most reads roughly 15 – 35 nt in length, and no 3 nt periodicity or reading frame is

evident. Rather than assigning ribosome occupancy to the 5’-end of reads as was done originally

in yeast, Li and Weissman used a center-assignment strategy in which partial occupancy was

distributed over the length of the entire read, further blurring the position of the ribosome on the

message41. The use of MNase in bacterial ribosome profiling has interfered with an accurate

determination of the ribosomes position, interfering with analyses of ribosome pausing. In spite

15

of these limitations, there have been many advances in understanding bacterial translation using

ribosome profiling, and there have been on-going improvements on the technique itself.

The first ribosome profiling paper using E. coli was reported in 2011 by the Weissman

and Bukau groups in a study of the interaction of the nascent peptide chaperone trigger factor

(TF)40. The authors selectively purified ribosomes bound to TF by cross-linking the nascent

peptide to TF to stabilize transient interactions followed by immunoprecipitation with anti-TF

antibodies. Ribosome footprints from TF-bound and non-TF-bound ribosomes were then

obtained following the regular ribosome profiling procedure40. It turns out that trigger factor

binds about 120 codons downstream from the translational start site, contradicting previous

models that TF interacts as soon as the peptide comes out from the exit tunnel of the 50S subunit

in vitro. Selective ribosome profiling in E. coli by cross-linking the chaperone TF with the newly

synthesized polypeptide revealed valuable information of the kinetics of protein synthesis in

vivo.

Another early ribosome profiling study in E. coli claimed that the main source of pausing

during elongation in bacteria is Shine-Dalgarno-like sequences within open reading frames41. SD

motifs accounted for 70% of strong pauses in their dataset—sites where ribosome occupancy

were enriched more than 10-fold over the gene average. The authors concluded that elongation is

retarded by transient base pairing between SD motifs within ORFs and the aSD sequence in 16S

rRNA41. However, Mohammad et al. contradicted the finding with improved footprint

resolution42. The precision of the ribosome position is significantly improved by assigning

ribosome occupancy using the 3’-end of sequencing reads (3’-assignment) rather than

distributing occupancy over the whole read (center-assignment)42. This simple adjustment seems

to better position the ribosome occupancy along its mRNA transcript on all of known E. coli

16

footprint data so far, lining up the peaks seen at start and stop codons as well as programmed

pauses like the one observed in the SecM protein. It seems that MNase does a better job in

cleaving mRNAs up to the boundary of the ribosome at the 3’-end site of the bound mRNA,

resulting in more tight cleavage at the 3’ end but relaxed cleavage at the 5’ end of ribosome-

protected mRNAs42.

With the higher resolution obtained by 3’-end assignment it became clear that the signal

observed by Li and Weissman could be split into two separate signals due to two distinct

phenomena: first, ribosomes pause with Gly codons in the E site, perhaps due to the nature of the

inhibitor in the lysis buffer used to arrest translation. Second, the bona fide SD pauses arise from

preferential enrichment of long RNA fragments from the population of ribosome footprints. It

turns out that mRNAs that interact with the aSD in 16S rRNA are protected from digestion at the

5’-end and so are longer on average than reads lacking SD motifs. By failing to isolate the entire

population of ribosome-protected mRNA fragments, Li and Weissman had artificially enriched

for SD containing reads, observing pausing that is not necessarily there in vivo.

Ribosome profiling data can be used to quantify the absolute levels of protein synthesis

genome-wide. In bacteria, many protein complexes are made of components synthesized from a

single operon. In the operon encoding ATP synthase, for example, eight proteins are encoded on

a single polycistronic RNA. The components are incorporated into the complex with various

stoichiometries: 1:10:2:1:3:1:3:143. Li and Weissman showed that bacteria optimize expression

of such complexes by proportional expression, matching the level of expression to the amount

required for complex formation43. This avoids wasteful synthesis of extra protein that would

need to be degraded and raises very interesting questions about how these differences in

translational efficiency are generated.

17

1.3 The Endoribonuclease RelE

In 1983, a mechanism that ensures stable maintenance of plasmids in E. coli was

reported44. When a strain of E. coli cell carrying the F plasmid loses it while dividing, the cell

without the plasmid is not viable44. Only the cells carrying the F plasmid are viable and

continued to divide44. This is because a segment of the F plasmid plays an important role in

maintaining the plasmid in the cell44. This segment, designated ccd (coupled cell division)44, is

composed of two parts: one (ccdB) that expresses an inhibitor of DNA gyrase45 and another

(ccdA) that expresses a protein that inactivates CcdB through forming a protein-protein

interaction44, 45b, 46. When cells inherit the F plasmid, both CcdA and CcdB are expressed and

form complexes that do no harm to the cell44, 46. However, if cells loose the plasmid after

division, the unstable CcdA46 gets degraded rapidly, leaving excess CcdB. Once freed, CcdB

becomes active and arrests cell growth, leading to death upon further division45b, 47. With this

mechanism, the plasmid ensures that it is replicated and passed on to all the viable daughter

cells45b, 47. In this context, the mechanism is called post-segregational killing (PSK)45b, 48 and the

protein module is sometimes called as addiction module49. Later, many such protein pairs were

found on the E. coli chromosome and named as toxin-antitoxin (TA) pairs, since one has

cytotoxic activity and the other antagonizes its pair48.

RelE is a toxin widely found in bacteria and archaea48. As an endoribonuclease, RelE is

known to selectively cleave translating mRNAs by binding to the ribosomal A site and inducing

global translational inhibition50. RelE activity is induced under nutritional depletion and is

closely related to prokaryotic cell survival and pathogenicity including general stress responses

and persister formation50b. RelE is one of the most studied toxins, but its activity and mechanism

are still not well understood.

18

Pedersen et al. showed codon-specific mRNA cleavage by RelE in their in vitro kinetic

assays50a. They observed that UAG and UAA among stop codons and CAG and UCG among

sense codons were the most rapidly cleaved by RelE at the ribosomal A site, normally between

the second and third bases50a. Neubauer et al. solved the structure of 70S ribosomes bound to

RelE; their data partly explain why RelE cleaves mRNA in a codon specific manner and

suggested a general acid-base catalytic mechanism51. They proposed that purines are favored for

the second and third base of A-site codon for better stacking with residues from RelE and 16S

ribosomal RNA in facilitating RelE activity51.

Hurley et al., however, observed different RelE cleavage patterns in vivo52. They found

that RelE cut mRNAs at several sites frequently and efficiently within the first 100 codons of

coding regions, and rarely cut mRNAs at sites near the 3’ end52. They did not see any preference

for CAG or UCG sense codons or any statistically significant sequence preferences. Moreover,

they noticed that some cleavages occurred after the third base of A-site codon while most

cleavages occurred between the second and third as previously known. They argued that their

observance is more consistent with the fact that RelE causes rapid and comprehensive mRNA

degradation and concomitant growth arrest52.

Recently, we found in our work that RelE has some advantages for use as a nuclease in

the ribosome profiling protocol. When added to cell lysates, purified RelE can cleave translating

mRNAs at ribosomal A site and provides better information on ribosome position and reading

frame than does MNase digestion. However, the sequence preference of RelE activity is

problematic and requires further refinement and computational adjustment in analyzing data.

This work will be described in Chapter 2.

19

1.4 tRNA Methylations and TrmD

TrmD is a bacterial-specific S-adenosyl-methionine (AdoMet)-dependent tRNA methyl

transferase. Methylation of tRNA can affect the accuracy of protein synthesis and the

maintenance of reading frame. The N1-methylguanine at position 37 (m1G37)-tRNA product of

TrmD improves the accuracy of protein synthesis on the ribosome. m1G37-tRNA reduces +1

frameshift (+1FS) errors at slippery mRNA sequences and decoding errors of uridine-5-

oxyacetic acid at position 34 (cmo5U34), a wobble modification that frequently accompanies

m1G37 in natural tRNAs53. Loss of TrmD leads to accumulation of +1FS errors leading to pre-

mature termination of protein synthesis at out of frame stop codons. trmD is an essential gene in

E. coli and many bacteria and its biochemical mechanism is very different from eukaryotic tRNA

methyltransferases, leading to interest in TrmD as a drug target50b, 54. We sought to achieve a

cellular-level understanding of how inactivation of TrmD reduces synthesis of proteins to

determine the reasons for its essentiality. In Chapter 3, we report ribosome profiling studies to

observe defects in translation and identify genes whose expression is altered in TrmD deficient

cells.

20

2 MRNA CLEAVAGE BY RELE

2.1 Introduction

Bacteria face enormous selective pressure from their physical and chemical environment

and from competing micro-organisms. In response to this pressure, bacteria have evolved

mechanisms to rapidly regulate gene expression in response to reduction in the levels of

available nutrients, for example, or in response to antibiotics released by other organisms.

Another strategy bacteria use to deal with these stresses is to maintain a small fraction of the

population in a dormant state that survives environmental insults and then resumes growth when

conditions improve55. This strategy plays an important role in antibiotic resistance because

dormant cells (known as persisters) are not killed, even at high antibiotic concentrations.

Shutting down protein synthesis is an essential step for both of these strategies55; for example,

regulating gene expression in response to nutrient starvation via the classical stringent response

and maintaining persister cells in a dormant state as a bet-hedging strategy56.

The relE gene plays a role in stress-response pathways by blocking translation50b. The

gene was originally discovered in genetic screens involving nutrient starvation and was named

for its effect in the stringent response. During amino acid starvation, the concentration of

uncharged tRNAs increases; as these tRNAs bind to the ribosome, they activate the synthesis of

guanosine tetraphosphate (ppGpp) by the RelA protein57. Through its interactions with RNA

polymerase, ppGpp leads to stringent inhibition of rRNA synthesis57. E. coli mutants in which

21

RelE is more highly expressed quickly shut down translation upon amino acid depletion,

stopping the consumption of amino acids and rapidly lowering ppGpp to its pre-starvation levels,

allowing rRNA synthesis to resume after a brief delay58. The synthesis of rRNA during amino

acid starvation is characteristic of the ‘relaxed’ phenotype observed in mutants of genes in

stringent response pathway, hence the name RelE59.

In addition to its role in gene expression, RelE is also a member of the type II toxin-

antitoxin family implicated in persister-cell formation59a. Overexpression of the RelE toxin

causes a reversible inhibition of cell growth resembling the dormant state characteristic of

persister cells59a. Growth resumes when RelE is neutralized by overexpression of its binding

partner, the RelB anti-toxin. Under rich conditions, RelB is expressed at a slightly higher level,

masking RelE activity, but under stress conditions, Lon protease degrades enough of the more

labile RelB anti-toxin to induce RelE activity60. Even under rich conditions, stochastic activation

of RelE and similar toxins is thought to be responsible for inducing a persister-like state in a

small fraction of cells in culture61.

Like at least ten members of the type II toxin-antitoxin family in E. coli, RelE exerts its

effects by cleaving mRNA and inducing translational arrest62. RelE binds in the ribosomal A site

and cleaves the RNA after the second nucleotide in the A-site codon50a. It is an unusual

endonuclease in the sense that it does not cleave RNA outside of this context50a; the ribosome

provides an environment essential to its catalytic activity. A catalytic mechanism has been

proposed based on the X-ray crystal structure of RelE in a 70S ribosome complex and follow-up

kinetic studies51, 63. Initially, RelE was reported to be highly specific for a select codons (CAG,

UCG, and the stop codon UAG) based on its in vitro kinetics50a, but more recent studies suggest

that it has quite a broad specificity in vivo with only a modest sequence preference52. Analyses of

22

cleavage sites on a handful upon highly-expressed genes identified many cleavage sites with

only a modest preference for cleavage before G50a, 52. Woychik and co-workers further made the

puzzling observation that RelE cleaves primarily at the 5’-end of mRNAs, within about the first

100 codons52; the mechanism underlying this polarity is unknown.

Here we report a genome-wide characterization of protein synthesis in E. coli upon RelE

overexpression. Unlike the elegant method of Woychik and co-workers, who adapted RNAseq to

detect cleavage by the MazF toxin64, the ribosome profiling method we employed reports on the

position of ribosomes, allowing us to observe RelE’s effect on translation. We find that ribosome

density is strongly enriched at the 5’-end of genes and propose a model involving cycles of

mRNA cleavage, rescue of stalled ribosomes, and initiation that explains the earlier observation

of preferential RelE cleavage in the first 100 codons52.

Furthermore, we find that RelE can be used to improve the resolution and power of

ribosome profiling in bacteria. As originally developed in yeast, the protocol calls for digestion

of naked mRNA with RNase I to generate ribosome footprints35. As RNase I is inhibited by E.

coli ribosomes, bacterial ribosome profiling studies have used MNase instead. Unfortunately,

MNase is sequence specific, creating strong sequence bias at both the 5’- and 3’-ends of the

RNA fragments, enriching or suppressing specific RNA fragments depending on their nucleotide

sequence. Additionally, MNase generates a broad distribution of the lengths of the ribosome

footprints, unlike the tight 28 nt footprint observed in ribosome profiling studies in yeast42. This

means that the position of the ribosome cannot be determined with sufficient precision to

determine its reading frame42. Addition of purified RelE to cell lysates generates ribosome

footprints that, for the first time, provide excellent information on the position and reading frame

of the ribosome.

23

2.2 Results and Discussion

2.2.1 Ribosomes are Enriched at the 5’-end of Genes in Cells Overexpressing RelE

To study the effect of RelE on protein synthesis in vivo, we overexpressed RelE in wild-

type E. coli MG1655 cells and analyzed ribosome occupancy genome-wide using ribosome

profiling. RelE expression from an araBAD promoter was induced for one hour beginning in

early log phase. Given the high level of RelE expression (one-quarter of ribosome footprints map

to the RelE mRNA) and the long induction period, we observed very pronounced effects on

protein synthesis, presumably representing an equilibrium state in which RelE activity has

exerted its effect and cell growth has slowed or halted. Although strong overexpression no doubt

exaggerates the effects that we observe in comparison to activation of endogenous RelE by

physiological stimuli, the resulting data provide a clear picture of the scope and specificity of this

endonuclease and its effects on gene expression.

The most striking effect of RelE overexpression is the dramatic enrichment of ribosome

occupancy at the 5’-end of genes. In the model of the manX gene shown in Figure 1, for

example, far more ribosome footprints map to the 5’-end of the gene than the 3’-end (red). In

contrast, in the wild-type control (WT1, black), ribosome occupancy is distributed more or less

similarly across the manX gene. These observations hold true genome wide. In a plot of average

ribosome occupancy of about 1000 genes over 1000 nt long that were aligned at the start codon,

ribosome coverage remains fairly constant in the wild-type control, whereas in the RelE1 data

ribosomes are highly enriched in the first 400 nt and depleted after the first 400 nt. (Note that the

vertical spread in the RelE1 data arises from the reading frame, as will be discussed below). In

addition, a very strong peak of ribosome density is observed at the start codon in the RelE1 data.

These data show that as a result of RelE activity, ribosomes accumulate at the 5’-end of genes

24

and are depleted at the 3’-end. This suggests that the number of ribosomes completing the

synthesis of full-length proteins is strongly reduced, consistent with RelE’s known function of

inhibiting protein synthesis and arresting cell growth.

Given that RelE is an endonuclease that cleaves mRNA inside the ribosomal active

site50a, we also looked at steady-state levels of mRNA in RNAseq libraries prepared from the

WT1 and RelE1 samples. Here again, we see enriched RNA density at the 5’-end of the manX

Figure 1 Average ribosome occupancy

Ribo-seq (left) and RNAseq (right) reads at the manXYZ operon (top). Average ribosome occupancy at

about 1000 genes aligned at their start codons (bottom).

25

gene when RelE is overexpressed (Figure 1, top right, blue), whereas the coverage is relatively

uniform in the wild-type control (grey). When the RNAseq from many genes is averaged

together, a modest two-fold increase in density at the 5’-end is observed in the RelE1 data

(Figure 1, bottom, blue). These findings are consistent with the ribosome profiling data in

showing a 5’ to 3’ polarity.

Finally, a careful inspection of gene expression across polycistronic transcripts suggests

that ribosome density decays not only across single genes but across operons as well. The

manXYZ operon, for example, is expressed as a single transcription unit from a single promoter,

a fact supported by equal levels of RNA density for the three genes in the WT1 RNAseq dataset

(Figure 1, top right). Translation of all three genes is also observed in the WT1 Ribo-seq data

(top left, black). In contrast, lower levels of RNA and far lower levels of ribosome footprints are

observed for the downstream manY and manZ genes compared with manX.

To quantify this effect, we compared the levels of about 2500 genes in the RelE1 and

WT1 samples; these ratios are shown separated by the position of the gene within its operon

(Figure 2). The transcriptome of E. coli is remarkably complex: many operons overlap and often

multiple promoters regulate subsets of genes within an operon. To define single transcription

units, we started with roughly 4,000 annotated operons in the RegulonDB database and excluded

those in which the genes varied in RNAseq density by more than five-fold (using RNAseq data

from the WT1 sample). Like the manX gene, which has three-fold more ribosome footprints in

the RelE1 sample than in the WT1 sample, most genes that are at the first position in an operon

or are monocistronic have more ribosome footprints in the RelE1 sample. In contrast, genes that

lie downstream on the operon have fewer ribosome footprints. The manZ gene has two-fold

26

fewer reads in the RelE1 sample

than in the WT1 sample (Figure

1). The distribution for genes at

the first position versus genes in

the fifth position or further

downstream is statistically

significant (Mann-Whitney P-

value of 8.4 x 10-22). These

differences are also observed at

the RNA level, although

perhaps to a lesser extent. In

summary, our data indicate that

upon RelE overexpression, ribosomes are enriched not only at the 5’-end of the units of

translation (open reading frames) but at the 5’-end of units of transcription (polycistronic

mRNAs).

We propose the following model for enrichment of ribosomes at the 5’-end of genes upon

RelE expression. As RelE begins to accumulate in the cell, it enters the A site of ribosomes and

cleaves mRNA, preventing further cycles of elongation. Stalled ribosomes are rescued by

tmRNA-SmpB or by the backup system comprised of the ArfA protein and RF2. Several

biochemical studies have shown that tmRNA reacts particularly well with ribosomes following

the loss of mRNA downstream of the A site upon RelE cleavage65. Both the tmRNA and ArfA

systems release the nascent peptide from the stalled ribosome and promote recycling of the

subunits. The free subunits are then free to assemble on another transcript at the start codon and

Figure 2 Ribosome enrichment at 5’-end genes of operon

Ribosome occupancy (red) and RNAseq density (blue) are

enriched in genes at the 5’-end of operons upon RelE

overexpression.

27

begin elongation again. If RelE cleaves the initiation complex, a strong start codon peak is

observed. If a few cycles of elongation take place before cleavage of the mRNA, ribosomes

move into the coding sequence away from the start codon. In this manner, cycles of mRNA

cleavage, ribosome rescue, and initiation could enrich for ribosomes at the 5’-end of genes.

It is possible that RNA decay mechanisms contribute to this effect as well. Following

cleavage by RelE, the 3’-end of the upstream RNA fragment is protected from degradation by

exonucleases by the stalled ribosome. The higher ribosome density on the upstream fragment

could also prevent degradation by endonucleases such as RNase E. In addition, the upstream

fragment has a 5’-triphosphate which is stabilizing. In contrast, the downstream fragment may be

less able to recruit ribosomes and will have a 5’-monophosphate that helps to recruit RNase E

promoting further endonucleolytic cleavage and RNA decay.

In support of this model, we note that in a previous study of RelE cleavage in highly

expressed genes using primer extension, cleavage sites were over-represented at the 5’-end of

genes, although no explanation for this observation was provided. This observation is related but

subtly different from ours: we observed enrichment of ribosomes at the 5’-end of genes, and they

observed high levels of RelE cleavage at the 5’-end of genes. The reloading of ribosomes at start

codons followed by a few rounds of elongation prior to additional cleavage events could tie these

observations together: RelE is targeted to the 5’-end of genes because that is where the

ribosomes accumulate over time.

As further evidence of the importance of rescuing stalled ribosomes, we note that some of

the rescue machinery is strongly upregulated in the RelE1 sample. Steady-state levels of the

backup rescue factor ArfA are 15-fold higher at the RNA level in the RelE1 sample. The levels

of synthesis of the ArfA protein are more than 80-fold increased. ArfA expression is regulated by

28

tmRNA in an elegant feedback mechanism: the transcript is cleaved by RNase III within the

open reading frame, leading to ribosome stalling and production of a truncated protein product.

tmRNA rescues these ribosomes stalled during ArfA synthesis and tags the truncated product for

proteolysis. If the capacity of tmRNA is exceeded, ArfA begins to accumulate through a poorly

understood drop-off mechanism. The fact that ArfA is strongly upregulated in cells treated with

RelE indicates that the tmRNA system is overwhelmed and that cells are responding for the need

for more ribosome rescue activity by strongly upregulating ArfA expression. Taken together, our

findings support a model in which mRNA cleavage by RelE, ribosome recycling, and initiation

enrich ribosome occupancy at the 5’-end of genes.

2.2.2 RelE Predominantly Cleaves mRNA after the Second Nucleotide in Empty A sites

The ribosome profiling data obtained from cells overexpressing the RelE endonuclease

provide information about cleavage sites genome-wide. In a sense, this information is indirect;

we measure ribosome footprints and not the products of RelE cleavage themselves. In preparing

Figure 3 Average ribosome density at start- and stop-codon

Average ribosome density at start (left) and stop codons (right) for Ribo-seq data from wild-type

cells or cells overexpressing RelE.

29

the footprints, another nuclease (MNase) is used to digest mRNA unprotected by ribosomes.

Although the MNase treatment could potentially interfere with our ability to detect RelE

cleavage events, we find that for the majority (if not all) of the footprints, the 3’-end is generated

by RelE cleavage and not MNase. Perhaps this is due to the high concentration of RelE in the

cell over the hour-long induction period.

The fact that most ribosome footprints are generated by RelE is clearly evident in the

position of the peak at the start codon in plots of average ribosome occupancy (Figure 3, left).

During initiation, the AUG start codon is positioned in the ribosomal P site. In the WT1 control

sample, MNase digests mRNA back to the 3’-boundary of the ribosome, 15 nt downstream of the

first nucleotide of the start codon (black). Although the start codon peak at +15 is small in the

WT1 sample, its position matches what we previously observed in many bacterial profiling

libraries generated with MNase. The position of the ribosome is most accurately and precisely

determined from the 3’-end of footprints in bacterial ribosome profiling; plots of ribosome

density reflect the 3’-end of sequencing reads. Furthermore, this position is consistent with the

distances observed in toeprinting assays in which reverse transcriptase is blocked by the 3’-

boundary of the ribosome and arrests 15-16 nt downstream of the first nucleotide in the P site

codon66. In contrast, the start codon peak in the RelE1 data is four nt downstream of the first

nucleotide of the start codon (red). In other words, the mRNA is cleaved between the second and

third nucleotide in the A-site codon within initiation complexes. The fact that the signal is far

higher at +4 than at +15 means that most if not all of the ribosome footprints are cleaved by RelE

at their 3’-ends.

The same phenomenon is observed at peaks at stop codons in plots of average ribosome

occupancy (Figure 3, right). During termination, the stop codons UAG, UGA, or UAA are

30

positioned in the ribosomal A site. In the WT1 sample, MNase digests back to the 3’-boundary

of the ribosome, 12 nt downstream of the first nucleotide in the stop codon (black). In contrast,

RelE cleaves after the second nt in the stop codon (red) in the A site. Together, the data at start

and stop peaks show that RelE cleaves predominantly after the second nucleotide in the A-site

codon and that most ribosome footprints were generated by RelE cleavage at their 3’-ends. This

pattern of cleavage is consistent with previous enzymatic, structural, and primer extension

studies of RelE activity.

In examining the peaks at other well-characterized ribosome pausing sites, we observed

that the ribosomal A site must be empty for RelE cleavage to occur. The SecM and TnaC

polypeptides interact with the ribosome to inhibit their own synthesis in response to specific

cellular signals67. The ribosome stalls at the RAGP sequence in SecM, for example, with the Gly

codon in the P site and unreactive Pro-tRNA bound in the A site67a. A strong peak 15 nt

downstream of the Gly codon in the WT1 sample corresponds to this well-understood pausing

event when the samples are treated only with MNase (Figure 4, left). In the RelE1 data, the

expected peak after the second nucleotide in the Pro codon is not observed; instead, the strongest

Figure 4 Ribo-seq data at nascent peptide-mediated stalling sites

SecM (left) and TnaC (right).

31

peak overlaps with the one observed in the WT1 sample 15 nt downstream of the Gly codon.

This suggests that RelE is unable to cleave the Pro codon in the A site in stalled SecM

complexes, presumably due to the Pro-tRNA trapped and blocking RelE entry. Similarly,

ribosome stalling at the C-terminus of the TnaC peptide (Figure 4, right) is observed in the WT1

data with a strong peak 15 nt downstream of the final residue, Pro67b. In the RelE1 data, a peak is

expected in the stop codon positioned in the A site, but none is observed, only the same peak

observed in the WT1 data. RelE is probably blocked by trapped release factor 2 that is recruited

but cannot complete hydrolysis of the nascent peptide. These findings suggest that RelE is

capable of competing with aminoacyl-tRNA and release factors for access to the A site during

normal elongation and termination, but that factors that are trapped long-term during

programmed ribosomal arrest block RelE activity.

2.2.3 in vitro Digestion with RelE Reveals the Reading Frame

It has been impossible to detect reading frame in ribosome profiling studies in bacteria

and the MNase enzyme is at least partly to blame, leaving one or more nucleotides undigested at

the 3’-boundary of the ribosome. Given the precision with which RelE cleaves after the second

nucleotide in the A site when expressed in vivo, we hypothesized that RelE might prove to be a

useful tool in ribosome profiling, generating more reliable 3’-ends and improving the resolution

at which the position of the ribosome can be determined. We purified RelE and added it to cell

lysates under our regular digestion conditions in vitro; MNase was included at the regular

concentrations to digest mRNA to the 5’-boundary of the ribosome. Ribosome footprints 10 – 40

nt in length were then cloned and sequenced following our usual protocol.

32

Comparison of profiling libraries obtained by expression of RelE in vivo (RelE1, Figure

1) with those obtained by in vitro RelE digestion (RelE2, Figure 5) reveals many similarities but

also an important difference. The activity of RelE in vitro can be seen in plots of average

ribosome occupancy across many genes aligned at start codons. A strong start codon peak in the

RelE-treated sample (RelE2) is found at the +4 and not the +15 site (Figure 5). Furthermore, like

the RelE1 data in Figure 1, the vertical spread in the RelE2 signal is substantially larger than that

observed in a library prepared from the same biological sample using only MNase (WT2),

indicative of 3’-periodicity reflecting reading frame. On the other hand, we do not observe the

marked decay of ribosome density at the 3’-end of genes that we observe when RelE is

overexpressed in vivo (see the data from the RelE1 library in Figure 1). This is because

translation is blocked in the cell lysate; elongation is arrested by chloramphenicol and initiation

is probably inhibited by dilution of the necessary factors and exhaustion of GTP. This is further

Figure 5 Average ribosome occupancy for a wild-type sample

RNA was digested in vitro with MNase alone (black) or with purified RelE protein (red).

33

evidence that the accumulation of ribosome occupancy at the 5’-end of genes in vivo is due to the

dynamics of cleavage, rescue, and initiation and not RelE activity per se.

Returning to the question of reading frame, we calculated the average ribosome density at

all three sub-codon positions in open reading frames throughout the genome. The library

prepared with MNase (WT2) shows slightly higher density at the first nucleotide (40%) than the

other two nucleotides (30% each, Figure 6, top left). Given that this effect is lacking in footprints

mapping to 3’-

untranslated regions (3’-

UTRs) (where the reading

frame is defined as the

same as the preceding

ORF), it may be tempting

to attribute this 3 nt

periodicity to the

ribosome’s reading frame

(bottom left). We found,

however, that RNAseq

libraries prepared by

digestion of total RNA by

low concentrations of

MNase yielded the same

result—ribosome occupancy is enriched at the first sub-codon position in ORFs but not in 3’-

UTRs. The sequence specificity of MNase coupled with the nucleotide bias in ORFs explain this

Figure 6 Ribo-seq and RNAseq density at each position

Ribosome density within codons in coding sequences (top) or 3’-

untranslated regions (bottom). Right: sequence bias in the genome

in the CDS or 3’-UTR.

34

observation. MNase cleaves preferentially before A and T which are enriched at position two in

codons in ORFs but not 3’-UTRs (Figure 6, right). Cleavage before position two makes the 3’-

end of footprints align with the first sub-codon position. Given that ribosome occupancy is

assigned using the 3’-end of footprints, ribosome density appears higher at the first position

(40%) than the other two (30% each).

In contrast, the libraries prepared by RelE digestion in vitro show the strongest density at

sub-codon position two (Figure 6, top left), corresponding to cleavage before the third nucleotide

in the A site codon, as observed at both start and stop codons in average ribosome density plots

(Figures 3 and 4). Roughly 60% of footprints map to the second sub-codon position; 30% map to

position three while only about 10% map to position one. The large difference between the

density at the first two positions explains the large vertical spread in the RelE2 data observed in

Figure 5 and provides the first reliable information about ribosome reading frame in bacterial

ribosome profiling data. Footprints from the 3’-UTR show no evidence of reading frame,

consistent with the expectation that ribosomes found there are not synthesizing protein or

moving in 3 nt steps (Figure 6, bottom left).

2.2.4 RelE Prefers to Cleave after C and before G

By analyzing the 3’-ends of ribosome footprints, we are able to observe the specificity of

the MNase and RelE enzymes. As others have shown, MNase preferentially cleaves before the

nucleotides A and T; in our data we observe that A and T are strongly enriched downstream of

the cleavage site, both +1 nt and to a lesser extent +2 nt as well (Figure 7, left). In contrast, A

and T are underrepresented upstream of the cleavage site, both −1 nt and to a lesser extent −2 nt.

The sequence preferences of RelE are very different from those of MNase: C is preferred at the

−1 position while G is selected against (Figure 7, right). Following the cleavage site, G is

35

strongly preferred and C is selected against. There appears to be a slight avoidance of A and T at

both positions. These tendencies hold true whether RelE digestion occurs in vitro or in vivo. In

contrast with what we observed with MNase, the sequence two or more nucleotides away from

cleavage site exert a small effect.

Although initial reports of RelE activity described the endonuclease as highly specific for

a few codons, additional studies revealed a more relaxed specificity. The preference for G after

the cleavage site was anticipated by the kinetic studies of Ehrenberg and co-workers, who found

that kcat/KM values for RelE cleavage were markedly higher for codons ending in G50a. The

presence of G at the third position of the codon had the strongest effect in explaining their kinetic

data. Woychik and co-workers also reported a strong preference for cleavage before G in five

highly expressed genes in E. coli52. Finally, the X-ray crystal structure of RelE bound in the A

Figure 7 Sequence bias at the 3’-end of ribosome footprints

Different cleavage patterns due to cleavage activity by RelE both in vivo and in vitro. All samples

were also digested with MNase in vitro.

36

site of 70S ribosomes offers a possible explanation of this preference: the G in the third position

of the A site codon stacks on the base of C1054, a conserved 16S rRNA nucleotide in the

decoding center. The structure suggests that direct contact of RelE residues with the G nucleotide

are also possible.

In contrast to the good agreement of our data with these earlier studies regarding the

specificity of the downstream nucleotide at the cleavage site, our observations of a marked

preference upstream of the cleavage site are unexpected. There is little evidence in the literature

of a strong bias for C and against G at the −1 position, raising the possibility that this effect is an

artifact of ribosome

profiling, arising from

biases in cloning

ribosome footprints.

We find, however,

that RNAseq libraries

show no such bias

when fragments were

prepared by alkaline

hydrolysis and then cloned and sequenced exactly as the profiling libraries. This finding rules out

the possibility that the preference for C is the result of cloning bias and strongly implicates RelE

cleavage itself as the cause, given that the method of RNA digestion is the only difference

between the two protocols.

Figure 8 Sequence bias at the 3’-end of RNA fragments

in two libraries. The fragmentation was done by alkaline hydrolysis.

37

2.2.5 The Specificity of RelE Interferes with Analyses of Ribosome Pausing

Although the generation of ribosome

footprints with RelE allows us to determine

the ribosome position and reading frame

with high resolution, the use of RelE incurs

certain disadvantages as well. Both MNase

and RelE are sequence-specific nucleases, a

property that creates bias at the ends of

ribosome footprints. As shown in Figure 9

(top), plots of average ribosome occupancy

at sense codons are noisy when the 3’-end of

the reads line up with the codon of interest

(that is, the distance from the codon is close

to zero). This noise arises from enrichment

of certain sequences at the 3’-end of reads at

the specific nucleotides in the codon being

averaged. For example, there is a strong

peak at 0 in the GTA codon (green) because

cleavage after G and before T is optimal

given the specificity of MNase. In contrast,

there is a strong peak at 1 for the CCA

codon (blue) because cleavage after position

2 is preferred. Although the sequence bias at

Figure 9 The ribosome occupancy on codons

Top: Average ribosome occupancy at all 61

sense codons. The position corresponding to

the codon in the A, P, or E site is indicated.

Middle: The same plot with data from the RelE

in vitro digest. Bottom: the differences in pause

score at all 61 sense codons between ribosome

profiling libraries.

38

the 3’-end of reads skews the data, in practice this noise can be ignored because the noise that

this creates is at the 3’-boundary of the ribosome, far away from functional sites such as the A, P,

and E sites for tRNA-binding. When the 3’-end of reads are 15 nt downstream of codons of

interest, positioning the CCA and GTA codons in the P site, for example, noise from cloning bias

does not interfere.

In contrast, the pronounced sequence specificity of RelE creates noise that obscures the

signal right at functional sites in the ribosome. Because RelE cleaves mRNA in the ribosomal A

site, the noise generated by bias at the 3’-end exactly overlaps the A site, making it difficult to

draw conclusions about the dwell time of ribosomes on specific codons during elongation or

termination (Figure 9, middle). We computed pause scores for three ribosome profiling libraries;

two biological replicates in which the footprints were generated by MNase (WT2 and WT3) and

the third library from the same biological sample as the WT2 library but using both RelE and

MNase (RelE2). We observed relatively small differences in the pause scores between the WT2

and WT3 libraries, in spite of the fact that they were derived from independent biological

samples (Figure 9, bottom). This holds true for all three tRNA-binding sites. There were

significant differences, however, between the levels of pausing in the WT2 and RelE2 data, even

though they were derived from the same biological sample. The differences were more

pronounced in the A site than the P or E sites. We conclude that in vitro digestion by RelE

introduces noise that complicates the analysis of ribosome pausing and argue that the MNase

data are more likely to represent an accurate view of pausing in vivo given that the cloning bias

in the MNase libraries is farther away from the sites of interest.

39

2.2.6 Refined RelE-Derived Ribosome Density Better Reflects Reading Frame

The sequence specificity of RelE alters the pattern of cleavage in predictable ways; by

taking this into account, we can obtain even better information regarding reading frame. Looking

in more detail at average ribosome occupancy at sense codons positioned in the A site, we find

that most codons are cleaved after the second nucleotide, as expected from the analyses of start

and stop codons above. In Figure 10 (left), the ribosome occupancy has been shifted such that the

A-site codon starts at zero; the peak at 1 observed in most codons (grey) corresponds to cleavage

after the second nucleotide. A subset of codons has higher density at position 2 corresponding to

cleavage after the third nucleotide in the codon (Figure 10, red): all of these codons end in C.

Given the specificity depicted in Figure 7, where cleavage after C is preferred and cleavage

before C is strongly inhibited, it makes sense that NNC codons are cleaved after the third

position, between the A site codon and the codon downstream. To compensate for this cleavage

bias, we shifted the ribosome density at all NNC codons from the third nucleotide to the second.

This improved the signal at the second position from about 60% to about 80%, with 10%

remaining at position one and 10% at position three (Figure 10, right). This suggests that two-

Figure 10 Correction for better reading frame

Left: average ribosome occupancy on all sense codons, shifted from Figure 9 so that the A site

codon starts at zero. Right: reading frame when the reads on NNC are shifted from position 3 to

position 2.

40

thirds of reads that align to the third sub-codon position arise from NNC codons. Given what we

know about sequence specificity a more sophisticated algorithm could be employed in the future

to realign ribosome density to optimize reading frame.

Even by simply shifting density on NNC codons from the third to second position,

reading frame is significantly improved. With this level of resolution, frameshifting events can

be detected in bacterial ribosome profiling data, as can be seen in analysis of the programmed

frameshift on the prfB gene encoding RF268. When RF2 levels are limiting, ribosomes pause at a

stop codon at the 28th codon in the gene and then shift into the +1 frame. Following the

frameshift, ribosomes complete the synthesis of the RF2 protein which is encoded in the new

reading frame. When we split the ribosome occupancy signal into three components, with reads

that map to the first, second, or third sub-codon position, this frameshifting behavior is evident in

models of the prfB gene. Upstream of the frameshift site, the majority of ribosome footprints are

cleaved after the second sub-codon position, as expected. But downstream of the frameshifting

event, most footprints are cleaved after the third position, consistent with a +1 frameshift. Very

few reads map to the first sub-codon position, either before or after the frameshift site.

Figure 11 Programmed frameshifting at the prfB gene

Left: ribosome occupancy split into three component parts, by sub-codon position. Given RelE’s

preference for cleaving after the second nt, the blue data represent in-frame translation upstream of

the frameshift site and the red data represent in-frame translation downstream (in the +1 frame).

41

Quantitation of the reading frame upstream and downstream of the programmed frameshift site

further supports this conclusion. Going forward, the high degree of three nucleotide periodicity

arising from translational reading frame can be used to search for other programmed

frameshifting sites in bacteria, as has been done in human ribosome profiling studies37 .

2.2.7 Concluding Thoughts

Our ribosome profiling analyses of RelE activity in vivo reveal dynamic cycles of

cleavage, ribosome rescue, and initiation that enrich ribosomes at the 5’-end of genes. Although

it was previously known that RelE-cleaved mRNAs are good targets for the tmRNA rescue

system, our data highlight the importance of the tmRNA and ArfA systems in the cellular

response to and recovery from RelE activity. How cells exit the dormant state with high RelE

activity is not yet clear, but it seems likely that ribosome rescue is essential for this to occur.

Antibiotics are beginning to be developed to target the tmRNA rescue pathway, raising the

possibility of targeting persister cells due to their reliance on toxins to cleave mRNA and block

translation.

The use of RelE as a nuclease for ribosome profiling suffers from the same problem of

sequence specificity as the enzyme normally used, MNase. This makes the enzyme less than

ideal for generating libraries where careful measurement of ribosome pausing is the goal.

However, RelE cleaves the 3’-end of fragments more precisely, allowing us to observe the

ribosome’s reading frame for the first time in bacteria.

In the future, it may be possible to generate ribosome profiling libraries using RelE alone.

In our in vitro digests, we also added MNase to cleave the mRNA back to the 5’-end of the

ribosome generating a ribosome footprint. With RelE alone, cleavage would only occur in the A

site, generating fragments that are far longer than the 10 – 40 mer ribosome footprints that we

42

work with now. In theory, these fragments could be cloned without size selection. The distance

between ribosomes on messages could then be determined by sequencing both ends using 50 bp

paired Illumina sequencing. This might give insights into ribosome stacking at strong pausing

sequences and a picture of ribosome distribution on single messages, since the sequence between

two ribosomes would remain intact.

2.3 Materials and Methods

2.3.1 Bacterial Strains and Growth Conditions

E. coli K-12 strain MG1655 was used as a wild-type strain. The RelE overexpression

strain was constructed by transformation with a plasmid, pJC203, which contains an araBAD

promoter. Cell cultures were grown at 37°C in either LB media or MOPS media supplemented

with 1% glucose, all 20 amino acids, and other nutrients (Teknova) and RelE overexpression was

induced by adding arabinose (final 0.2%) at an O.D600 of 0.2-0.3. Cell cultures were grown

further either for 1h or 20 min before harvesting.

2.3.2 Preparation of Ribo-seq and RNAseq Libraries

Ribo-seq libraries were generated as described previously66 with some modifications.

Ribosome-protected fragments were size-selected between 20 – 40 nt in length initially, but 10 –

40 nt in the latter libraries in an attempt to include short fragments generated by RelE cleavage

activity. For the library digested by RelE in vitro, 1 nmol of purified RelE was also added when

25 AU of RNA in the lysate was digested with 3000 U of MNase (Roche) for 1 h at 25 °C.

43

RNAseq libraries were also size-selected between 10 – 40 nt in length in the latter

preparation to reveal 3’-end sequences, which was not available due to longer reads size

selection with the length between 40 – 60 nt.

2.3.3 Data Analysis

Data analysis obtained from Illumina sequencing were described previously66.

44

3 THE ROLE OF TRNA METHYLTRANSFERASE TRMD

This work was a collaboration with Isao Masuda of Ya-Ming Hou’s lab at Thomas

Jefferson University. Dr. Masuda created and characterized the TrmD depletion system and his

work is described here for completeness. My contribution was to assist him in optimizing the

growth conditions, generate the ribosome profiling libraries, and analyze and interpret the

profiling sequencing data.

3.1 Introduction

tRNAs are adaptor molecules that deliver amino acids to the ribosome for protein

synthesis. To ensure the fidelity of translation, each tRNA must carry its cognate amino acid and

decode mRNA using the correct match between its anticodon and the codon in mRNA. tRNAs

undergo numerous chemical modifications after their transcription; an average E. coli tRNA

contains 77 nucleotides and have about 8 of them modified. These modifications play an

important role in the fidelity of both aminoacylation and decoding.

Most tRNA modifications are located near the anticodon69. One of the purposes of

modification is to fine tune the anticodon so that a single tRNA can read multiple, degenerate

codons. There are 47 different tRNAs produced from 86 genes in E. coli; these tRNAs are able to

decode all 61 sense codons. tRNA modifications promote this flexibility in codon recognition

while at the same time maintaining fidelity. This is a difficult but critical balance to achieve: the

45

tRNAs must be allowed a certain amount of broadening of specificity but only under the right

circumstances.

tRNA modifications also play a critical role in maintaining the reading frame during

translation46. One particularly important enzyme for preventing frameshifting is TrmD, a

bacterial enzyme responsible for methylation of N1 of G37 of several tRNAs70. Unlike the case

with other modifications, deficiency of m1G37 leads to cell death in many bacteria including E.

coli70b, 71. TrmD is very different structurally and mechanistically from its eukaryotic homolog,

Trm572. Studying E. coli TrmD can give us a framework for understanding methyltransferases,

the role of modifying enzymes in tRNA stability and activity, and insight into a promising

antibiotic target. Here we report our efforts to understand the role of TrmD in living cells using

ribosome profiling.

3.1.1 tRNA Synthesis and Post-Transcriptional Modifications

In E. coli, tRNAs are transcribed as long precursors73. Various RNases process the

precursor tRNAs (pre-tRNAs) to mature, functional tRNAs by cutting excess residues at the 5’-

and 3’-ends74. For most pre-tRNAs, RNase E removes nucleotides at the 3’-end by an

endonucleolytic cleavage, leaving only a short 3’-trailer75. Then, RNase P interacts with the CCA

motif and removes nucleotides at the 5’-end of the pre-tRNA with a single endonucleolytic

cleavage; this creates the mature 5’-end of the tRNA. Finally, exonucleases such as RNase T,

PH, II, and D further trim the remaining 3’-trailer nucleotides up to the mature CCA motif at the

3’-end of the tRNA73.

All tRNAs have L-shaped tertiary structures formed by base pairing and coaxial stacking

of the helices. Each tRNA is slightly different but shares common secondary structural

features76. When depicted in two-dimensions, tRNA have a cloverleaf structure consisting of an

46

acceptor stem, D (dihydrouridine) loop, anticodon loop, and T loop. The acceptor stem contains

the conserved 3’-terminal CCA motif where the amino acid is covalently linked77. The structure

of the anticodon loop presents the anticodon in a way that allows it to form the correct base pairs

during decoding77. The other structures contribute to interactions with the translation machinery,

including the ribosome, aminoacyl-tRNA synthetases (aaRSs), and EF-Tu.

Following transcription, tRNAs are modified at several sites to enhance their cellular

functions, promoting their structure and stability, optimizing translational fidelity and accuracy,

and maintaining the correct reading frame. The first modification identified was pseudouridine

(Ψ), as the fifth nucleotide, which is an isomer of uridine78. Ψ is present in the T loop that is

recognized by aaRS’s and EF-Tu. Likewise, the dihydrouridine modification is found in the D

loop79. With two extra hydrogens added to its pyrimidine ring, dihydrouridine is no longer planar

or aromatic, disturbing base stacking, and destabilizes the structure, conferring flexibility79.

Most critical modifications of tRNA, however, occur in the anticodon loop. The first two

positions in a given codon base pair (in classical Watson-Crick fashion) with the second and

third positions of the anticodon, nucleotides 35 and 36 in the tRNA. The third nucleotide in the

codon in the ‘wobble’ position, however, can participate in a number of different pairing

geometries in its interaction with position 34 of the tRNA. Modification of position 34 can

broaden base pairing80: deaminating A34 to inosine, for example, in tRNAArg in bacteria allows

for pairing with A, C, and U at the wobble position. Alternatively, modifications of U34 to 5-

methoxycarbonyl-methyl-2-thouridine (mcm5s2U34) and the 5-methylation of cytosine 34

(m5C34) allow a single tRNA isoacceptor to read multiple codons81. The modifications promote

accurate codon-anticodon interactions, optimizing translation of the genetic code.

47

tRNA modifications also play a critical role in maintaining the reading frame of

translation80, 82. In most cases, frameshifting is far more detrimental than other decoding errors.

Missense errors that replace one amino acid for another often have little or no effect on the

stability or activity of the protein product. On the other hand, most shifts in reading frame lead to

pre-mature termination at a stop codon in the new reading frame, yielding truncated and inactive

polypeptides. Pseudouridine and 2-thiouridylation (s2U34) play roles in frame maintenance in

addition to their other functions78a, 83. One of the best characterized modifications involved in

frame maintenance is the methylation of G37, just downstream of the anticodon.

3.2 The TrmD Methyltransferase

The m1G37 modification involves a very ancient

pathway; proteins catalyzing G37 methylation are found in

all three domains of life84. m1G37 is found in most tRNAs

that read codons starting with C. It is thought that

methylation of G37 may block base pairing between this

nucleotide and the mRNA that would promote decoding of a

quadruplet instead of a triplet (Figure 12)84. Codons starting with C are read by tRNAs with G36;

the combination of G36 and unmethylated G37 is thought to be problematic. Other mechanisms

involving structural changes in the tRNA or faulty translocation on the ribosome may also play a

role.

There are many enzymes that facilitate tRNA methylation. Some enzymes add methyl

groups to carbon or nitrogen atoms in the bases, while others methylate oxygens in the ribose

sugars. In general, methyltransferases catalyze the transfer of a methyl group (CH3) from the

donor to a substrate molecule. In most cases, the methyl donor is S-adenosyl-L-methionine

Figure 12 1-methyl-Guanosine

48

(AdoMet)85. AdoMet-dependent methylation often involves SN2 substitution reactions in which

the substrate attacks on the reactive methyl group in AdoMet, yielding a single stereoisomeric

product.

In bacteria, methylation of G37 is accomplished by the TrmD enzyme, which is

composed of 255 amino acids (28.4 kD) and acts as a homodimer. The active site is formed by

residues from each of the monomers86. The structure contains a unique trefoil knot deep within

each of the homodimers whose function is not fully understood. When AdoMet binds to the

structure, it causes conformational changes. The adenosine moiety of AdoMet is surrounded by a

loop and AdoMet is in a bent conformation85.

It has been proposed that the TrmD homodimer requires the anticodon loop of tRNA to

be precisely positioned near AdoMet for the methylation of G37 since bound AdoMet is located

in a deep, seemingly inaccessible pocket near the center of TrmD85. The conserved

phenylalanine residue is proposed to provide critical stacking interactions required for the final

positioning of G3786. It has been proposed that an aspartate of TrmD acts as a general base while

other conserved residues provide G37 specificity for the reaction allowing methylation of the N1

position of G via a deprotonation, followed by the transfer of the methyl group of AdoMet85.

TrmD binds to tRNAs non-specifically searching for its recognition element, the

dinucleotide G36-G3786. Once TrmD recognizes the two consecutive G residues at the 3’ end of

the anticodon, AdoMet-dependent methylation occurs86. tRNA is positioned in the cleft and, if

the identity element G36pG37 is present, G36pG37 might be ‘flipped’ into the dual pocket

catalytic center for subsequent methylation by bound AdoMet85-86. If a G is not found at position

36, then a stable complex of G37 near AdoMet cannot form and catalysis does not proceed86.

49

The bacterial TrmD enzyme is substantially different from its eukaryotic homolog, Trm5,

raising the possibility of targeting TrmD with antibiotics87. The two enzymes show no detectable

similarity in sequence or structure. While TrmD is an obligate dimer, Trm5 is a monomer88.

TrmD binds AdoMet with the rare trefoil knot, Trm5 binds AdoMet in the open space of a

dinucleotide-fold. As noted above, TrmD binds AdoMet in a bent conformation, where the

adenosine and methionine moieties are bent toward each other by 90° whereas Trm5 binds

AdoMet in the more common straight conformation, with an 180° angle between the two

moieties86, 89. The fact that these enzymes have different active site geometries and both bind

small molecules makes them attractive drug targets, and pharmaceutical companies are working

on developing small molecule inhibitors of TrmD84. Loss of TrmD activity is lethal in E. coli;

this is unusual for tRNA modification enzymes, most of which can be deleted with little or no

detectable phenotype. Although increased levels of frameshifting are observed when TrmD and

m1G37 are lost, it is not entirely clear what the mechanism of cell death is. To obtain a better

understanding of protein synthesis in living cells upon depletion of G37 methylation, we

performed ribosome profiling.

3.3 Results

3.3.1 Achieving Regulation of TrmD, an Essential Protein

An inducible TrmD-depletion E. coli strain was constructed to observe defects in

translation and identify genes whose expression is altered when m1G37 of tRNA is depleted in

cells. Since m1G37 is an essential modification of several tRNAs and deletion of trmD is lethal,

we constructed a strain in which TrmD can be conditionally depleted, starting with the G78

strain90. In G78, proteolysis of certain target proteins can be regulated by controlling the

50

expression of the peptide recognition protein, ClpX, which binds to hydrophobic C-terminal

peptide tags, and Clp, a protease. The ClpXP complex degrades proteins tagged by tmRNA as

well as many other proteins with C-termini recognized by the complex91. In the G78 strain, the

ClpXP genes are regulated by an arabinose inducible promoter, allowing us to regulate its

expression by changing media conditions90. We also express TrmD from a constitutive promoter

with the tmRNA tag translationally fused to its C terminus; this effectively targets the protein for

degradation when ClpXP is expressed upon induction with arabinose.

To construct the TrmD-depeletion strain, the trmD gene, which is expressed with another

three genes on an operon, is modified at its 3’-end to contain a peptide tag (Figure 13). This tag

contains a sequence that targets it for degradation by ClpXP (YALAA) as well as His6 and

FLAG epitopes (Figure

13)90. This is called the

degron strain (with the

Deg-tag); the control

strain also contains the

epitope tags but not the

YALAA degradation

signal (the Cont-tag).

The trmD mutations

were first established by

λ red recombination in

E. coli strain SM140590. Then, the trmD alleles with the added degradation tag or control tag are

Figure 13 TrmD depletion strain construction

51

verified by PCR, and then the alleles were transduced using P1 phage in the G78 strain in which

ClpXP can be regulated, generating the trmD-deg and trmD-cont strains (Figure 13).

The two strains were tested by growing on agar plates (Figure 14, top). When cells were

plated on media containing glucose, both strains formed colonies (data not shown). However,

when cells were grown on the plate with 0.2%

arabinose added, there were significant

differences in colony formation. The degron

strain (trmD-deg) formed fewer colonies than

the control strain, suggesting that the tagged

TrmD was successfully depleted upon ClpXP

overexpression, arresting cell growth. In

addition, steady state levels of TrmD were

visualized by western blot (Figure 14, bottom). Cells were first grown in LB media (with trace

amount of glucose) to suppress ClpXP expression. Upon induction with arabinose (final

concentration of 0.2%), the cells were collected and subjected to western blot assay with various

time points. The western blot shows that TrmD levels drop significantly in the cell between 30 –

40 min after ClpXP overexpression.

Having confirmed that colonies fail to grow when TrmD is depleted, and that the

depletion kinetics are fairly rapid, growth curves were obtained to determine the optimal time

after induction for harvesting the cultures for ribosome profiling. Surprisingly, however, the

growth curves of degron and control strains did not show much differences (data not shown). We

hypothesized that the slow decay of residual methylated tRNAs, produced prior to TrmD

Figure 14 TrmD depletion

Cell growths on plate (Top) and western

blot for tagged proteins (Bottom).

52

depletion, left enough tRNA for the cells to continue to function and grow relatively normally

over short time frame.

By diluting the

culture twice back to an

OD600 of 0.1, we found that

differences in growth rates

between the degron and

control strains could be

observed after about 4 h of

induction of the ClpXP

protease (Figure 15).

Without the dilutions, the

cultures reached saturation

at nearly the same rates.

With the dilutions,

however, differences could

be observed when the cultures were still growing in exponential phase. It was important not to

reach saturation because starvation for certain amino acids ensues during stationary phase,

creating a signal of pausing at those codons in the ribosome profiling data that would interfere

with our analyses of the effects of depletion of trmD (Figure 15).

We then monitored the level of m1G37 over the 4 h induction time course. A 14-nt long

primer complementary to tRNALeu/CAG was used for extension by reverse transcriptase (Figure

16, left). Methylation blocks extension, generating a 15 nt product. In the absence of methylation

Figure 15 Growth measurements upon TrmD depletion

Cell growth in LB media. At Time 0, exponentially growing cells

were diluted in fresh media with 0.2% arabinose for ClpXP

induction. Additional dilutions of cell culture media to fresh media

after 1h and 2h.

53

at G37, a 53 nt product is generated. In the degron strain, a gradual depletion m1G37 methylated

tRNA was observed (Figure 16, right). At the same time, newly synthesized tRNA is

unmethylated, as evidenced by the gradual accumulation of the 53 nt product over time in the

degron strain (Figure 16, right). No unmethylated tRNA was evident in the control strain.

3.3.2 Depletion of m1G37 of tRNA Causes Pauses on CCG, CCA, and CGG Codons

Ribosome profiling libraries were generated from cultures of the control and degron

strains after 4 h of induction. We analyzed the data for pauses at specific codons that are read by

tRNAs normally containing m1G37. To detect pauses, we averaged the ribosome occupancy

surrounding instances of the 61 sense codons. Plots of average ribosome occupancy for the

control strain, for example, show very strong pauses when Ser codons are found in the ribosomal

A, P, and E sites (Figure 17). In these plots (Figure 17 and 18), the signal at 0 corresponds to the

Figure 16 Effect of TrmD depletion on tRNA

Primer extension assay using tRNAs as templates for verifying methylation depletion on tRNA.

54

codon is positioned in the P site, in the A site at -3, and in the E site at +3. (Note that the noise

between -10 and -20 corresponds to sequence bias at the 3’-end of the reads and can be ignored).

The Ser codons TCG, TCC, TCT, and TCA have the strong signals in the control strain, shown

in color (Figure 17). Strong pauses on serine codons are a signature of growth in LB media that

is frequently observed in E. coli ribosome profiling. Presumably this occurs because levels of Ser

within the cell are limiting for protein synthesis.

In the degron strain (Figure 18), on the other hand, the usual Ser pauses are not observed

and other pauses have become rate limiting. Ribosome occupancy is strongly enriched at CCG,

CCA, and CGG codons. This result, seemingly representing the effect of depletion of tRNAs

containing m1G37, makes sense because all three of these codons are decoded by tRNAs with

m1G37. Perhaps ribosomes pause at these codons because the concentration of tRNA is reduced

or because unmethylated tRNA behaves poorly in decoding or translocation.

Figure 17 Ribosome occupancy on codons in control strain

High level of ribosome occupancy on some codons (colored) compared to other codons (grey).

55

What is surprising is that only these three codons have strong pauses in the profiling data

from the degron strain, when seven or perhaps nine codons are predicted to be decoded by

tRNAs containing m1G37. Table 1 lists the codons that start with C and are decoded by tRNA

isoacceptors that are candidate substrates of TrmD. Note that the CAN codons encoding His and

Gln are not shown in the table; the corresponding tRNAs contain A37 rather than G37 and are

therefore not substrates of TrmD; pauses are not observed on these codons. The same thing is

true for three of the CGN Arg codons: no pauses are observed for the CGU, CGC, and CGA,

codons that are decoded by a tRNA containing an ICG anticodon with A37 and not G37 and so

not a TrmD substrate. In contrast, the CGG Arg codon whose cognate tRNA has m1G37

modification shows very strong pauses, as noted above. This is perhaps the simplest case.

Figure 18 Ribosome occupancy on codons in degron strain

High level of ribosome occupancy on some codons (colored) compared to other codons

(grey).

56

Surprisingly, the CUN leucine codons do not show pausing, even though they are read by

tRNAs thought to contain m1G37. There is some controversy in the literature over whether E.

coli tRNALeu/GAG (green) is actually methylated; this tRNA decodes the CUC and CUU codons.

Further work will be required to ascertain the methylation status of G37 of this tRNA, but our

data may argue against it. Another possibility is that because both the GAG and CAG anticodon

isoacceptors for tRNALeu have pseudouridine at position 38, they may be able to perform correct

base pairing even when m1G37 is depleted. Only these two tRNAs have pseudouridine at

position 38 among the isoacceptors in Table 1.

The pausing on Pro codons is likewise surprising. All three tRNAPro isoacceptors are

bona fide substrates for TrmD; why are strong pauses observed only at CCG and CCA? One

thought is that the GGG

isoacceptor that reads the

other two codons (CCC and

CCU) can maintain some

activity even in the absence

of methylation. The

possibilities for

frameshifting are increased

dramatically with this

isoacceptor (which has the

four nt anticodon GGGG in

the absence of methylation). Perhaps pausing on the CCC and CCU codons leads to rapid

frameshifting so that strong pauses are not observed.

Codon Anti-codon

Leucine

CUC

GAGm1G37

CUA

CUG CAGm1G37

CUU

Proline

CCU GGGm1G37

UGGm1G37 CCC

CCA

CCG CGGm1G37

Arginine

CGU

ICG

CGC

CGA

CGG CCGm1G37

Table 1 Codons and their anti-codons with modifications

Codons with most ribosome occupancy (red) and isoacceptors with

m1G37 modification.

57

3.4 Materials and Methods

3.4.1 Strains and Construction

The figure 13 shows the scheme of constructing the trmD-deg and trmD-cont cells. First,

the degron- and control-tag followed by tetracycline marker were amplified from plasmids

created by Carr et al, provided by Dr. Sean Moore90.

The primers contain homologous extensions at 5' ends to the trmD-flanking regions

(Forward: CGGAACACGCACAACAGCAACATAAACATGATGGGATGGCGGGTGGCTCC

GACTACAAGG, Reverse: ATAATTTAATCTCTTATCCTGGGTAAACTGATATCTCGGGG

GCTTAGGTCGAGGTGGCCC).

PCR products were electroporated into E. coli recombinogenic strain SM140590 and

homologous recombination was confirmed by colony-PCR using primers targeting 5'-terminal

and 3'-flanking regions of trmD (Forward: ATGTGGATTGGCATAATTAGCCTGTTT

CC, Reverse: GAATTCCGGTTACGAATAGCGATAACCACGCC).

trmD-deg SM1405 and trmD-cont SM1405 were used as a donor for P1 phage lysate

preparation, and the tagged chromosomal trmD loci were transferred by P1 transduction to the

recipient E. coli strain G78, in which the chromosomal clpX gene is knocked out. After

tetracycline marker selection, genotype was confirmed again by colony-PCR.

As described90, trmD-deg G78 strain was then transformed with a library of pClpPX

which contains random mutations at the promoter region of the clpPX genes, and transformant

clones were screened out to pick up the one with highest efficiency of degradation, as proven

also by the rapid degradation at protein level. This specific plasmid clone that confers high

degradation efficiency was extracted from trmD-deg G78 and was transformed into trmD-cont

G78 strain. This pair of trmD-cont G78 and trmD-deg G78 is used for all the experiments.

58

3.4.2 Cell Growth and Preparation of Ribo-seq Libraries

Each cell from degron- and control-strain was grown in LB media supplemented with

0.2% glucose, 0.2% glycerol and an appropriated antibiotic at 37° C for overnight. The next day,

fresh 10-mL LB media supplemented with 0.002% glucose, 5 mM serine and an appropriate

antibiotic were inoculated with overnight cell culture media with 100-fold dilution, followed by

incubation for 1.5-2 h shaking at 37° C.

By measuring the O.D600 to reach up to 0.3-0.4, cell cultures were diluted aiming O.D600

≈ 0.1 in fresh 300 mL LB media supplemented with 0.2% arabinose, 5 mM serine and an

appropriate antibiotic, followed by incubation for 1 h shaking at 37° C. Then, by measuring the

O.D600, cell cultures were further diluted in 300-mL LB media supplemented with 0.2%

arabinose, 5 mM serine and an appropriate antibiotic and this was repeated once more after

another hour. Finally, when the O.D600 reached up to 0.3, cells were harvested by filtering cell

culture using 49 µM filter paper and the cell pellet was flash frozen in liquid nitrogen.

Ribo-seq libraries were generated as described previously66 with some modifications.

Ribosome-protected fragments were size-selected between 10 – 40 nt. Cell lysates were digested

with 3000 U of MNase (Roche) for 1 h at 25 °C.

3.4.3 Data Analysis

Data analysis obtained from Illumina sequencing were described previously66.

59

REFERENCES

1. Ibba, M.; Soll, D., Aminoacyl-tRNAs: setting the limits of the genetic code. Genes Dev

2004, 18 (7), 731-8.

2. Ramakrishnan, V., Ribosome structure and the mechanism of translation. Cell 2002, 108

(4), 557-72.

3. Boelens, R.; Gualerzi, C. O., Structure and function of bacterial initiation factors. Curr

Protein Pept Sci 2002, 3 (1), 107-19.

4. Steitz, J. A.; Jakes, K., How ribosomes select initiator regions in mRNA: base pair

formation between the 3' terminus of 16S rRNA and the mRNA during initiation of protein

synthesis in Escherichia coli. Proc Natl Acad Sci U S A 1975, 72 (12), 4734-8.

5. (a) Shine, J.; Dalgarno, L., The 3'-terminal sequence of Escherichia coli 16S ribosomal

RNA: complementarity to nonsense triplets and ribosome binding sites. Proc Natl Acad Sci U S A

1974, 71 (4), 1342-6; (b) Marzi, S.; Myasnikov, A. G.; Serganov, A.; Ehresmann, C.; Romby, P.;

Yusupov, M.; Klaholz, B. P., Structured mRNAs regulate translation initiation by binding to the

platform of the ribosome. Cell 2007, 130 (6), 1019-31.

6. Shatkin, A. J., Capping of eucaryotic mRNAs. Cell 1976, 9 (4 PT 2), 645-53.

7. Carter, A. P.; Clemons, W. M., Jr.; Brodersen, D. E.; Morgan-Warren, R. J.; Hartsch, T.;

Wimberly, B. T.; Ramakrishnan, V., Crystal structure of an initiation factor bound to the 30S

ribosomal subunit. Science 2001, 291 (5503), 498-501.

8. Ogle, J. M.; Carter, A. P.; Ramakrishnan, V., Insights into the decoding mechanism from

recent ribosome structures. Trends Biochem Sci 2003, 28 (5), 259-266.

9. Diaconu, M.; Kothe, U.; Schlunzen, F.; Fischer, N.; Harms, J. M.; Tonevitsky, A. G.; Stark,

H.; Rodnina, M. V.; Wahl, M. C., Structural basis for the function of the ribosomal L7/12 stalk in

factor binding and GTPase activation. Cell 2005, 121 (7), 991-1004.

10. Berchtold, H.; Reshetnikova, L.; Reiser, C. O.; Schirmer, N. K.; Sprinzl, M.; Hilgenfeld,

R., Crystal structure of active elongation factor Tu reveals major domain rearrangements. Nature

1993, 365 (6442), 126-32.

11. Nissen, P.; Kjeldgaard, M.; Thirup, S.; Clark, B. F.; Nyborg, J., The ternary complex of

aminoacylated tRNA and EF-Tu-GTP. Recognition of a bond and a fold. Biochimie 1996, 78 (11-

12), 921-33.

12. Ogle, J. M.; Brodersen, D. E.; Clemons, W. M.; Tarry, M. J.; Carter, A. P.; Ramakrishnan,

V., Recognition of cognate transfer RNA by the 30S ribosomal subunit. Science 2001, 292 (5518),

897-902.

60

13. Voorhees, R. M.; Schmeing, T. M.; Kelley, A. C.; Ramakrishnan, V., The mechanism for

activation of GTP hydrolysis on the ribosome. Science 2010, 330 (6005), 835-8.

14. Blanchard, S. C.; Gonzalez, R. L.; Kim, H. D.; Chu, S.; Puglisi, J. D., tRNA selection and

kinetic proofreading in translation. Nat Struct Mol Biol 2004, 11 (10), 1008-14.

15. Voorhees, R. M.; Ramakrishnan, V., Structural basis of the translational elongation cycle.

Annu Rev Biochem 2013, 82, 203-36.

16. (a) Li, W.; Frank, J., Transfer RNA in the hybrid P/E state: correlating molecular dynamics

simulations with cryo-EM data. Proc Natl Acad Sci U S A 2007, 104 (42), 16540-5; (b) Flanagan,

J. F. t.; Namy, O.; Brierley, I.; Gilbert, R. J., Direct observation of distinct A/P hybrid-state tRNAs

in translocating ribosomes. Structure 2010, 18 (2), 257-64.

17. Dorner, S.; Brunelle, J. L.; Sharma, D.; Green, R., The hybrid state of tRNA binding is an

authentic translation elongation intermediate. Nat Struct Mol Biol 2006, 13 (3), 234-41.

18. Klaholz, B. P.; Pape, T.; Zavialov, A. V.; Myasnikov, A. G.; Orlova, E. V.; Vestergaard,

B.; Ehrenberg, M.; van Heel, M., Structure of the Escherichia coli ribosomal termination complex

with release factor 2. Nature 2003, 421 (6918), 90-4.

19. Ito, K.; Ebihara, K.; Uno, M.; Nakamura, Y., Conserved motifs in prokaryotic and

eukaryotic polypeptide release factors: tRNA-protein mimicry hypothesis. P Natl Acad Sci USA

1996, 93 (11), 5443-5448.

20. Korostelev, A. A., Structural aspects of translation termination on the ribosome. Rna-a

Publication of the Rna Society 2011, 17 (8), 1409-1421.

21. Mora, L.; Heurgue-Hamard, V.; Champ, S.; Ehrenberg, M.; Kisselev, L. L.; Buckingham,

R. H., The essential role of the invariant GGQ motif in the function and stability in vivo of bacterial

release factors RF1 and RF2. Mol Microbiol 2003, 47 (1), 267-75.

22. Freistroffer, D. V.; Pavlov, M. Y.; MacDougall, J.; Buckingham, R. H.; Ehrenberg, M.,

Release factor RF3 in E.coli accelerates the dissociation of release factors RF1 and RF2 from the

ribosome in a GTP-dependent manner. Embo J 1997, 16 (13), 4126-33.

23. Pavlov, M. Y.; Freistroffer, D. V.; MacDougall, J.; Buckingham, R. H.; Ehrenberg, M.,

Fast recycling of Escherichia coli ribosomes requires both ribosome recycling factor (RRF) and

release factor RF3. Embo J 1997, 16 (13), 4134-41.

24. Doma, M. K.; Parker, R., Endonucleolytic cleavage of eukaryotic mRNAs with stalls in

translation elongation. Nature 2006, 440 (7083), 561-4.

25. Keiler, K. C., Mechanisms of ribosome rescue in bacteria. Nat Rev Microbiol 2015, 13 (5),

285-297.

26. Pech, M.; Nierhaus, K. H., Three mechanisms in Escherichia coli rescue ribosomes stalled

on non-stop mRNAs: one of them requires release factor 2. Mol Microbiol 2012, 86 (1), 6-9.

61

27. Chadani, Y.; Ono, K.; Ozawa, S.; Takahashi, Y.; Takai, K.; Nanamiya, H.; Tozawa, Y.;

Kutsukake, K.; Abo, T., Ribosome rescue by Escherichia coli ArfA (YhdL) in the absence of trans-

translation system. Mol Microbiol 2010, 78 (4), 796-808.

28. Chadani, Y.; Ono, K.; Kutsukake, K.; Abo, T., Escherichia coli YaeJ protein mediates a

novel ribosome-rescue pathway distinct from SsrA- and ArfA-mediated pathways. Mol Microbiol

2011, 80 (3), 772-85.

29. Metzinger, L.; Hallier, M.; Felden, B., The highest affinity binding site of small protein B

on transfer messenger RNA is outside the tRNA domain. RNA 2008, 14 (9), 1761-72.

30. Miller, M. R.; Healey, D. W.; Robison, S. G.; Dewey, J. D.; Buskirk, A. R., The role of

upstream sequences in selecting the reading frame on tmRNA. BMC Biol 2008, 6, 29.

31. Karzai, A. W.; Roche, E. D.; Sauer, R. T., The SsrA-SmpB system for protein tagging,

directed degradation and ribosome rescue. Nat Struct Biol 2000, 7 (6), 449-55.

32. Keiler, K. C., Biology of trans-translation. Annu Rev Microbiol 2008, 62, 133-51.

33. Zechel, D. L.; Konermann, L.; Withers, S. G.; Douglas, D. J., Pre-steady state kinetic

analysis of an enzymatic reaction monitored by time-resolved electrospray ionization mass

spectrometry. Biochemistry-Us 1998, 37 (21), 7664-9.

34. Arava, Y.; Wang, Y.; Storey, J. D.; Liu, C. L.; Brown, P. O.; Herschlag, D., Genome-wide

analysis of mRNA translation profiles in Saccharomyces cerevisiae. Proc Natl Acad Sci U S A

2003, 100 (7), 3889-94.

35. Ingolia, N. T.; Ghaemmaghami, S.; Newman, J. R. S.; Weissman, J. S., Genome-Wide

Analysis in Vivo of Translation with Nucleotide Resolution Using Ribosome Profiling. Science

2009, 324 (5924), 218-223.

36. Ingolia, N. T.; Lareau, L. F.; Weissman, J. S., Ribosome profiling of mouse embryonic

stem cells reveals the complexity and dynamics of mammalian proteomes. Cell 2011, 147 (4), 789-

802.

37. Michel, A. M.; Choudhury, K. R.; Firth, A. E.; Ingolia, N. T.; Atkins, J. F.; Baranov, P. V.,

Observation of dually decoded regions of the human genome using ribosome profiling data.

Genome Res 2012, 22 (11), 2219-2229.

38. Dunn, J. G.; Foo, C. K.; Belletier, N. G.; Gavis, E. R.; Weissman, J. S., Ribosome profiling

reveals pervasive and regulated stop codon readthrough in Drosophila melanogaster. Elife 2013,

2, e01179.

39. Guydosh, N. R.; Green, R., Dom34 rescues ribosomes in 3' untranslated regions. Cell 2014,

156 (5), 950-62.

62

40. Oh, E.; Becker, A. H.; Sandikci, A.; Huber, D.; Chaba, R.; Gloge, F.; Nichols, R. J.; Typas,

A.; Gross, C. A.; Kramer, G.; Weissman, J. S.; Bukau, B., Selective ribosome profiling reveals the

cotranslational chaperone action of trigger factor in vivo. Cell 2011, 147 (6), 1295-308.

41. Li, G. W.; Oh, E.; Weissman, J. S., The anti-Shine-Dalgarno sequence drives translational

pausing and codon choice in bacteria. Nature 2012, 484 (7395), 538-U172.

42. Mohammad, F.; Woolstenhulme, C. J.; Green, R.; Buskirk, A. R., Clarifying the

Translational Pausing Landscape in Bacteria by Ribosome Profiling. Cell Rep 2016, 14 (4), 686-

94.

43. Li, G. W.; Burkhardt, D.; Gross, C.; Weissman, J. S., Quantifying absolute protein

synthesis rates reveals principles underlying allocation of cellular resources. Cell 2014, 157 (3),

624-35.

44. Ogura, T.; Hiraga, S., Mini-F Plasmid Genes That Couple Host-Cell Division to Plasmid

Proliferation. P Natl Acad Sci-Biol 1983, 80 (15), 4784-4788.

45. (a) Bernard, P.; Kezdy, K. E.; Vanmelderen, L.; Steyaert, J.; Wyns, L.; Pato, M. L.; Higgins,

P. N.; Couturier, M., The F-Plasmid Ccdb Protein Induces Efficient Atp-Dependent DNA

Cleavage by Gyrase. J Mol Biol 1993, 234 (3), 534-541; (b) Bernard, P.; Couturier, M., Cell Killing

by the F-Plasmid Ccdb Protein Involves Poisoning of DNA-Topoisomerase-Ii Complexes. J Mol

Biol 1992, 226 (3), 735-745.

46. Tam, J. E.; Kline, B. C., The F-Plasmid Ccd Autorepressor Is a Complex of Ccda and Ccdb

Proteins. Mol Gen Genet 1989, 219 (1-2), 26-32.

47. Jaffe, A.; Ogura, T.; Hiraga, S., Effects of the Ccd Function of the F-Plasmid on Bacterial-

Growth. J Bacteriol 1985, 163 (3), 841-849.

48. Gerdes, K., Toxin-antitoxin modules may regulate synthesis of macromolecules during

nutritional stress. J Bacteriol 2000, 182 (3), 561-572.

49. Lehnherr, H.; Maguin, E.; Jafri, S.; Yarmolinsky, M. B., Plasmid Addiction Genes of

Bacteriophage-P1 - Doc, Which Causes Cell-Death on Curing of Prophage, and Phd, Which

Prevents Host Death When Prophage Is Retained. J Mol Biol 1993, 233 (3), 414-428.

50. (a) Pedersen, K.; Zavialov, A. V.; Pavlov, M. Y.; Elf, J.; Gerdes, K.; Ehrenberg, M., The

bacterial toxin RelE displays codon-specific cleavage of rnRNAs in the ribosomal A site. Cell

2003, 112 (1), 131-140; (b) Christensen, S. K.; Mikkelsen, M.; Pedersen, K.; Gerdes, K., ReIE, a

global inhibitor of translation, is activated during nutritional stress. P Natl Acad Sci USA 2001, 98

(25), 14328-14333.

51. Neubauer, C.; Gao, Y. G.; Andersen, K. R.; Dunham, C. M.; Kelley, A. C.; Hentschel, J.;

Gerdes, K.; Ramakrishnan, V.; Brodersen, D. E., The Structural Basis for mRNA Recognition and

Cleavage by the Ribosome-Dependent Endonuclease ReIE. Cell 2009, 139 (6), 1084-1095.

63

52. Hurley, J. M.; Cruz, J. W.; Ouyang, M.; Woychik, N. A., Bacterial Toxin RelE Mediates

Frequent Codon-independent mRNA Cleavage from the 5 ' End of Coding Regions in Vivo. J Biol

Chem 2011, 286 (17), 14770-14778.

53. Gamper, H. B.; Masuda, I.; Frenkel-Morgenstern, M.; Hou, Y. M., The UGG Isoacceptor

of tRNAPro Is Naturally Prone to Frameshifts. Int J Mol Sci 2015, 16 (7), 14866-83.

54. Brule, H.; Elliott, M.; Redlak, M.; Zehner, Z. E.; Holmes, W. M., Isolation and

characterization of the human tRNA-(N1G37) methyltransferase (TRM5) and comparison to the

Escherichia coli TrmD protein. Biochemistry-Us 2004, 43 (28), 9243-55.

55. Lewis, K., Persister cells, dormancy and infectious disease. Nat Rev Microbiol 2007, 5 (1),

48-56.

56. Veening, J. W.; Smits, W. K.; Kuipers, O. P., Bistability, Epigenetics, and Bet-Hedging in

Bacteria. Annu Rev Microbiol 2008, 62, 193-210.

57. Hauryliuk, V.; Atkinson, G. C.; Murakami, K. S.; Tenson, T.; Gerdes, K., Recent

functional insights into the role of (p)ppGpp in bacterial physiology. Nat Rev Microbiol 2015, 13

(5), 298-309.

58. Christensen, S. K.; Maenhaut-Michel, G.; Mine, N.; Gottesman, S.; Gerdes, K.; Van

Melderen, L., Overproduction of the Lon protease triggers inhibition of translation in Escherichia

coli: involvement of the yefM-yoeB toxin-antitoxin system. Mol Microbiol 2004, 51 (6), 1705-

1717.

59. (a) Gotfredsen, M.; Gerdes, K., The Escherichia coli relBE genes belong to a new toxin-

antitoxin gene family. Mol Microbiol 1998, 29 (4), 1065-76; (b) Christensen, S. K.; Gerdes, K.,

Delayed-relaxed response explained by hyperactivation of RelE. Mol Microbiol 2004, 53 (2), 587-

597.

60. Brzozowska, I.; Zielenkiewicz, U., Regulation of toxin-antitoxin systems by proteolysis.

Plasmid 2013, 70 (1), 33-41.

61. Gerdes, K.; Maisonneuve, E., Bacterial persistence and toxin-antitoxin loci. Annu Rev

Microbiol 2012, 66, 103-23.

62. Maisonneuve, E.; Shakespeare, L. J.; Jorgensen, M. G.; Gerdes, K., Bacterial persistence

by RNA endonucleases. Proc Natl Acad Sci U S A 2011, 108 (32), 13206-11.

63. Griffin, M. A.; Davis, J. H.; Strobel, S. A., Bacterial Toxin RelE: A Highly Efficient

Ribonuclease with Exquisite Substrate Specificity Using Atypical Catalytic Residues.

Biochemistry-Us 2013, 52 (48), 8633-8642.

64. Schifano, J. M.; Vvedenskaya, I. O.; Knoblauch, J. G.; Ouyang, M.; Nickels, B. E.;

Woychik, N. A., An RNA-seq method for defining endoribonuclease cleavage specificity

identifies dual rRNA substrates for toxin MazF-mt3. Nat Commun 2014, 5.

64

65. Christensen, S. K.; Gerdes, K., RelE toxins from bacteria and Archaea cleave mRNAs on

translating ribosomes, which are rescued by tmRNA. Mol Microbiol 2003, 48 (5), 1389-400.

66. Woolstenhulme, C. J.; Guydosh, N. R.; Green, R.; Buskirk, A. R., High-Precision Analysis

of Translational Pausing by Ribosome Profiling in Bacteria Lacking EFP. Cell Rep 2015, 11 (1),

13-21.

67. (a) Nakatogawa, H.; Ito, K., The ribosomal exit tunnel functions as a discriminating gate.

Cell 2002, 108 (5), 629-636; (b) Gong, F.; Yanofsky, C., Instruction of translating ribosome by

nascent peptide. Science 2002, 297 (5588), 1864-1867.

68. Craigen, W. J.; Caskey, C. T., Expression of peptide chain release factor 2 requires high-

efficiency frameshift. Nature 1986, 322 (6076), 273-5.

69. Qian, Q.; Bjork, G. R., Structural requirements for the formation of 1-methylguanosine in

vivo in tRNA(GGG)(Pro) of Salmonella typhimurium. J Mol Biol 1997, 266 (2), 283-296.

70. (a) Holmes, W. M.; Andraos-Selim, C.; Redlak, M., tRNA-m1G methyltransferase

interactions: touching bases with structure. Biochimie 1995, 77 (1-2), 62-5; (b) Bjork, G. R.;

Wikstrom, P. M.; Bystrom, A. S., Prevention of Translational Frameshifting by the Modified

Nucleoside 1-Methylguanosine. Science 1989, 244 (4907), 986-989; (c) Bystrom, A. S.; Bjork, G.

R., Chromosomal location and cloning of the gene (trmD) responsible for the synthesis of tRNA

(m1G) methyltransferase in Escherichia coli K-12. Mol Gen Genet 1982, 188 (3), 440-6.

71. Bystrom, A. S.; Bjork, G. R., The structural gene (trmD) for the

tRNA(m1G)methyltransferase is part of a four polypeptide operon in Escherichia coli K-12. Mol

Gen Genet 1982, 188 (3), 447-54.

72. Christian, T.; Gamper, H.; Hou, Y. M., Conservation of structure and mechanism by Trm5

enzymes. Rna-a Publication of the Rna Society 2013, 19 (9), 1192-1199.

73. Li, Z. W.; Deutscher, M. P., Maturation pathways for E-coli tRNA precursors: A random

multienzyme process in vivo. Cell 1996, 86 (3), 503-512.

74. Frank, D. N.; Pace, N. R., Ribonuclease P: Unity and diversity in a tRNA processing

ribozyme. Annual Review of Biochemistry 1998, 67, 153-180.

75. Li, Z. W.; Deutscher, M. P., RNase E plays an essential role in the maturation of

Escherichia coli tRNA precursors. Rna-a Publication of the Rna Society 2002, 8 (1), 97-109.

76. Perona, J. J.; Rould, M. A.; Steitz, T. A., Structural basis for transfer RNA aminoacylation

by Escherichia coli glutaminyl-tRNA synthetase. Biochemistry-Us 1993, 32 (34), 8758-71.

77. Rould, M. A.; Perona, J. J.; Steitz, T. A., Structural basis of anticodon loop recognition by

glutaminyl-tRNA synthetase. Nature 1991, 352 (6332), 213-8.

65

78. (a) Charette, M.; Gray, M. W., Pseudouridine in RNA: What, where, how, and why. Iubmb

Life 2000, 49 (5), 341-351; (b) Davis, F. F.; Allen, F. W., Ribonucleic acids from yeast which

contain a fifth nucleotide. J Biol Chem 1957, 227 (2), 907-15.

79. Dalluge, J. J.; Hashizume, T.; Sopchik, A. E.; McCloskey, J. A.; Davis, D. R.,

Conformational flexibility in RNA: The role of dihydrouridine. Nucleic Acids Res 1996, 24 (6),

1073-1079.

80. Urbonavicius, J.; Qian, Q.; Durand, J. M.; Hagervall, T. G.; Bjork, G. R., Improvement of

reading frame maintenance is a common function for several tRNA modifications. Embo J 2001,

20 (17), 4863-73.

81. Nasvall, S. J.; Chen, P.; Bjork, G. R., The modified wobble nucleoside uridine-5-oxyacetic

acid in tRNAPro(cmo5UGG) promotes reading of all four proline codons in vivo. RNA 2004, 10

(10), 1662-73.

82. Gamper, H. B.; Masuda, I.; Frenkel-Morgenstern, M.; Hou, Y. M., Maintenance of protein

synthesis reading frame by EF-P and m(1)G37-tRNA. Nat Commun 2015, 6, 7226.

83. Bregeon, D.; Colot, V.; Radman, M.; Taddei, F., Translational misreading: a tRNA

modification counteracts a+2 ribosomal frameshift. Gene Dev 2001, 15 (17), 2295-2306.

84. Christian, T.; Evilia, C.; Williams, S.; Hou, Y. M., Distinct origins of tRNA(m1G37)

methyltransferase. J Mol Biol 2004, 339 (4), 707-19.

85. Ito, T.; Masuda, I.; Yoshida, K.; Goto-Ito, S.; Sekine, S.; Suh, S. W.; Hou, Y. M.;

Yokoyama, S., Structural basis for methyl-donor-dependent and sequence-specific binding to

tRNA substrates by knotted methyltransferase TrmD. Proc Natl Acad Sci U S A 2015, 112 (31),

E4197-205.

86. Elkins, P. A.; Watts, J. M.; Zalacain, M.; van Thiel, A.; Vitazka, P. R.; Redlak, M.;

Andraos-Selim, C.; Rastinejad, F.; Holmes, W. M., Insights into catalysis by a knotted TrmD

tRNA methyltransferase. J Mol Biol 2003, 333 (5), 931-949.

87. O'Dwyer, K.; Watts, J. M.; Biswas, S.; Ambrad, J.; Barber, M.; Brule, H.; Petit, C.; Holmes,

D. J.; Zalacain, M.; Holmes, W. M., Characterization of Streptococcus pneumoniae TrmD, a tRNA

methyltransferase essential for growth. J Bacteriol 2004, 186 (8), 2346-54.

88. Christian, T.; Hou, Y. M., Distinct determinants of tRNA recognition by the TrmD and

Trm5 methyl transferases. J Mol Biol 2007, 373 (3), 623-32.

89. Goto-Ito, S.; Ito, T.; Ishii, R.; Muto, Y.; Bessho, Y.; Yokoyama, S., Crystal structure of

archaeal tRNA(m(1)G37)methyltransferase aTrm5. Proteins 2008, 72 (4), 1274-89.

90. Carr, A. C.; Taylor, K. L.; Osborne, M. S.; Belous, B. T.; Myerson, J. P.; Moore, S. D.,

Rapid depletion of target proteins allows identification of coincident physiological responses. J

Bacteriol 2012, 194 (21), 5932-40.

66

91. Gottesman, S.; Roche, E.; Zhou, Y.; Sauer, R. T., The ClpXP and ClpAP proteases degrade

proteins with carboxy-terminal peptide tails added by the SsrA-tagging system. Genes Dev 1998,

12 (9), 1338-47.

Analyses of mRNA Cleavage by RelE and the Role of tRNA ...

Documents

Transcript of Analyses of mRNA Cleavage by RelE and the Role of tRNA ...