EF-P Dependent Pauses Integrate Proximal and Distal Signals during Translation
Elongation factor P (EF-P) is a well-conserved bacterial protein. Although it can enhance protein synthesis in vitro, it is generally regarded as an ancillary factor required for robust translation of transcripts with stretches of consecutive prolines. In this work we performed ribosome profiling to better understand the role of EF-P during translation. Our data confirmed that translational effects due to lack of EF-P are mainly confined to PPX–encoding genes. Wide variations in EF-P dependent translation of these PPXs led us to investigate the effect of sequences upstream of diproline-containing motifs. We found that amino acids encoded upstream of PPX play a key role in EF-P-dependent translation. Finally, comparison of ribosome profiling data to existing proteomic data indicates that although many PPX-containing patterns have increased ribosome occupancies, this does not necessarily lead to altered protein levels. Taken together these data show a direct role for EF-P during synthesis of PPX motifs, and indirect effects on other post-transcriptional regulators of gene expression.
Published in the journal:
. PLoS Genet 10(8): e32767. doi:10.1371/journal.pgen.1004553
Category:
Research Article
doi:
https://doi.org/10.1371/journal.pgen.1004553
Summary
Elongation factor P (EF-P) is a well-conserved bacterial protein. Although it can enhance protein synthesis in vitro, it is generally regarded as an ancillary factor required for robust translation of transcripts with stretches of consecutive prolines. In this work we performed ribosome profiling to better understand the role of EF-P during translation. Our data confirmed that translational effects due to lack of EF-P are mainly confined to PPX–encoding genes. Wide variations in EF-P dependent translation of these PPXs led us to investigate the effect of sequences upstream of diproline-containing motifs. We found that amino acids encoded upstream of PPX play a key role in EF-P-dependent translation. Finally, comparison of ribosome profiling data to existing proteomic data indicates that although many PPX-containing patterns have increased ribosome occupancies, this does not necessarily lead to altered protein levels. Taken together these data show a direct role for EF-P during synthesis of PPX motifs, and indirect effects on other post-transcriptional regulators of gene expression.
Introduction
During protein synthesis each amino acid is detached from an aminoacyl-tRNA and incorporated into the nascent peptide. Although the basic peptidyl transfer reaction is the same for all amino acids, the speed of incorporation is not uniform. It is affected by several factors including the abundance of each individual aminoacyl-tRNA, the structure of the incorporated amino acid, and structural features of the mRNA and the nascent peptide. For example, mRNA sequences upstream of the peptidyl-tRNA site (P site) codon that interact with the anti Shine Dalgarno sequence (aSD) from 16S rRNA [1] or regions from the nascent peptide that interact with the ribosome exit tunnel, have been shown to slow translation [2]–[4].
Decreasing the speed of translation, or even pausing it, can have important roles in protein synthesis. For instance, sequence context dependent pausing during translation of secM is known to regulate synthesis of the membrane protein SecA [5]. More broadly, changes in translation speed can affect co-translational folding of proteins, controlling not only the fraction of active protein [6], but potentially also providing new functionality through alternative folds [7]. Although in these cases translation pausing has beneficial physiological roles, in other cases it could be detrimental if it significantly decreases the efficiency of protein synthesis. Accordingly, patterns that induce ribosome pausing are often excluded from coding regions [1], [4]. Exceptions to this include PPP and PPG sequences. This is most probably due to the presence of elongation factor P (EF-P) [4], a protein that has been recently shown to prevent the pauses produced by these and other sequences, most of which contain a PP motif [8]–[11]. It has been described that in the absence of EF-P, mRNA coding for PPG will pause with Gly-tRNAGly located at the A site of the ribosome and peptidyl-tRNAPro at the P site [8]. A similar effect has been reported for PPP sequences that pause with the second Pro at the P site [4].
EF-P is a remarkable example of molecular mimicry. The protein is similar in shape and size to a tRNA and interacts with the ribosome via the exit (E) site on the 30S subunit and the peptidyl-transferase center (PTC) of the 50S subunit (PDB 1UEB, [12]–[14]). Presumably during a PPX-induced stall in translation the E-site tRNA is ejected, allowing EF-P access to the ribosome where its N-terminal domain can insert into the PTC to re-initiate synthesis. EF-P activity requires post-translational modification by the addition of (R)-β-lysine to Lys34 (Escherichia coli numbering) in a reaction that is catalyzed by PoxA, a paralog of the catalytic domain of lysyl-tRNA synthetase [13], [15]. Further hydroxylation of EF-P Lys34 has also been observed, but the role of EF-P hydroxylation is unclear as no adverse fitness effects have been found in its absence [11], [16], [17]. EF-P is homologous to the eukaryotic eIF5A protein, which is post-translationally modified at an analogous lysyl residue and has also been shown to stimulate the synthesis of proteins containing polyproline motifs [18], [19].
While PPP, PPG and some other PPX sequences (where X represents any of the 20 proteinogenic amino acids) usually trigger pausing in vitro in the absence of EF-P, this is not always true in vivo. We and others observed that the total levels of most proteins containing PPP or PPG sequences are not affected by the loss of EF-P in either E. coli or Salmonella [10], [11]. For example the atpA and atpD genes both encode a PPG motif, but proteomic analysis demonstrates that only AtpD levels are affected by efp deletion [10], [11]. Proteomic approaches are sensitive to protein degradation and synthesis, making it difficult to distinguish the underlying cause of changes in protein levels [20], which could also result from ribosome pausing or indirect changes in regulatory proteins upon efp deletion. To address how EF-P regulates protein levels in the cell, here we apply ribosome profiling to globally identify the set of sequences that trigger ribosomal pausing in the absence of EF-P. We analyzed this dataset in conjunction with earlier proteomic data to define the specific features that differentiate the PPX sequences that produce pausing during translation from those that do not. Our data indicate that pausing-potential is largely influenced by the local context of the PPX pattern, and that specific amino acids upstream of the PPX motif can modulate whether or not a particular A site residue can trigger a stall.
Results
Ribosome profiling of wild type, Δefp and Δefp complemented E. coli strains
Ribosome profiling, or ribo-seq, is a genome-wide, quantitative analysis of ribosome occupancy in vivo by nuclease footprinting and deep sequencing. It can map the precise position and density of ribosomes on transcripts, and provides a direct readout of which sequences cause stalling [21]. Ribo-seq was performed for wild type E. coli, Δefp and Δefp complemented strains (Δefp pEF-P, complemented with a plasmid expressing efp). Cells were harvested at mid-log phase and collected by rapid filtration followed by rapid freezing in liquid nitrogen [22]. Nuclease-treated (footprints) and untreated (total) mRNA samples were processed for each of the strains. The correlation between the two biological replicates for each strain was between 96–98% (Fig. S1).
Increased ribosome occupancy of diprolyl-encoding messages
EF-P prevents translational pausing during synthesis of some polyproline-containing proteins [8]–[11]. E coli has over 2000 PPX motifs encoded in its genome, of which 913 had significant reads in our ribo-seq data (i.e. with a coverage of at least 3 sequencing reads per codon). Translational pauses cause the accumulation of ribosomes at the pausing site and increased density in ribosome profiling (i.e. a significant increase in sequence reads at the pause site compared to reads obtained at neighboring regions of the same transcript). It has been previously observed that at strong pauses such accumulations produce at least a ten fold increase in ribosome density at the pause site when compared to the full gene ribosome density [1]. We analyzed the pausing tendency of each PPX site by measuring the ratio of ribosome density between the PPX and the full gene and refer to this as the pausing index. In the wild-type strain only 14.6% of the PPX motifs had a pausing index above 2 compared to 50.4% of the PPX motifs in the Δefp strain (Fig. 1A). By more stringent criteria, only 0.22% of PPX motifs had a pausing index higher than 10 in the wild-type strain compared to 2.8% in the Δefp strain. Table 1 shows the 26 PPX motifs where a diprolyl or triprolyl sequence had a pausing index higher than 10 in the Δefp strain. Proteomic data from SILAC showed that not all of these proteins had a significant difference in protein levels between WT and Δefp strains [10]. Although both experiments were performed using different growth media, comparison of these data sets suggest that E. coli can compensate for decreased translation efficiency by other mechanisms related to changes in mRNA levels or protein stability. Table S1 shows the pausing index for the 16 proteins that both contain a PPX sequence and displayed at least three fold higher protein abundance in wild-type vs. the Δefp strain in the E. coli SILAC dataset [10].
Although Δefp strains showed increased ribosome occupancy at most of the PPX-encoding sequences (Fig. 1B), only a small subset of these genes had a pausing index high enough above the threshold to be considered as strong pauses (i.e. having a pausing index 10 fold above the gene average [1] (Fig. 1A). This variability holds true for all PPX patterns including many of the PPP or PPG sequences that have been reported to produce strong translation pauses in the absence of EF-P [8], [9], [11] (Fig. S2). Notable examples include ubiD and malE; both have a PPK sequence, but only translation of ubiD pauses at this position in the Δefp strain (Fig. 1C). This indicates that other factors influence the tendency of the ribosome to pause at a particular PPX sequence. To investigate what other sequence determinants contribute to pausing we compared the strong EF-P dependent pauses (defined as regions with a pausing index of at least 10 [1] in the Δefp strain) with the PPX sequences that have the lowest pausing index in the Δefp strain (with a pausing index equal or below 1). 31 EF-P dependent pausing sites were identified, 26 of which (distributed in 22 genes) contained a PPX motif (Table 1). The five other EF-P alleviated pauses contained no PPX motif (Table S2), consistent with our previous observation of EF-P mediated relief of non-PPX pauses such as the GSCGPG motif in the poxB gene [11]. These five non-PPX containing genes were further investigated by introducing the sequence coding for the pausing segment into a GFP reporter system. In this reporter GFP is in a transcriptional fusion to mCherry, which has a separate Shine-Dalgarno sequence and serves as an internal control (Fig. S3A, [11]). After inserting these non-PPX motifs into the reporter system (at the amino terminus of GFP, between codons 3 and 4), the EF-P dependency could not be reproduced (Fig. S3B). Other longer sequences were additionally tested without positive results (Table S2), suggesting that these pauses might depend at least in part on sequence features outside the cloned segments.
Common patterns in PPX pausing motifs
The large variability in PPX-mediated pausing patterns revealed by the ribo-seq data (Fig. S2) led us to search for additional sequence features that might affect pausing at PPX sequences. We compared the sets of well-defined pausing and non-pausing motifs (Tables 1 and S3, respectively). Some patterns such as PPD or PPN were only found in the pausing PPX sequences while PPQ or PPK were present in both gene sets. When comparing alignments of the amino acid or nucleotide sequences, we were unable to identify any common patterns within either the pausing or the non-pausing PPX sequences (Fig. S4). It has been proposed that several translation pauses do not depend purely on one mechanism, but instead integrate different signals that slow down translation [4]. To investigate if other known mechanisms might contribute to pausing, the role of Shine-Dalgarno (SD) sequences upstream of the PPX sequence, the utilization of low usage tRNAs at the A site codon, and combinations of specific amino acids at the A site and upstream of PPX were tested.
Exploring the role of the weak internal Shine-Dalgarno sequences
Most translation pauses in wild type E. coli are due to interactions of the mRNA's coding region with the anti-SD (aSD) sequence of the 16S rRNA [1]. We reasoned that having a motif capable of interacting with the aSD upstream of a PPX might contribute to pausing. The RNAsubopt program in the Vienna RNA package [23] was used to search for the presence of nucleotide sequences upstream of the PPX coding region that are predicted to have affinity for the aSD sequence (5′-CACCUCCU-3′), referred to here as aSD-weak binding sequences. Several paused PPX sequences also contain a sequence 7 to 9 bases upstream of the third position of the X codon from PPX predicted to weakly bind the aSD. The median affinity of these sequences for the aSD was ∼−2 Kcal/mol, about half of the minimum affinity found to produce an increased pausing index by itself in previous studies (4 to 12 Kcal/mol) [1]. It is possible that these low affinities could enhance the ability of PPX sequences to produce a pause in translation, a hypothesis that was supported by the absence of these aSD-weak binding sequences upstream of PPXs that do not produce a pause (Fig. S5A). Another possible feature of pausing patterns might be the use of rare codons that could slow translation and increase the strength of pauses [24], [25]. Consistent with this, when analyzing the codons used for PPX patterns, the stronger EF-P dependent pauses frequently use rare tRNAs for decoding the codons at the A site (Fig. S5B).
To further determine the possible role of the aSD-weak binding sequences and the use of rare codons several sequences were introduced at the amino terminus of a GFP reporter (Fig. S3A) and tested for their effect on translation in WT and Δefp E. coli strains. GFP fluorescence values were normalized against the fluorescence of mCherry encoded on the same transcript immediately downstream of gfp. As the diverse sequence patterns are introduced at the beginning of the gfp sequence (between the 3rd and 4th codons) we do not expect that pauses have substanital effects on protein folding. Thus, the expectation is that most of the changes in GFP production associated with efp deletion will come from the reduction in the number of ribosomes able to cross the pausing site. This is in accordance with previous reports where comparable experiments correlated well with changes in the level of protein [10], [11]. Several codon variations coding for PEPPK were tested, a translation pause site in ubiD, and PNPPK found at a non-pausing segment of malE (Figs. 1C, 2 and Tables 1 and S3). Sequences encoding PEPPK are predicted using RNAsubopt to bind to the aSD with affinities ranging from −5 to 0 kcal/mol. Conversely, all PNPPK variants present a binding energy of 0 kcal/mol and are predicted to be easily translated by the ribosome (Fig. 2A). In addition, plasmids bearing PDPPK and PQPPK sequences were constructed as controls for the role of an acidic versus an amide containing amino acid 2 positions upstream of the P site amino acid of pausing ribosomes. GFP levels did not correlate with the sequence affinity for the ribosome aSD, indicating that this does not play a significant role in EF-P dependent pausing for P[E/D/Q/N]PPK sequences (Fig. 2A). Also, similar constructs using diverse codons at the pausing A site position did not show any effect of low usage codons (Fig. S6). Instead, a consistent tendency was observed of decreased translation efficiency for clones bearing an acidic amino acid at position −2 with regard to the Pro at the ribosome P site independent of aSD affinity (Fig. 2B). This pausing was also observed with other basic amino acids (Arg or His) at the A site position (Fig. 3A) and was independent of codon usage (Fig. S6).
The identity of the amino acid immediately upstream of the diprolyl motif can influence the translation pausing efficiency
The finding that PP-basic pausing depends on the identity of the amino acid 2 positions upstream of the P site position (Z−2 on Z−2P−1PPXA), suggests a possible role of this position in determining the A site selectivity for EF-P relieved translation pausing. A similar effect has been previously observed for the macrolide dependent pausing of ermAL1 translation, at the leader sequence of ermA. In this example, the presence of an Ala two positions upstream of the P-site amino acid will pause translation only in the presence of certain A site amino acids such as Glu. Conversely, the presence of Phe or Gly in the −2 position produces a non-selective ribosome that either pauses (Phe) or continues translation (Gly) irrespective of the A site amino acid [26]. Amino acids at −2 have also been shown to be important in other translation pausing examples [4] and Peil et al. have also recently suggested that some Z−2P−1PX patterns (with Z and X representing any proteinogenic amino acid) could also induce EF-P relieved pauses [10].
In order to determine if there is a general role of the −2 amino acid on EF-P dependent pauses, the PPX ribosome densities for all possible Z−2P−1PX amino acid combinations were analyzed (Figs. 3 and S7, Table S4). When exclusively comparing the well-defined pausing and non-pausing sequences (Tables 1 and S3, respectively) acidic amino acids at the A site (X position on Z−2P−1PX) were found to stall translation independent of the identity of the −2 amino acid (Z on Z−2P−1PX). Similarly, hydrophobic or aromatic amino acids at the A site do not produce a pause independent of the identity of the amino acid at the −2 position. Some examples of these were confirmed using the GFP/mCherry system described above (Fig. 3B). Conversely to what was observed for acidic, hydrophobic and aromatic moieties, other amino acids at the A site have a pausing behavior that is context dependent. Four examples of this variable PPX behavior were further investigated: PP-basic, PP-amide, PP-OH and PPP. Similar to previous results with PPK patterns, pausing was only observed at PP-basic motifs when the −2 residue was acidic. In contrast, PP-amides always pause with the exception of some specific cases where there is an OH containing amino acid at the −2 position. These activities were confirmed for both patterns in the GFP/mCherry system, although acidic-PP-basic patterns have a weak effect on GFP translation as compared to the other patterns analyzed (Figs. 3A and 3C).
In ribosome profiling, PP-OH was only observed to pause when the A site was occupied by a Ser, whereas the presence of Tyr or Thr did not cause a translation pause. In the GFP/mCherry reporter system, PP-S and PP-T produced some decrease of translation in the Δefp strain. In some cases (with Gln or Val preceding PPX) Ser in the A site produced a stronger effect than Thr, but in others (with Asp preceding PPX) no difference was observed (Fig. 3D). Contrary to predictions based on previous reports, PPP sequences were only observed to pause with an Arg or Ile at −2 in the ribosome profiling data. The effect of amino acids at the −2 position was also studied using the GFP system (Fig. 3E). All PPP motifs produced at least a 4-fold decrease in GFP production in the Δefp strain compared to WT. Previous studies have suggested that longer Pro stretches will induce stronger pauses. For instance, we have previously shown that a 6 Pro stretch will reduce GFP translation 3- to 4-fold more than a 3 Pro stretch [11]. By contrast, the addition of only one Pro before PPP (PPPP) does not have any effect on either GFP expression or ribosome occupancies in ribosome profiling experiments (Figs. 3E and S7, Table S4) indicating that addition of a single prolyl residue is not enough to significantly reduce translation efficiency.
Effects of distal upstream sequences on PPX translation
The finding that all ZPPP motifs produced a ∼4 fold effect in the GFP/mCherry reporter system was unexpected, as in the ribo-seq data only RPPP and IPPP were observed to produce a strong pause. Moreover Val, that only appeared preceding non-pausing PPP in our ribo-seq data, had the strongest effect compared to the other ZPPP patterns (Figs. 3E and S7). The finding that some of the patterns tested in the reporter system were unable to reproduce the pausing tendency observed in ribo-seq suggests that other sequence features might have additional effects on the pausing of PPX. No obvious correlation was observed between the “X” amino acid and up to 12 codon positions further upstream of PPX in our set of validated in vivo pausing sequences.
To more broadly explore the contextual effect of larger sequences on EF-P dependent translation the expression efficiency of the atpA and atpD genes was compared in Salmonella. Both genes encode for similar proteins that contain a PPG motif and are expressed from the same mRNA transcript. However, proteomic analysis of Salmonella showed that only atpD expression appeared to be affected by efp deletion (20.6 fold difference expression for atpD contrasting with 1.05 for atpA) [11]. A similar trend, although less dramatic, was recently observed in a SILAC experiment performed with E. coli (5.18 fold difference in synthesis for atpD and 1.88 for atpA) [10]. Conversely, in the current ribosome profiling experiment, the pausing index at PPG in the Δefp strain is similar between the genes, 3.80 for atpA and 5.04 for atpD (Fig. 4A).
To verify the results from the high-throughput analyses, we conducted western blotting that confirmed the AtpD protein level is lower in an efp mutant of Salmonella (Fig. 4B, quantified in 4C). We then addressed the discrepancy in EF-P dependent expression of atpA and atpD by employing the previously used pXG10sf translational fusion system to compare translation in wild type and efp mutant Salmonella [11], [27], [28]. The constructs allowed for the constitutive transcription of mRNA bearing full-length atpA or atpD genes with “super-folder” GFP as a C-terminal translational fusion [29]. Consistent with proteomic analysis, the fluorescence measurements revealed that the expression of atpD was dependent on EF-P whereas atpA was not (Fig. 4D). Mutation of the PPG motif to PLG (P214L) abolished the EF-P requirement for atpD and, conversely, lengthening the PPG of atpA to PPPG (R279P) induced strong EF-P dependence. Unsurprisingly, mutation of the atpA PPG motif to PLG (P281L) did not have a significant affect as atpA expression was already independent of EF-P. No change in EF-P dependence was observed for either construct upon switching the second proline codon or by altering the upstream codons to strengthen or weaken binding to the aSD sequence of 16s rRNA (Fig. S8), consistent with results described above.
Since interactions between the nascent polypeptide chain and the ribosomal exit tunnel can affect translational stalling, the role in EF-P dependence of the regions upstream of the atpD and atpA PPG motifs was also investigated [2], [4], [30]–[32]. Up to 40 codons upstream of the atpD PPG motif were swapped into the pXG10sf-atpA construct while leaving the PPG motif and the remainder of the ORF intact, or vice versa. Swapping as few as two upstream codons from atpA into atpD led to a significant increase in expression of the atpD-gfp construct in the efp mutant of Salmonella (Figs. 5 and S9). This effect increased when four codons were swapped, but returned to a similar degree when six or more residues were switched. A similar reversal of EF-P dependence was observed for swapping upstream regions of atpD into atpA: a four codon swap led to a very small increase in EF-P dependence, which increased only marginally when six or more codons were swapped. The observation that swapping greater than six residues upstream had marginal or no additional effect suggests that, at least in this instance, the important interactions with the ribosomal exit tunnel are occurring close to the peptidyl transferase center and prior to the exit tunnel constriction that has been implicated in other extended translational stall motifs [2], [31], [33].
Interestingly, no effect was observed for either atpA or atpD when the Z−2 amino acid was swapped (E212R for AtpD; R279E for AtpA). However, swapping of the Z−3 position residues resulted in a drastic effect on EF-P dependence (N211R for AtpD; R278N for AtpA) (Fig. 5). The atpD N211R mutant construct partially alleviated EF-P dependence, though to a lesser degree than the Z−3Z−2 amino acid swap. In contrast, changing the Z−3 position of atpA to arginine as found in atpD (R279N) led to a dramatic increase in EF-P dependence, surpassing the effect of all other constructs where more residues were altered. Thus, while a common pattern could not be detected within the pausing PPX genes, at least in the case of atpA/D (and possibly of other PPG containing genes) the residue three positions upstream of the P site plays an important role in determining whether progression through PPG motifs will depend on EF-P. Furthermore, the data shows that other nearby residues can dampen this effect.
Discussion
Ribosome profiling and protein synthesis
A challenging aspect of analyzing ribosome profiling data is that increased density of ribosome footprints can indicate many ribosomes actively translating a transcript or an increased translation time [20]. The ribosome profiling data introduced here was compared with the available proteomic data obtained by SILAC [10] of wild type E. coli and Δefp strains. Upon comparing total footprints/gene ratios for Δefp/WT strains (obtained by ribo-seq) to differences in protein abundance for Δefp/WT strains detected by SILAC, most proteins (77%) seem unaffected, with only 2% of proteins showing a greater than 2-fold increase in both datasets (Fig. S10). This comparison also showed that 5.5% of the proteins having 2-fold higher protein abundance as detected by SILAC show decreased or unchanged ribosome occupancies from ribo-seq. This could be a result of many factors including differences in protein half-life or mRNA abundance between WT and Δefp strains. In this sense it is particularly interesting to note that synthesis of RNase II, which plays a critical role in mRNA turnover, is highly EF-P dependent. Ribo-seq showed that the PPQ motif in rnb (encoding RNAse II) had a 7.4 pausing index (Table S1) while SILAC detected RNase II to be 7 times more abundant in the WT versus the Δefp strain [10].
The differences between Ribo-Seq and SILAC may also be influenced by inherent biases of the ribosomal profiling method [34]. The protocol involved in generating footprints captures short mRNA fragments covered by exactly one ribosome. It remains possible that mRNA fragments with very closely located ribosomes (as we expect near pausing sites) could be lost as has been previously shown for other pauses [35]. Moreover, determining pausing sites by computing motif reads divided by gene average [1] can be misleading when the gene has more than one pausing motif or when the pausing motif is at the start of the gene; in both of these cases the average reads/gene would probably be inaccurate.
Defining sequence elements that require EF-P for efficient translation
The results presented here confirm previous observations that most PPX motifs do not require EF-P for proper translation [10], [11]. Instead, potential EF-P alleviated pauses are restricted to a small subset of these proteins. With the exception of PPG and PPP that have special structural features, all the strong EF-P dependent pauses found in the ribosome profiling data (Table 1) have a polar amino acid at the A site. Trp at the A site has been previously found to produce translation pausing in vitro [4], [10], but could not be detected in our in vivo ribosome profiling data. Similar results have been previously obtained through proteomic data [10]. This suggests a possible effect of some polar groups on the positioning of the amino acid moiety of aminoacyl-tRNA at the A site of ribosomes. More importantly, our data show a dependency of PPX pauses on the identity of amino acids located N-terminal of the pausing site. This is in accordance with recent reports from Peil et al. indicating that amino acids like Asp, Ala or Ile can stimulate EF-P dependent pauses if located just upstream of a PPX sequence [10]. In contrast to their report, our results indicate that the effects of preceding amino acids are highly dependent on the context in which they are located. For instance, an Asn will prevent pausing when located before PPK, but allow it when located next to PP [N/Q] (Fig. 3). More striking is a comparison of the pausing strength of [Q/V]PPS to [Q/V]PPT or of QPPS to NPPS (Fig. 3D). In both cases a single methyl group is enough to determine whether or not translation pauses. This shows an exquisite level of selectivity at either the PTC or the exit tunnel, both of which are usually expected to be fairly non selective in order to facilitate synthesis of all proteins.
Our finding that amino acids located two residues upstream of PPX (the Z−2 and Z−3 positions) influence pausing is supported by our previous finding that the motif generating the EF-P alleviated pause in PoxB requires a stretch of 6 amino acids [11]. Moreover, in this work we now observe that the region upstream of the PPG motifs in atpA and atpD can significantly influence EF-P dependence, with particular emphasis on the −3 position. Other residues located further upstream can also modulate the strength of the PPX pause (Fig. S9). One possible explanation for these results is that interactions between the nascent peptide and the exit tunnel modulate the selectivity of the A site, similar to what has been observed for the macrolide relieved pausing of ermAL1 translation [26] or the translocon relieved translation of secM [36]. In these two cases the role of the amino acid two positions upstream of the P site is as important as that observed here for most EF-P relieved pauses. In contrast, pausing during translation of TnaC seems to depend on the −3 position [37], similar to what we observed for atpD translation. Effects from positions further upstream also have relevant roles for some EF-P relieved pauses, as we have previously observed for the non-PPX pause on PoxB translation that depends on 6 continuous amino acids [11]. Thus, EF-P relived pauses depend on an array of diverse amino acid sequence contexts that interact with the PTC or the exit channel. Additional local effects of mRNA structure or interactions with the ribosome can not be ruled out as we did observe some variability in GFP expression depending on codon usage in the regions upstream of PPK (Fig. 2A). Nevertheless, these effects were usually small and did not correlate with either codon usage or affinity for the ribosomal aSD.
The role of EF-P in integrating different signals to regulate translation
We were surprised that only 2.8% of the PPX motifs detected by ribosome profiling had a pausing index of 10 or more (the threshold considered as strong pausing) [1]. Although, the remainder of the PPX motifs may not be pausing translation, the observed increase in ribosome density likely reflects that many PPX motifs can still slow translation. This might offer some explanation for the increased polysome retention previously observed for E. coli Δefp strains [17]. Similar effects are also observed after addition of chloramphenicol to E. coli cultures [17] or depletion of eIF5A (EF-P paralog) in eukaryotic cells [38]–[40]. These observations suggest that EF-P enhances the translation speed of several slightly slower segments of mRNA that collectively would have an important effect on global translation dynamics. Part of this translation enhancement may not come directly from EF-P binding to the ribosome, but from the release of tRNAPro that is trapped in other stalled ribosomes.
Both ribosome profiling and SILAC data show that EF-P directly affects the synthesis of several key components of translation, and therefore the loss of EF-P may have a broad but indirect impact on protein synthesis. YjjK (or EttA, Energy-dependent translational throttle A [41]) has recently been shown to be sensitive to the ATP/ADP ratio in the cell. EttA can control the progression of 70S ribosome initiation complexes into translation elongation and thus alter protein synthesis in energy-depleted cells [41], [42]. EttA has 2 PPX motifs, PPG and PPK, and in the ribo-seq data PPG caused a pausing index of 3.1 in the Δefp strain, a ∼14 fold increase compared to WT (Table S2). Similarly, SILAC data showed that Etta is 3.5 and 15.4 times more abundant in WT than in efp deletion strains in E. coli and Salmonella, respectively [10], [11]. In view of the fact that components of ATP synthase and also Etta are affected by the loss of EF-P [10], [11], it is conceivable that loss of EF-P may perturb the energy state of the cell, which may in turn contribute to the growth defect of the Δefp mutant. Our ribo-seq data also shows that another translation factor, LepA, has 2 PPXs that both caused a pausing index higher than 13.3, and SILAC data showed a significant change in the WT/Δefp ratio of 3.6. (Table 1 and S1 [10]). Changes in the levels of other proteins observed in the E. coli SILAC data [10] could further affect protein translation. Examples of this are RaiA (a translation inhibitor and ribosomal stability enhancer [43] that shows 8 fold increase in Δefp strain), Sra (a protein of unknown function that binds 30S ribosomal subunits during stationary phase [44] and shows a ∼2 fold increase in Δefp strain) and several proteins involved in tRNA processing (RNaseII(Rnb) [45]), modification (MnmE, MnmG, SelU [46], [47]) or aminoacylation (LysU, ValS [48]) that present 2- to 7-fold reduced levels in the Δefp strain. In addition, SILAC data shows a ∼2 fold decrease in the levels of the chaperone HslU in the Δefp strain. As this chaperone is also part of the HslVU protease [49], changes in its levels could affect protein stability. Additionally, a 3 fold increase in the levels of HchA(Hsp31) in the Δefp strain could also produce changes in protein stability as this chaperone has been proposed to have some proteolytic activity [50].
Altogether, changes in the levels of proteins involved in protein synthesis and stability could explain some of the differences we observe between ribo-seq and SILAC data. Additionally, the effects observed here on cellular levels of translation factors such as EttA and LepA, on proteins modifying protein stability such as HslU and on proteins expected to influence mRNA turnover such as RNaseII, suggest a broad role for EF-P in integrating and balancing different inputs that determine the efficiency of protein synthesis.
Materials and Methods
General methods
E. coli BW25113 (Wild type) and Δefp E. coli strains were obtained from the Keio collection. For the Δefp strain, the kanamycin cassettes was removed via pCP20-encoded FLP recombinase and was confirmed by PCR [51], [52]. The Δefp E. coli complemented strain was constructed by introducing the efp open reading frame in trans on the arabinose-inducible vector pBAD (Δefp pEF-P strain). Plasmids used for motif verification were derivatives of pBAD30 [53]. As previously described [11], the plasmid contained a tandem fluorescent fusion cassette composed of green fluorescent protein (gfp) followed directly by mCherry. A cloning site was added to that construct after the 3rd codon of gfp. This plasmid was subsequently designated pBAD30XS. Patterns were inserted into pBAD30XS by double strand oligo hybridization.
Translational fusion experiments assessing the expression of atpA and atpD were conducted in Salmonella enterica serovar Typhimurium strain 14028s (referred to as Salmonella in body text) and an isogenic efp deletion mutant designed to avoid interference with the yjeK promoter [11]. The open reading frame of atpA or atpD, plus 75 or 74 bp upstream (respectively) were inserted into the NheI and NsiI sites of the pXG10sf plasmid employed previously [11], [27], [28]. The plasmid employs a tightly controlled, low-copy number origin of replication (pSC101) and the constitutively active PLtet0-1 promoter to minimize variation in transcript levels. Mutations and swaps were generated using site-directed mutagenesis or Gibson isothermal assembly cloning using overlapping primers [54]. All strains, plasmids and primers used in this study are described in Table S6.
Ribosome profiling
Saturated cultures of Wild type E. coli BW25113, Δefp and the complemented strain were diluted to an OD600 nm of 0.01 in 200 mL of Luria broth medium. The media for the Δefp complemented strain was supplemented with 0.02% arabinose. Strains were grown at 37°C, 250 rpm to OD600 nm 0.4–0.5. Cells were harvested by rapid filtration [22] through a prewarmed 0.45 µm nitrocellulose membrane; the cells were scraped onto a pre-warmed spatula then directly submerged in liquid nitrogen. The frozen cells were dislodged into 0.65 ml of cold lysis buffer [20 mM Tris-HCl pH 8.0, 10.5 mM MgCl2, 40 U/µl RNase Inhibitor (Roche), and 100 U/ml Turbo DNase (NEB)] [17] and re-chilled in liquid nitrogen. The harvested cells were lysed by freeze/thaw 3 times then spun down at full speed for 10 min at 4°C. The clarified supernatant was immediately frozen in liquid nitrogen and stored at 80°C. This lysate was used to prepare footprint and total mRNA samples. A detailed description of sample preparation and library generation is provided in Text SI.
Data analysis
Ribosome profiling sequence reads were trimmed and aligned to the E. coli K12 MG1655 reference genome (Genebank version U00096.2) using FASTX-Toolkit and bwa (0.6.2) [55]. A file describing the coverage for each feature in the E. coli genome, was created using bedtools (2.17.0). Reads in each gene were normalized by the average of reads in the whole ORF and pauses were identified searching for peaks where ribosome occupancies were at least 10 fold above the gene average. A detailed description of these and other sequence analyses is provided in Text SI. Data has been deposited in the NCBI Sequence Read Archive (SRA) BioProject no. PRJNA241328.
Pausing-pattern verification
Overnight cultures of E. coli strains harboring pBAD30XS constructs in LB were diluted to an optical density at 600 nm (OD600) of 0.05 in M9 minimal salts medium supplemented with 0.4% (wt/vol) glycerol, 100 µg/ml ampicillin and 0.2% arabinose. All cultures were incubated at 37°C. Fluorescence was assessed using a spectrofluorimeter (Horiba) after 10 hrs. Cells were analyzed for GFP using excitation at 481 nm and emission at 507 nm and for mCherry with excitation at 587 nm and emission at 610 nm. The background level with blank medium was subtracted, and the ratio of GFP fluorescence over that of mCherry was calculated. Reported values represent averages and standard deviations determined from three independent experimental replicates.
GFP fluorescence assay for translational fusions of atpA and atpD
As described previously, LB overnight cultures of wild type or efp mutant Salmonella bearing a atpD or atpA constructs in pXG10sf were diluted 1/200 into MOPS minimal medium supplemented with 0.2% glucose and 20 µg/ml chloramphenicol [11]. The cultures were grown for 16 hrs at 37°C with shaking in a Tecan Infinite M200 microplate reader. Fluorescence (475 and 511 nm excitation and emission wavelengths, respectively) and OD600 were measured every 15 min. Background from media-only controls was subtracted and data was expressed as GFP fluorescence per OD600 unit at 10 hr post-inoculation.
Immunoblotting and quantification
Salmonella strains were grown in MOPS minimal media supplemented with 0.2% glucose to mid log phase (OD600∼0.5), washed twice (1 mM Tris pH 8.0, 5 mM magnesium acetate) and lysed by sonication in lysis buffer (9.32 M urea, 2.67 M thiourea, 40 mM Tris, 86.78 mM CHAPS, pH 8.5). 10 µg of total cell lysate was mixed with 2× SDS loading buffer and boiled for 10 min at 95°C. Proteins were separated by SDS-PAGE and transferred (semidry) to a nitrocellulose membrane. Following 1 h blocking at room temperature in 5% milk in TBST (1× Tris-buffered saline, 0.05% Tween 20), immunoblotting was conducted overnight in TBST +5% mile at 4°C using a mouse anti-DnaK antibody (1∶50,000; Enzo Life Sciences) and a mouse monoclonal antibody specific for the beta subunit of E. coli ATP Synthase (1∶1000; MitoSciences). Blots were washed and subsequently incubated for 1 h at room temperature with HRP-fused goat anti-mouse antibody (1∶10,000 in TBST +5% milk) for ECL imaging (Thermo Scientific). Quantification of AtpD protein levels relative to DnaK was done using Image Lab software (Bio-Rad Laboratories).
Supporting Information
Zdroje
1. LiGW, OhE, WeissmanJS (2012) The anti-Shine-Dalgarno sequence drives translational pausing and codon choice in bacteria. Nature 484: 538–541.
2. ItoK, ChibaS, PoglianoK (2010) Divergent stalling sequences sense and control cellular physiology. Biochem Biophys Res Commun 393: 1–5.
3. LuJ, DeutschC (2008) Electrostatics in the ribosomal tunnel modulate chain elongation rates. J Mol Biol 384: 73–86.
4. WoolstenhulmeCJ, ParajuliS, HealeyDW, ValverdeDP, PetersenEN, et al. (2013) Nascent peptides that block protein synthesis in bacteria. Proc Natl Acad Sci U S A 110: E878–887.
5. YapMN, BernsteinHD (2009) The plasticity of a translation arrest motif yields insights into nascent polypeptide recognition inside the ribosome tunnel. Mol Cell 34: 201–211.
6. KomarAA, LesnikT, ReissC (1999) Synonymous codon substitutions affect ribosome traffic and protein folding during in vitro translation. FEBS Lett 462: 387–391.
7. TsaiCJ, SaunaZE, Kimchi-SarfatyC, AmbudkarSV, GottesmanMM, et al. (2008) Synonymous mutations and ribosome stalling can lead to altered folding pathways and distinct minima. J Mol Biol 383: 281–291.
8. DoerfelLK, WohlgemuthI, KotheC, PeskeF, UrlaubH, et al. (2013) EF-P is essential for rapid synthesis of proteins containing consecutive proline residues. Science 339: 85–88.
9. UdeS, LassakJ, StarostaAL, KraxenbergerT, WilsonDN, et al. (2013) Translation elongation factor EF-P alleviates ribosome stalling at polyproline stretches. Science 339: 82–85.
10. PeilL, StarostaAL, LassakJ, AtkinsonGC, VirumaeK, et al. (2013) Distinct XPPX sequence motifs induce ribosome stalling, which is rescued by the translation elongation factor EF-P. Proc Natl Acad Sci U S A 110: 15265–15270.
11. HerschSJ, WangM, ZouSB, MoonKM, FosterLJ, et al. (2013) Divergent protein motifs direct elongation factor P-mediated translational regulation in Salmonella enterica and Escherichia coli. MBio 4: e00180–00113.
12. Hanawa-SuetsuguK, SekineS, SakaiH, Hori-TakemotoC, TeradaT, et al. (2004) Crystal structure of elongation factor P from Thermus thermophilus HB8. Proc Natl Acad Sci U S A 101: 9595–9600.
13. YanagisawaT, SumidaT, IshiiR, TakemotoC, YokoyamaS (2010) A paralog of lysyl-tRNA synthetase aminoacylates a conserved lysine residue in translation elongation factor P. Nat Struct Mol Biol 17: 1136–1143.
14. BlahaG, StanleyRE, SteitzTA (2009) Formation of the first peptide bond: the structure of EF-P bound to the 70S ribosome. Science 325: 966–970.
15. RoyH, ZouSB, BullwinkleTJ, WolfeBS, GilreathMS, et al. (2011) The tRNA synthetase paralog PoxA modifies elongation factor-P with (R)-beta-lysine. Nat Chem Biol 7: 667–669.
16. PeilL, StarostaAL, VirumaeK, AtkinsonGC, TensonT, et al. (2012) Lys34 of translation elongation factor EF-P is hydroxylated by YfcM. Nat Chem Biol 8: 695–697 doi:10.1038/nchembio.1001
17. BullwinkleTJ, ZouSB, RajkovicA, HerschSJ, ElgamalS, et al. (2013) (R)-beta-lysine-modified elongation factor P functions in translation elongation. J Biol Chem 288: 4416–4423.
18. GutierrezE, ShinBS, WoolstenhulmeCJ, KimJR, SainiP, et al. (2013) eIF5A promotes translation of polyproline motifs. Mol Cell 51: 35–45.
19. ParkMH (2006) The post-translational synthesis of a polyamine-derived amino acid, hypusine, in the eukaryotic translation initiation factor 5A (eIF5A). J Biochem 139: 161–169.
20. IngoliaNT, BrarGA, RouskinS, McGeachyAM, WeissmanJS (2012) The ribosome profiling strategy for monitoring translation in vivo by deep sequencing of ribosome-protected mRNA fragments. Nat Protoc 7: 1534–1550.
21. IngoliaNT, GhaemmaghamiS, NewmanJR, WeissmanJS (2009) Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science 324: 218–223.
22. OhE, BeckerAH, SandikciA, HuberD, ChabaR, et al. (2011) Selective ribosome profiling reveals the cotranslational chaperone action of trigger factor in vivo. Cell 147: 1295–1308.
23. LorenzR, BernhartSH, Höner Zu SiederdissenC, TaferH, FlammC, et al. (2011) ViennaRNA Package 2.0. Algorithms Mol Biol 6: 26.
24. FredrickK, IbbaM (2010) How the sequence of a gene can tune its translation. Cell 141: 227–229.
25. Cruz-VeraLR, Magos-CastroMA, Zamora-RomoE, GuarnerosG (2004) Ribosome stalling and peptidyl-tRNA drop-off during translational delay at AGA codons. Nucleic Acids Res 32: 4462–4468.
26. RamuH, Vazquez-LaslopN, KlepackiD, DaiQ, PiccirilliJ, et al. (2011) Nascent peptide in the ribosome exit tunnel affects functional properties of the A-site of the peptidyl transferase center. Mol Cell 41: 321–330.
27. CorcoranCP, PodkaminskiD, PapenfortK, UrbanJH, HintonJC, et al. (2012) Superfolder GFP reporters validate diverse new mRNA targets of the classic porin regulator, MicF RNA. Mol Microbiol 84: 428–445.
28. UrbanJH, VogelJ (2009) A green fluorescent protein (GFP)-based plasmid system to study post-transcriptional control of gene expression in vivo. Methods Mol Biol 540: 301–319.
29. PedelacqJD, CabantousS, TranT, TerwilligerTC, WaldoGS (2006) Engineering and characterization of a superfolder green fluorescent protein. Nat Biotechnol 24: 79–88.
30. WilsonDN, BeckmannR (2011) The ribosomal tunnel as a functional environment for nascent polypeptide folding and translational stalling. Curr Opin Struct Biol 21: 274–282.
31. TannerDR, CarielloDA, WoolstenhulmeCJ, BroadbentMA, BuskirkAR (2009) Genetic identification of nascent peptides that induce ribosome stalling. J Biol Chem 284: 34809–34818.
32. NakatogawaH, ItoK (2002) The ribosomal exit tunnel functions as a discriminating gate. Cell 108: 629–636.
33. BhushanS, HoffmannT, SeideltB, FrauenfeldJ, MielkeT, et al. (2011) SecM-stalled ribosomes adopt an altered geometry at the peptidyl transferase center. PLoS Biol 9: e1000581.
34. DanaA, TullerT (2012) Determinants of translation elongation speed and ribosomal profiling biases in mouse embryonic stem cells. PLoS Comput Biol 8: e1002755.
35. WolinSL, WalterP (1988) Ribosome pausing and stacking during translation of a eukaryotic mRNA. EMBO J 7: 3559–3569.
36. RychkovaA, MukherjeeS, BoraRP, WarshelA (2013) Simulating the pulling of stalled elongated peptide from the ribosome by the translocon. Proc Natl Acad Sci U S A 110: 10195–10200.
37. WilsonDN (2011) Peptides in the ribosomal tunnel talk back. Mol Cell 41: 247–248.
38. CanoVS, JeonGA, JohanssonHE, HendersonCA, ParkJH, et al. (2008) Mutational analyses of human eIF5A-1–identification of amino acid residues critical for eIF5A activity and hypusine modification. FEBS J 275: 44–58.
39. DiasCA, CanoVS, RangelSM, ApponiLH, FrigieriMC, et al. (2008) Structural modeling and mutational analysis of yeast eukaryotic translation initiation factor 5A reveal new critical residues and reinforce its involvement in protein synthesis. FEBS J 275: 1874–1888.
40. RossiD, KuroshuR, ZanelliCF, ValentiniSR (2014) eIF5A and EF-P: two unique translation factors are now traveling the same road. Wiley Interdiscip Rev RNA 5: 209–222.
41. BoëlG, SmithPC, NingW, EnglanderMT, ChenB, et al. (2014) The ABC-F protein EttA gates ribosome entry into the translation elongation cycle. Nat Struct Mol Biol 21: 143–51 doi: 10.1038/nsmb.2740
42. ChenB, BoelG, HashemY, NingW, FeiJ, et al. (2014) EttA regulates translation by binding the ribosomal E site and restricting ribosome-tRNA dynamics. Nat Struct Mol Biol 21: 152–159.
43. Vila-SanjurjoA, SchuwirthBS, HauCW, CateJH (2004) Structural basis for the control of translation initiation during stress. Nat Struct Mol Biol 11: 1054–1059.
44. IzutsuK, WadaC, KomineY, SakoT, UeguchiC, et al. (2001) Escherichia coli ribosome-associated protein SRA, whose copy number increases during stationary phase. J Bacteriol 183: 2765–2773.
45. ReuvenNB, DeutscherMP (1993) Multiple exoribonucleases are required for the 3′ processing of Escherichia coli tRNA precursors in vivo. FASEB J 7: 143–148.
46. ElseviersD, PetrulloLA, GallagherPJ (1984) Novel E. coli mutants deficient in biosynthesis of 5-methylaminomethyl-2-thiouridine. Nucleic Acids Res 12: 3521–3534.
47. WolfeMD, AhmedF, LacourciereGM, LauhonCT, StadtmanTC, et al. (2004) Functional diversity of the rhodanese homology domain: the Escherichia coli ybbB gene encodes a selenophosphate-dependent tRNA 2-selenouridine synthase. J Biol Chem 279: 1801–1809.
48. IbbaM, SollD (2000) Aminoacyl-tRNA synthesis. Annu Rev Biochem 69: 617–650.
49. SeongIS, OhJY, LeeJW, TanakaK, ChungCH (2000) The HslU ATPase acts as a molecular chaperone in prevention of aggregation of SulA, an inhibitor of cell division in Escherichia coli. FEBS Lett 477: 224–229.
50. MalkiA, CaldasT, AbdallahJ, KernR, EckeyV, et al. (2005) Peptidase activity of the Escherichia coli Hsp31 chaperone. J Biol Chem 280: 14420–14426.
51. BabaT, AraT, HasegawaM, TakaiY, OkumuraY, et al. (2006) Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection. Mol Syst Biol 2: 2006 0008.
52. DatsenkoKA, WannerBL (2000) One-step inactivation of chromosomal genes in Escherichia coli K-12 using PCR products. Proc Natl Acad Sci U S A 97: 6640–6645.
53. GuzmanLM, BelinD, CarsonMJ, BeckwithJ (1995) Tight regulation, modulation, and high-level expression by vectors containing the arabinose PBAD promoter. J Bacteriol 177: 4121–4130.
54. GibsonDG, YoungL, ChuangRY, VenterJC, HutchisonCA3rd, et al. (2009) Enzymatic assembly of DNA molecules up to several hundred kilobases. Nat Methods 6: 343–345.
55. LiH, DurbinR (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25: 1754–1760.
56. LetzringDP, DeanKM, GrayhackEJ (2010) Control of translation efficiency in yeast by codon-anticodon interactions. RNA 16: 2516–2528.
57. CrooksGE, HonG, ChandoniaJM, BrennerSE (2004) WebLogo: a sequence logo generator. Genome Res 14: 1188–1190.
Štítky
Genetika Reprodukční medicínaČlánek vyšel v časopise
PLOS Genetics
2014 Číslo 8
Nejčtenější v tomto čísle
- Meta-Analysis of Genome-Wide Association Studies in African Americans Provides Insights into the Genetic Architecture of Type 2 Diabetes
- KDM6 Demethylase Independent Loss of Histone H3 Lysine 27 Trimethylation during Early Embryonic Development
- The RNA Helicases AtMTR4 and HEN2 Target Specific Subsets of Nuclear Transcripts for Degradation by the Nuclear Exosome in
- EF-P Dependent Pauses Integrate Proximal and Distal Signals during Translation