PEDS Advance Access originally published online on March 20, 2006
Protein Engineering Design and Selection 2006 19(6):285-289; doi:10.1093/protein/gzl007
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Short Communication |
Generation and analysis of proline mutants in protein G
1 Division of Biology, California Institute of Technology 1200 E. California Boulevard, Pasadena, CA, USA 2 Howard Hughes Medical Institute and Division of Chemistry and Chemical Engineering, California Institute of Technology 200 E. California Boulevard, Pasadena, CA 91125, USA
3To whom correspondence should be addressed. E-mail: steve{at}mayo.caltech.edu
| Abstract |
|---|
|
|
|---|
The pyrrolidine ring of the amino acid proline reduces the conformational freedom of the protein backbone in its unfolded form and thus enhances protein stability. The strategy of inserting proline into regions of the protein where it does not perturb the structure has been utilized to stabilize many different proteins including enzymes. However, most of these efforts have been based on trial and error, rather than rational design. Here, we try to understand proline's effect on protein stability by introducing proline mutations into various regions of the B1 domain of Streptococcal protein G. We also applied the Optimization of Rotamers By Iterative Techniques computational protein design program, using two different solvation models, to determine the extent to which it could predict the stabilizing and destabilizing effects of prolines. Use of a surface area dependent solvation model resulted in a modest correlation between the experimental free energy of folding and computed energies; on the other hand, use of a Gaussian solvent exclusion model led to significant positive correlation. Including a backbone conformational entropy term to the computational energies increases the statistical significance of the correlation between the experimental stabilities and both solvation models.
Keywords: proline/protein/design/protein G/protein stability
| Introduction |
|---|
|
|
|---|
Proline is the only naturally occurring amino acid in which the side chain is bonded to the backbone nitrogen, forming a five-membered pyrrolidine ring. This pyrrolidine ring restricts the rotation of the NC
bond, decreasing the backbone conformational entropy of the unfolded form of the protein relative to other naturally occurring amino acids. This allows proline substitution to increase the stability of a protein by decreasing the entropic difference between the unfolded and the folded form, thereby increasing the free energy difference (Nemethy et al., 1966
On the other hand, prolines are also notorious for destabilizing proteins. It is well known that prolines, located internally in
-helices or ß-sheets, break the secondary structures, thus destabilizing the protein. Two main factors cause prolines to break secondary structures. One is the absence of hydrogen on the amide nitrogen, which prohibits prolines from acting as a donor in a hydrogen bond. Another is the steric constraint placed on proline and the neighboring residues by the pyrrolidine ring, hindering secondary structure formation. The phi and psi angles preferred by prolines are far from the typical range of those for ß-sheets and thus distort the strand significantly. In addition, steric restriction drives the residue preceding proline to prefer the beta conformation, thus limiting the occurrence of prolines in
-helices.
Because of the contradictory effects described above, stabilization of proteins by proline incorporation has typically been achieved by trial and error. The general understanding is that proline mutation is stabilizing because of entropic factors but this behavior can be masked and even reversed by destabilizing enthalpic changes. Thus, protein stabilization by prolines has been achieved by placing prolines in relatively solvent-exposed locations, where they would not disturb the stabilizing interactions of the protein, for example in loops and turns or the first turn of an
-helix (Watanabe and Suzuki, 1998
). In this article we examine the effects of proline mutations in protein G and attempt to predict these effects using computational tools with the aim of reasonably incorporating proline in computational protein design efforts.
| Materials and methods |
|---|
|
|
|---|
Mutagenesis and protein purification
Mutants of the B1 domain of protein G (Gß1) were constructed using inverse PCR in plasmid pET11A, expressed using BL21 (DE3) cells and purified as described previously (Malakauskas and Mayo, 1998
). Two forms, 56- and 57-residue species, of Gß1 resulted due to incomplete processing of the N-terminal methionine. In this study the 56-residue species of Gß1 mutants and the wild type were used. Molecular weights were verified using mass spectrometry.
CD analysis
CD data were collected on an Aviv 62DS spectrometer equipped with a thermoelectric unit. Thermal denaturation experiments were monitored at 218 nm from 1 to 99°C by 1°C increments with an equilibration time of 1.5 min using 50 µM of protein in 50 mM sodium phosphate at pH 5.5. The midpoint of the thermal unfolding transition (Tm) was determined from a two-state analysis of each denaturation curve (Minor and Kim, 1994
). Guanidinium chloride denaturations were performed at 25°C using 5 µM of protein in 50 mM sodium phosphate at pH 5.5. Data were collected for 5 min and averaged. Free energies of folding (
Gf) and error estimates were obtained by fitting the denaturation data to a two-state transition model (Santoro and Bolen, 1988
) using Kaleidagraph (Synergy Software). Chemical and thermal melting curves for protein G and its variants are presented in the Supplemental data available at PEDS online.
Computational analysis
The crystal structure of wild-type Gß1 (PDB ID: 1pga
[PDB]
) was used as the starting template for energy calculations. Explicit hydrogens were added using MolProbity (Lovell et al., 2003
) and the structure was energy minimized for 50 steps to remove any steric clashes (Mayo et al., 1990
). For each mutant, proline was substituted at the selected position, and the protein design program, ORBIT (Dahiyat and Mayo, 1997a
, b
; Dahiyat et al., 1997
; Street and Mayo, 1998
; Pierce et al., 2000
) was used to optimize the structure (selecting the optimal rotamer for proline as well as for all the other residues in the protein) and to calculate energies. Solvation energies were calculated using the method of either Street and Mayo (1998)
or Lazaridis and Karplus (1999)
.
| Results and discussion |
|---|
|
|
|---|
Proline mutants of Gß1
Prolines tend to have a phi angle of
63°, while the psi angle clusters around two regions in the Ramachandran map, 35° (
region) and 150° (ß region) (MacArthur and Thornton, 1991
). We selected 10 Gß1 residues having phi and psi angles compatible with proline for mutation: Thr2, Gly9, Lys10, Val21, Ala23, Ala24, Thr25, Val29, Asp36 and Ala48. In order to explore the effect of prolines in different structural environments, these residues were selected from various regions of the protein (Figure 1). Thr2 and Gly9 are located in a ß-strand. Ala23, Ala24 and Thr25 are the first three N-terminal residues on the
-helix, Val29 is in the middle of the helix and Asp36 is the C-terminal residue of the helix. The remaining residues are located in the loops and turns connecting the secondary structural elements. The phi and psi angles of the preceding residue of Thr2, Gly9, Lys10, Val21 and Ala23 are in the ß region of the Ramachandran map. Residues preceding proline prefer the ß region because their Cß and amide nitrogen sterically conflict with the C
of proline (Schimmel and Flory, 1968
; Matthews et al., 1987
; Hurley et al., 1992
).
|
Stability studies and analysis
The stability of each of the mutants was determined by performing thermal and chemical denaturation experiments (Table I and Supplementary data available at PEDS online). The far UV CD spectra before and after thermal denaturation indicate that all mutants except for K10P, T25P and V29P fold reversibly (data not shown). The post-transition region of the melting curves for V21P and V21P/A23P extends beyond the experimental range of 99°C, which leads to large estimated errors.
|
As expected from the fact that proline is a well-known secondary structure breaker, most of the proline mutants were less stable than the wild-type protein. An exception to this was V21P, which exhibited a
Gf enhancement of 0.5 kcal/mol and a Tm increase of 6°C compared with the wild-type protein. This value agrees well with the expected energy of stabilization generated by the entropic difference between Val and Pro in the unfolded state. According to the method of Nemethy et al. (1966)
-helix. It is solvent-exposed and does not interact with other residues; thus, mutation to proline does not disturb any energetically favorable interaction. Given these observations, it is not surprising that the increase in stability for V21P is close to the expected value.
Two residues, Lys10 and Ala48 are located in the i + 1 position of a ß-turn. The fact that proline is the most favored residue for the i + 1 position of ß-turns has been rationalized by the analysis of protein structures. These studies reveal that prolines have phi angles that are favored in that position (Hutchinson and Thornton, 1994
). Also, mutational studies report an increase in thermostability with proline substitution in this position (Watanabe and Suzuki, 1998
). In our case, these two proline mutants were slightly destabilizing compared with the wild-type protein. This may be partly due to the loss of a hydrogen bond between the amide nitrogen of Ala48 and the carboxylate of Asp46, and for Lys10 the loss of electrostatic interactions with negatively charged residues in the vicinity (Asp40, Glu56 and the C-terminal carboxyl group).
Prolines occur in the first turn of
-helices with high frequency. It has been suggested that proline residues are not destabilizing in this position because the amide hydrogens in the first turn do not make backbone/backbone hydrogen bonds within the helix (von Heijne, 1991
). Within the first turn, prolines exist predominantly at the N1 position because steric clashes would result if the preceding residue is in a helical conformation (Yun et al., 1991
; Cochran et al., 2001
). In our experiments, the proline mutation at the N2 position (A24P) was more destabilizing than the one at the N1 position (A23P), consistent with previous observations. However, both proline mutations were slightly destabilizing. Contrary to some mutational studies, but consistent with our results for A23P and A24P as well as for K10P and A48P, peptide helicity measurements demonstrated that prolines do not stabilize N terminal residues of
-helices or the i + 1 residue of ß-turns. Instead, it was suggested that prolines occur frequently in certain locations because they are tolerated and not because they are stabilizing (Cochran et al., 2001
). Proline substitution at Thr25, which is located in the N3 position, was highly destabilizing with 
Gf of 2.8 kcal/mol and a Tm decrease of 21°C compared with the wild-type protein. The amide nitrogen of the residues in the first turn of an
-helix frequently forms hydrogen bonds with a nearby side chain, such as the N-cap residue, in order to satisfy its hydrogen bond donor potential. This is particularly true for the amide nitrogen of an N3 residue, which has frequently been observed to form a hydrogen bond with the N-cap side chain (Penel et al., 1999
). In the wild-type Gß1 structure, the backbone and side chain of Thr25 make extensive hydrogen bonds with the carboxylate group of the N-cap residue, Asp22. This interaction stabilizes and caps the first turn of the helix (Gronenborn et al., 1991
). Substituting proline at this position eliminates this stabilizing interaction, thus destabilizing the protein.
It has been reported that the cost of introducing a proline into an
-helix is
3.4 kcal/mol (Oneil and Degrado, 1990
; Yun et al., 1991
). Consistent with this, our Val29 to proline mutation destabilized the protein by 3.5 kcal/mol and decreased the Tm by 23°C.
Proline substitution in the ß-sheet at Thr2 and Gly9 destabilizes the protein by distorting the secondary structure of the ß-sheet. The far UV CD spectra of these two mutants deviate significantly from the wild-type spectrum, suggesting a change in secondary structure content (data not shown).
Introduction of increasing numbers of prolines, up to nine, additively increased the stability of oligo-1,6-glucosidase (Watanabe et al., 1994
). To explore whether the stabilizing/destabilizing effects of our proline mutants were additive, we constructed a double mutant, V21P/A23P. To a first approximation, the effect was additive, resulting in a near zero effect on stability, as can be seen in Table I.
Comparison of computational and experimental energies
The energy of each of the mutants and the wild-type protein was calculated by substituting proline at the respective positions and optimizing all side chains using Optimization of Rotamers By Iterative Techniques (ORBIT). We used two different methods of calculating solvation energy, a surface area dependent solvation model from Street and Mayo (1998)
and a Gaussian solvent exclusion model from Lazaridis and Karplus (1999)
. The energies are reported in Table I. Mutants T2P, G9P, V29P and D36P showed significant backbone movement in the mutated region after minimization of the optimized structure determined by ORBIT and/or exhibited large deviations of their far UV CD spectra compared with the wild-type protein. Since ORBIT utilizes the static backbone of the wild-type crystal structure for its calculations, the ORBIT energy of these mutants is not likely to reflect the energy of the true structure. Thus, they were not considered in the correlation analysis between the calculated ORBIT energy (Ecalc) and
Gf obtained using experiment. However, we would like to point out that ORBIT does predict the destabilizing effects of three (G9P, V29P, D36P) of the four mutants excluded. They all have very high computed energies, due to large van der Waals clashes between side chain and the backbone. We propose that this is the reason for the destabilization of these mutants and deviation of their CD spectra.
Excluding the mutants mentioned above, the agreement between the ORBIT energy difference between the mutant (P) and the wild type (WT) [
Ecalc(P WT)] and the experimentally determined free energy difference [
Gf(P WT)] resulted in an R2 value of 0.79 using the Street and Mayo (SM) solvation model (Figure 2A) and of 0.94 using the Lazaridis and Karplus (LK) solvation model (Figure 3A). Correlation between Tm and ORBIT energy resulted in R2 values of 0.78 and 0.79 for the SM and LK models, respectively (Figures 2B and 3B). In order to estimate the prediction error of ORBIT energies more accurately and to consider whether the correlation for the dataset is dominated by the result for T25P, we used the leave-one-out cross-validation method on the free energy dataset. The cross-validation estimate of prediction error was 14.5 for the SM method while the LK method gave a significantly lower value of 0.12. Thus the Lazaridis and Karplus excluded volume solvation model based ORBIT energies show greater correlation to experimental
Gu and a lower estimate of prediction error than the surface area dependent solvation model based energies, suggesting that the LK model performs better in describing the proline mutants. Overall, although some false positives occur, ORBIT is reasonably predictive in ranking the stabilities of the various mutants as indicated by Spearman's rank correlation coefficients of 0.75 (P < 0.01) and 0.93 (P < 0.01) for the SM and LK free energy correlations, respectively (data not shown).
|
|
In our study, prolines have typically not been included in the set of amino acids that ORBIT considers in protein design calculations because the potential energy function used in ORBIT does not include a conformational entropy term. We tested whether including a backbone conformational entropy term to the computational energy increases the rank correlation between the experimental stabilities and computational energies. Using the backbone entropy scale from Stites and Pranata (1995)
As described above, proline stabilizes a protein by decreasing the backbone conformational entropy of the unfolded state. ORBIT predicts the stability of proline mutations reasonably well without an entropic term. This is likely due to the dominance of enthalpic contributions over entropic contributions in protein stability modulation by prolines, which overshadows the missing entropic term in the energy function. Nevertheless, addition of an entropic term with appropriate weighting factor increases the correlation between computational energy and experimental energy, especially for the surface area based solvation method.
| Acknowledgements |
|---|
|
|
|---|
The authors thank I. Caglar Tanrikulu for the helpful discussions and Marie Ary for editing the manuscript. This work was supported by the Howard Hughes Medical Institute, the Ralph M. Parsons Foundation and an IBM Shared University Research Grant.
| References |
|---|
|
|
|---|
Cochran D.A., Penel S., Doig A.J. (2001) Protein Sci. 10:463470.[CrossRef][Web of Science][Medline]
Dahiyat B.I. and Mayo S.L. (1996) Protein Sci. 5:895903.[Web of Science][Medline]
Dahiyat B.I. and Mayo S.L. (1997a) Science 278:8287.
Dahiyat B.I. and Mayo S.L. (1997b) Proc. Natl Acad. Sci. USA 94:1017210177.
Dahiyat B.I., Gordon D.B., Mayo S.L. (1997) Protein Sci. 6:13331337.[Web of Science][Medline]
Gronenborn A.M., Filpula D.R., Essig N.Z., Achari A., Whitlow M., Wingfield P.T., Clore G.M. (1991) Science 253:657661.
Humphrey W., Dalke A., Schulten K. (1996) J. Mol. Graph. 14:3338.[CrossRef][Web of Science][Medline]
Hurley J.H., Mason D.A., Matthews B.W. (1992) Biopolymers 32:14431446.[CrossRef][Web of Science][Medline]
Hutchinson E.G. and Thornton J.M. (1994) Protein Sci. 3:22072216.[Web of Science][Medline]
Lazaridis T. and Karplus M. (1999) Proteins 35:133152.[CrossRef][Web of Science][Medline]
Lovell S.C., Davis I.W., Arendall W.B. 3rd, de Bakker P.I., Word J.M., Prisant M.G., Richardson J.S., Richardson D.C. (2003) Proteins 50:437450.[CrossRef][Web of Science][Medline]
MacArthur M.W. and Thornton J.M. (1991) J. Mol. Biol. 218:397412.[CrossRef][Web of Science][Medline]
Malakauskas S.M. and Mayo S.L. (1998) Nat. Struct. Biol. 5:470475.[CrossRef][Web of Science][Medline]
Matthews B.W., Nicholson H., Becktel W.J. (1987) Proc. Natl Acad. Sci. USA 84:66636667.
Mayo S., Olafson B., Goddard W. (1990a) J. Phys. Chem. 94:88978909.[CrossRef]
Minor D.L. Jr and Kim P.S. (1994) Nature 367:660663.[CrossRef][Medline]
Nemethy G., Leach S., Scheraga H. (1966) J. Phys. Chem. 70:9981004.
Oneil K.T. and Degrado W.F. (1990) Science 250:646651.
Penel S., Hughes E., Doig A.J. (1999) J. Mol. Biol. 287:127143.[CrossRef][Web of Science][Medline]
Pierce N.A., Spriet J.A., Desmet J., Mayo S.L. (2000) J. Comput. Chem. 21:9991009.[CrossRef][Web of Science]
Santoro M. and Bolen D. (1988) Biochemistry 27:80638068.[CrossRef][Medline]
Schimmel P.R. and Flory P.J. (1968) J. Mol. Biol. 34:105120.[CrossRef][Web of Science][Medline]
Stites W.E. and Pranata J. (1995) Proteins 22:132140.[CrossRef][Web of Science][Medline]
Street A.G. and Mayo S.L. (1998) Fold Des. 3:253258.[CrossRef][Web of Science][Medline]
Vieille C. and Zeikus J.G. (1996) Trends Biotechnol. 14:183190.[CrossRef][Web of Science]
von Heijne G. (1991) J. Mol. Biol. 218:499503.[CrossRef][Web of Science][Medline]
Watanabe K. and Suzuki Y. (1998) J. Mol. Catal. B Enzym. 4:167180.[CrossRef]
Watanabe K., Masuda T., Ohashi H., Mihara H., Suzuki Y. (1994) Eur. J. Biochem. 226:277283.[Web of Science][Medline]
Yun R.H., Anderson A., Hermans J. (1991) Proteins 10:219228.[CrossRef][Web of Science][Medline]
Received February 16, 2006; accepted February 17, 2006.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
K. Takano, R. Higashi, J. Okada, A. Mukaiyama, T. Tadokoro, Y. Koga, and S. Kanaya Proline Effect on the Thermostability and Slow Unfolding of a Hyperthermophilic Protein J. Biochem., January 1, 2009; 145(1): 79 - 85. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



