PEDS Advance Access published online on January 18, 2008
Protein Engineering Design and Selection, doi:10.1093/protein/gzm082
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
The Trp-cage: optimizing the stability of a globular miniprotein
Department of Chemistry, University of Washington, Seattle, WA 98195, USA
1 To whom correspondence should be addressed. E-mail: andersen{at}chem.washington.edu
| Abstract |
|---|
|
|
|---|
The Trp-cage, as the smallest miniprotein, remains the subject of numerous computational and experimental studies of protein folding dynamics and pathways. The original Trp-cage (NLYIQWLKDGGPSSGRPPPS, Tm = 42°C) can be significantly stabilized by mutations; melting points as high as 64°C are reported. In helical portions of the structure, each allowed replacement of Leu, Ile, Lys or Ser residues by Ala results in a 1.5 (±0.35) kJ/mol fold stabilization. No changes in structure or fluxionality of the core results upon stabilization. Contrary to the initial hypothesis, specific Pro/Trp interactions are not essential for core formation. The entropic advantage of Pro versus Ala (

SU = 11 ± 2 J/mol K) was measured at the solvent-exposed P17 site. Pro–Ala mutations at two of the three prolines (P12 and P18) that encage the indole ring result in less fold destabilization (2.3–3.4 kJ/mol). However, a P19A mutation reduces fold stability by 16 kJ/mol reflecting a favorable Y3/P19 interaction as well as Trp burial. The Y3/P19 hydrophobic staple interaction defines the folding motif as an 18-residue unit. Other stabilizing features that have been identified include a solvent-exposed Arg/Asp salt bridge (3.4–6 kJ/mol) and a buried H-bonded Ser side chain (
10 kJ/mol).
Keywords: buried polar sidechain/fast-folding miniprotein/hydrophobic core formation/motif minimization/proline-tryptophan interactions
| Introduction |
|---|
|
|
|---|
We have previously reported the design of a 20-residue peptide, TC5b (NLYIQ WLKDG GPSSG RPPPS), which adopts the Trp-cage fold (Fig. 1) (Neidigh et al., 2002
|
It was posited that the key stabilizing features of the Trp-cage are: the burial of the Trp side chain as part of the hydrophobic core, the minimization of
SU (associated with the inclusion of four prolines in the sequence) and an Asp9/Arg16 salt-bridge interaction. As the smallest fully protein-like polypeptide, TC5b has emerged as a paradigm for studying the ability of molecular dynamics to simulate protein folding (Simmerling et al., 2002
Mutations at Trp6 established that the Trp side chain is required for fold stabilization: (W6H)- and (W6F)-TC5b mutants were completely and largely unfolded in water (Barua and Andersen, 2002
). In the case of the W6F mutant, fold population estimates could be obtained in both water and 30% TFE and the extent of fold destabilization was the same in the two media, 
GF = 12.5 ± 0.6 kJ/mol. The indole ring can be replaced by a naphthalene ring with some degree of success: both regioisomers, the (W6
Nal)- and (W6βNal)-TC5b mutants, were fully folded in aqueous TFE. However, the naphthylalanine analogs were only 50%-folded in water, compared with circa 98% for TC5b (
GF = 9 kJ/mol). Trp association with DPC micelle head groups affects unfolding of Trp-cage structures (Neidigh et al., 2001
; Neidigh and Andersen, 2002
). These studies indicate that indole ring burial may contribute up to 12 kJ/mol of the driving force for Trp-cage formation. It was also noted that the W6H mutant and a C-terminally truncated peptide (NLYAQ WLKDG GPSK), which have the same N-terminal helical segment as TC5b but cannot form the Trp-cage, display only modest levels of helicity (Barua and Andersen, 2002
). This suggested that Trp-cage formation provides stabilization to the helix and that helix stabilization contributes to Trp-cage formation. The latter has become the basis of a novel method for determining helix propensities based on the Trp-cage scaffold (Lin et al., 2004
).
The Trp-cage sequence was derived starting with the sequence of exendin-4, a naturally occurring 39-residue peptide displaying a partially formed Trp-cage and a tendency toward aggregation in water (Neidigh et al., 2001
; Hudson and Andersen, 2004
). Synthetic exendin-4 (exenatide, Amylin Pharmaceuticals) is now an approved treatment for diabetes. Three serial truncations and mutations at five sites produced the 20-residue TC5b system (Neidigh et al., 2002
). Since then, data for 27 Trp-cage mutants have been published (Barua and Andersen, 2002
; Neidigh and Andersen, 2002
; Hudson and Andersen, 2004
; Lin et al., 2004
; Liu et al., 2004
; Bunagan et al., 2006
; Naduthambi and Zondlo, 2006
).
In the present account, we present previously unpublished details regarding the design of the Trp-cage and mutations that test the hypotheses regarding its stabilizing features, namely the relationship between intrinsic helix stability and overall fold stability, the entropic effect of the prolines on the latter, the contribution of a D9/R16 salt bridge and the effects of stabilization of local and secondary structural features on the folding scenario. Additional goals were to produce even smaller constructs that would still fold yielding the same hydrophobic core and to optimize the Trp-cage sequence to make constructs with increased fold stabilities suitable for pharmocophore display in drug discovery studies.
| Results |
|---|
|
|
|---|
Additional Trp-cage mutants (bringing the total number of reported sequences to 64, of which 11 have been reported from other laboratories) were synthesized for the present study. At this point, mutations at all positions in the original sequence, except the three glycines, have been examined. Of those retaining the Trp residue, all but five adopt the Trp-cage fold as the predominant conformation in water. As in previous studies, there is a remarkable consistency in both the chemical shifts (Supplementary Table available at PEDS online) and long-range NOEs observed for these species under conditions that favor folding (pH 6–7, 280 K). As a result, we provide detailed chemical shift assignments for only four analogs (Supplementary data available at PEDS online) and an NOE-based structure determination for only a single stabilized species. The large structuring shifts, due to ring-current effects in the hydrophobic core, are tracked as chemical shift deviations (CSD =
obs –
rc, where
obs is the observed chemical shift and
rc is a reference random coil chemical shift (Wishart and Sykes, 1994
sites within the helix (Y3, Q5, W6, K8); the sum of these four CSDs provides a measure of the extent of helix formation.
The Gly11 methylene is a particularly diagnostic site. The diastereotopic 
for the Gly11 methylene (G11
H
= 2.3–2.55 p.p.m., for species judged to be >92% folded) appears to be a sensitive probe of both structure and fluxionality. G11
H
is maximal for the more stable analogs in water, with G11H
3 displaying upfield CSDs <1.01 p.p.m. (versus 3.20–3.46 p.p.m. for G11H
2). In almost all cases, G11
H
decreases in the TFE containing medium and upon mutational destabilization of the Trp-cage structure. These changes predominantly reflect a decrease in the ring-current shielding of the more shielded G11H
2 and, in many cases, a further upfield ring-current shift of G11H
3. G11H
3 also displays unexpected melting behavior (vide infra). Both effects can be rationalized by increased backbone fluxionality with some motional averaging of the G11-CH2 ring-current shifts. As a result, we only use the CSD of 11H
2 to monitor Trp-cage formation.
Measuring fold populations and melting curves
The stabilities of the Trp-cage constructs reported herein are measured using NMR determinations of the extent of amide NH exchange protection afforded by folding and by NMR and CD-monitored melting studies. Typical NMR melting data (chemical shift versus temperature) for a stable and less stable Trp-cage appear in Fig. 2. In the NMR melting studies, the fold populations are obtained from CSDs. Each site experiencing a structuring CSD provides a site-specific melting temperature (Tm, defined as the temperature at which the chemical shift for that site is exactly halfway between the folded and statistical coil value). In the present, as well as previous studies (Neidigh et al., 2002
; Lin et al., 2004
), there is excellent agreement between melts for sites throughout the structure. This is the case for the numerous upfield shifted sites as well as those displaying downfield ring-current shifts such as the P12
, P12β3 and R16
protons. The only exceptions to uniform melting throughout the sequence are the P12
3 and G11
3 sites, vide infra.
|
The largest CSDs in aqueous buffer were typically observed for TC10b (DAYAQWLKDGGPSSGRPPPS, see Table I) at 280 K at pH 7, conditions which yielded amide NH exchange protection factors (PF) greater than 100 (vide infra); as a result, these CSDs were employed as folded-state reference values, indicative of a 99.5%-folded population (Lin et al., 2004
U) for each construct were calculated as follows: |
|
|
Fraction-unfolded values are plotted against temperature to give melting curves, from which Tm's were calculated. For those few constructs that displayed CSDs larger than those for TC10b at some sites, the CSDs at 280 K and pH 7 for that construct were equated with a 99%-folded population for calculating
U at higher temperatures.
Since the agreement between individual site-specific measures of folding is excellent, we rely on sums of CSDs (Neidigh et al., 2002
; Hudson and Andersen, 2004
; Lin et al., 2004
) for the Tm's reported herein. For each construct, two Tm-measures, cage and helix, were calculated. The cage Tm's (Tmcage) are derived from the sum of the CSDs of five protons (L7
, P18
, P18β3, P19
3 and P19
2) experiencing large upfield ring-current shifts associated with the indole ring of Trp6. The G11
2 proton has an even larger structuring CSDs (circa 3.4 p.p.m.) associated with cage formation, but CSDs of G11
2 are not available for all mutants since this signal undergoes extensive line broadening due to exchange as the population of the unfolded state increases. The G11
2 signal becomes too broad to be observed in 2D NMR spectra at high temperatures in some constructs. When available, the G11
2 CSD melt provides essentially the same Tm values as the other cage-monitoring signals used in this study. The helix Tm's (Tmhelix) monitor helix formation and were obtained from the CSDs of H
sites within the helix. For the vast majority of systems examined, these two measures of the melting behavior of Trp-cage constructs gave very similar results (Fig. 3). The few exceptions to this rule will be discussed in the later sections.
|
Optimization of the Trp-cage
A series of alanine substitutions were performed in the helical portion of the TC5b sequence with two different N-capping residues, Asn1 and Asp1. The effects of these mutations on fold stability, as monitored by CSD melts, are shown in Table I. Similar melting behavior is revealed by CD spectroscopy, with residue-molar ellipticities at the 222.5–223.6 nm minimum ([
]223) used to monitor helicity (see, Methods).
The L2A, I4A and K8A mutations increased fold stability, whereas the Q5A and L7A mutations resulted in somewhat and significantly lessened fold stability, respectively. Since the double mutation, L2A and I4A, resulted in constructs with highly stable folds (TC9b and TC10b) without altering outwardly directed polar groups, all subsequent mutations were carried out with the TC9b or TC10b sequences instead of the TC5b sequence. The increase in overall fold stability due to the stabilizing Ala mutations and the Ala
Aib (
-aminobutyric acid) mutations [Aib also increases the intrinsic helix propensity (Karle and Balaram, 1990
; Andersen and Tong, 1997
)] indicate that the intrinsic stability of the helical portion is coupled (Lin et al., 2004
) to the net fold stability.
Throughout, the constructs with an Asp N-cap were more stable than those with the Asn N-cap. The stability differences associated with the N-capping residue (Asp1 versus Asn1),
Tmcage = 4–8°C and
Tmhelix = 5.5–9°C, can be attributed to the greater helix N-capping propensity of Asp (particularly in its ionized state) (Doig and Baldwin, 1995
; Andersen and Tong, 1997
). Likewise, the decreased stability of the D1G mutants (TC11b1 and its Y3F mutant) reflects the lesser N-capping efficiency of glycine (Lin et al., 2004
). The destabilization (
Tm = –8.9 ± 1.8°C) observed for the Q5A mutation may be attributable to a helix-stabilizing QxxxD hydrogen-bonding interaction between the Gln5 and Asp9 side chains (Huyghues-Despointes et al., 1995
). Additional destabilization is observed upon acidification (
Tmcage = –14°C, for the Q5A mutant, –17.4 ± 1.0°C for the other systems in Table I).
Representative chemical shift melts for helix mutants appear in Fig. 3. With a few exceptions, the melting temperatures obtained from cage measures and helical measures agree within 5–6°C of each other for these constructs, implying that helix formation and formation of the tertiary structure are cooperative.
The two most destabilizing Ala substitutions in Table I were at Y3 and L7, sites that are within, or flank, the hydrophobic core of the Trp-cage. The specific implications of these large effects (
GU280 = 13 and 11 kJ/mol, respectively) appear later.
Structural characterization of TC10b
Of the stabilized constructs in Table I, TC10b was selected for more extensive structural and folding thermodynamics evaluation. An NMR structure ensemble for TC10b (Fig. 4a) was generated from 186 NOE distances, of which 28 were long-range (i/i+n, n > 4) constraints. Over the accepted structures (28 structures from 40 random starting points) in the ensemble, the pairwise RMSD for residues 3–19 was 0.41 (±0.14) Å for the backbone atoms and 1.02 (±0.17) Å for all heavy atoms. The acceptance criteria and the method employed for generation of constraints appear in the Methods section. The structural and refinement statistics, the constraints employed and the dihedral angles observed in the ensemble appear in the Supplementary data available at PEDS online.
|
All of the TC10b structures in the ensemble displayed the full set of characteristic Trp-cage features that were reported for TC5b: an
-helix from Ala2 to Lys8, a short 310-helix (residues 12–14), an indole-NH
1 hydrogen bond to the i + 10 (Arg16) backbone carbonyl and the placement of Pro12 and Pro18,19 flanking and directly above, respectively, the indole ring, with the Tyr3 ring completing the hydrophobic cluster. In our prior reports of Trp-cage structural features, the exchange protection at Gly11-NH was attributed to a Gly11-NH
O=C-Trp6 H-bond which was present in most members of the TC5b and exendin-4 structure ensembles. In the TC10b ensemble, Gly11-NH
(O=C-Trp6 and/or O=C-Leu7) H-bonds were present in 22 of the 28 structures. A comparison of the NOE-derived structures of TC10b and TC5b (Fig. 4b) reveals ensembles that are not significantly different: the inter-ensemble pairwise residues 3–19 backbone RMSD between TC5b and TC10b was 0.68 (±0.12) Å, which is only slightly larger than the corresponding intra-ensemble RMSDs, 0.39 (±0.12) and 0.41 (±0.14) Å, respectively for TC5b and TC10b. Subsequent studies (to be reported elsewhere) have revealed that as few as 18 long-range constraints, with the addition of 36 medium- and short-range constraints, can generate Trp-cage structure ensembles within a 0.63 ± 0.17 Å backbone RMSD of the published TC5b ensemble.
We turned to 1H/2H NH-exchange experiments to confirm the greater stability of TC10b that was apparent in melting curves and to determine the folding thermodynamic parameters at 280 K. At pD = 5.0, the PF for the backbone NHs of residues A4, L7, K8 and D9 ranged from 101.48–102.01, with slightly larger PFs (102.51 and 102.16, respectively) for the backbone NH of G11 and NH
1 of W6 corresponding to folded-fraction (
F) estimates of 0.997 and 0.993, respectively. At pD = 5.5, the PF for G11 NH and W6 NH
1 were 102.51 and 102.11 respectively; at pD = 6.6, we could only obtain an accurate rate for W6 NH
1, yielding a PF of 102.14. The thermodynamic stability (
GU280) for TC10b calculated from the PF is +12.2 ± 1.1 kJ/mol. CD-monitored melts provide an alternative measure of the thermodynamics of unfolding; a two-state fit assuming a temperature invariant
Cp (see Methods) affords
GU298 = 5.56 ± 0.11 kJ/mol:
HU298 = 65 ± 2 kJ/mol,
SU298 = 198 ± 8 J/mol K and
Cp = –220 ± 110 J/mol K.
Probing origins of the pH effect on fold stability
A comparison of the Tm's at pH 7 and 2.5 in Table I shows that all of the Trp-cage constructs are significantly more stable at pH 7 (
Tm
7°C). Chemical shift melts illustrating this pH effect appear in Fig. S1 (Supplementary data available at PEDS online). CD-detected melts and pH titrations gave corroborating results (Fig. 5). The mid-point (circa pH 3.6) of the titration curve for TC5a, Fig. 5a, corresponds to the pKa of an Asp (or a C-terminal carboxylate). The complete CD melts for TC5a at pH 7 and 2.5 are shown in Fig. 5b, which also includes the common unfolded baseline observed in the presence of a chemical denaturant (see Methods) and the CD melt for TC10b and its less stable L7A mutant at pH 7.
|
For constructs that have an Asp N-cap, a host of pH effects can be suggested: (a) protonation at the D9/R16 salt bridge, (b) the N-capping capability of Asp is highly pH dependent (NAsp0 = 2.8 versus NAsp– = 6.5) (Andersen and Tong, 1997
In order to reduce the number of potential pH effects, further mutations (Table II) were performed with Asn, rather than Asp, as the N-cap. The aim was to clarify the role of the QxxxD and D9/R16 interactions in the pH dependence of fold stability. In this series, a Q5A mutation would eliminate the helix-stabilizing QxxxD interaction but still allow salt-bridge formation, whereas an R16Nva (Arg
norvaline) mutation would allow the QxxxD interaction but eliminate any salt bridging possibility. A D9N mutant with and without an amidated C-terminus was also examined. Throughout this series, both NMR and CD (data not shown) reveal similar melting behavior. Table II lists the NMR-derived melting points for these constructs at pH 7 and 2.5. Table II also includes 
GU measures of the effect of acidification for each construct and the 
GU associated with each mutation.
|
Destabilizing effects of truncations and other single-site mutations
The effects (typically destabilizing) of other mutations, and further truncation, of the Trp-cage are collected in Table III. These include mutations at all of the proline sites and also additional mutations in the loop region that connects the N-terminal
helix and C-terminal polyProII helix.
|
Mutations at proline sites Proline residues often stabilize the folded states of proteins due to a reduction in the loss of backbone conformational entropy upon folding (Matthews et al., 1987
Defining the minimum sequence length of the Trp-cage
Deletions at both the N-terminus and the C-terminus of the sequence were examined (Table III). The deletion of the N-terminal helix-capping residue produced a construct (TC12a) that was <40%-folded at 280 K in aqueous buffer at pH 7. However, deleting Ala2 rather than Asn1 of TC9b (and thus retaining a helix N-cap) afforded a folded 19-residue system (TC12b, 84%-folded at 280 K). Another construct was prepared by the truncation of the sequence at the C-terminus, with the Ser20 replaced by an amide function (tr10b). This construct was
97%-folded at 280 K with a Tm of 53°C, indicating a circa 1 kJ/mol stability loss. These two truncations were then combined in the same sequence to prepare an 18-mer (tr12b) that was found to be 73%-folded at 280 K.
Probing other features of the trp-cage
We examined an R16K mutation as an alternative to the D9/R16 salt bridge which would be less likely to display alternate cation/
interactions with the indole ring. This mutation reduced fold stability (
GU280 = 2.9 kJ/mol) less than the R16Nva mutation. As it turns out (see Discussion), this effect may be related to interactions with residues in the 310-helix. The loop region between the N-terminal helix and the C-terminal RPPPS sequence consists of a short 310-helix between residues 12–14. Of these residues, Pro12 forms one edge of the Trp-cage hydrophobic cluster, whereas the potential roles of the other two residues, Ser13 and Ser14, in the short 310-helix were not readily apparent from the NMR structures. The Ser13 side chain is fully solvent exposed in both of our Trp-cage structure ensembles; in analogy to
helix sites, the S13A mutation was fold stabilizing (by 1.8 kJ/mol).
In contrast, the S14 side chain is internalized in all of our structure ensembles. Moreover, Neidigh reported (Neidigh, 1999
) that the NOESY and TOCSY spectra of (S14T)-TC5b in pH 7 aqueous buffer (with 30% TFE added) display a prominent threonine hydroxyl proton resonance indicating that this proton is protected from exchange with the solvent. However, (S14T)-TC5b was too unstable for NMR characterization in the absence of TFE. To probe the interactions of the S14 side chain, we prepared the S14A of TC10b and S14T/S14allo-Thr mutants of TC9b. The allo-Thr mutation was fully tolerated with no apparent loss (
0.7 kJ/mol) in cage stability; the S
T mutation was slightly destabilizing (
GF280 = 2 kJ/mol); but the S14A mutation, with a 
GF280 of 8.4 kJ/mol, was the most destabilizing effect observed for any residue that was not an obvious apolar component of the hydrophobic core.
| Discussion |
|---|
|
|
|---|
The Trp-cage structure is retained in both optimized and truncated sequences
The sequence of the Trp-cage can be optimized for fold stability without causing any significant structural changes. The NMR ensembles generated for TC5b (Tm = 42°C) and TC10b, which bears three stabilizing mutations N1D/L2A/I4A (Tm = 58 ± 2°C), are identical to within a 0.6 Å backbone RMSD. A few detectable differences in the residues 12–17 span are evident by examining the backbone
/
values (Supplementary Table available at PEDS online); given the sparseness of NOE constraints in this loop, these minor differences may not be significant. TC10b also retains all of the notably ring-current shifted signals of TC5b and these hardly increase in magnitude due to the stabilizing mutations: accounting for the unfolded fraction of TC5b present (estimated from the NH protection data), the sum of the structuring shifts for the folded state of TC5b is >95% of the sum of maximum CSDs at each position observed in higher melting constructs. We view this as evidence that the less stable TC5b structure does not show dramatically enhanced structural fluxionality in the folded state. MD simulations starting from the Trp-cage fold (Fesinmeyer, 2005
), which will be reported elsewhere, reveal a host of collapsed, near-native folds (backbone RMSD = 0.8–2 Å versus the NMR structure) that do not reproduce the chemical shifts observed. Structures with 1–2 Å C
or backbone RMSDs from the NMR structure, which also fail to predict the observed ring-current shifts, have been equated with the folded state in a number of literature accounts of computational Trp-cage folding simulations.
The 19-mer and 18-mer constructs that were obtained by the truncation of the TC10b sequence also displayed the characteristic long-range NOEs and the CSDs associated with the Trp-cage constructs. For example, the ratios of the ring-current shifts at G11H
2/P18H
and P18Hβ3/P19H
2 (1.53 and 3.12, respectively) in the 18-mer were identical (±0.02) to those observed for TC10b with the absolute values indicating a 0.72 ± 0.02-fold-population for the 18-mer at 280 K. This implies that truncation did not lead to any significant structural changes; rather
GF became less negative. As a result, we view the Trp-cage as an 18-residue folding motif.
Confirmation of key assumptions about the Trp-cage fold
Except for Y3A, Q5A, L7A and presumably W6A [not examined since W6F is highly destabilizing (Barua and Andersen, 2002
)], mutations to alanine in the N-terminal helix of the Trp-cage results in improved fold stability. This suggests that these side chains are involved in specific fold-stabilizing interactions. The L7A construct is only 55%-folded at 280 K in aqueous buffer; it appears that Leu7 (sandwiched between the Tyr3 and Trp6 side chains) is an important contributor to the burial of the indole ring. Interactions within the helix may be responsible for the decrease in stability associated with the Q5A mutation: Gln–Asx (i, i + 4) hydrogen-bonding interactions have been reported to occur in helices (Huyghues-Despointes et al., 1993
; Stapley and Doig, 1997
). It has been reported that this interaction is more stabilizing when the Gln is N-terminal to the Asx, particularly for an ionized Asp (Huyghues-Despointes et al., 1995
). The very large Y3A effect (
14 kJ/mol) is attributed primarily to favorable Y3/P19 hydrophobic interactions, vide infra.
The introduction of N9D and A16R mutations during the initial Trp-cage optimization procedure had a profound effect leading to the D9/R16 salt-bridge hypothesis (Neidigh et al., 2002
). A D9/R16 salt bridge has also been a prominent feature in most of the MD folding simulations reported for the Trp-cage, either as an interaction that was viewed as facilitating rapid folding (Snow et al., 2002
; Chowdhury et al., 2003
; Nikiforovich et al., 2003
) or as a kinetic trap (Zhou, 2003
; Ding et al., 2005
; Linhananta et al., 2005
). However, the NMR structure ensembles fail to provide a well-defined H-bonded salt-bridging geometry. This reflects the apparent absence of side chain/side chain NOEs across the bridging sites. Inserting a restraint that keeps the D9/R16 side chain termini within interaction distance, established that a better defined salt bridge is fully consistent with the observed NOEs (vide infra).
Our efforts to rationalize the substantial pH effects on Trp-cage fold stability [initially reported (Neidigh et al., 2002
) to be 5.8–7.5 kJ/mol] focused on mutational effects in the QxxxDxxxxxxR unit of Trp-cages, and the effects of C-terminal amidation (Table II) and how these alter the fold population dependence on pH. The most instructive mutation was the replacement of Arg16 with norvaline (the des-guanidino equivalent). The fold-population of the R16Nva mutant of TC9b was pH-independent (
F = 0.56 at 300 K, versus 0.86 for TC9b at pH 7). Assuming that D9/R16 interaction is the only fold-stabilizing interaction involving the guanidino group, implies a 4–5 kJ/mol fold stabilization associated with the salt bridge. This result also suggests that the helix-stabilizing QxxxD interaction is not pH-dependent in our construct. The data for the Q5A and D9N mutants were less clear-cut. The D9N mutation produced a smaller destabilization (
GF280 = 3.4 kJ/mol) and the mutant retained some pH dependence to its fold stability (
2 kJ/mol), even though the C-terminal carboxyl is the only ionizable site. Indeed, terminal amidation reduced fold stability by 1.5 ± 0.5 kJ/mol. In sum, based on the R16Nva, Q5A, and D9N mutations, the D9/R16 salt-bridge stabilization is bracketed in the 3.4–6 kJ/mol range. There may also be a stabilizing Coulombic interaction between the C-terminal carboxylate and the N-terminal ammonium ion in the Asn-capped sequences, which is masked in species with Asp as the N-cap. Assuming no other Q5 side-chain interactions that affect fold stability, the stability decrease for (Q5A)-TC9b (2.4 kJ/mol) represents a measure of the free energy contribution of the QxxxD H-bond to helix stability. The validity of any of these estimates is also complicated by the general concern that this type of mutational analysis cannot factor out all context effects and thus attribution to particular side chain/side chain interactions is always less than fully justifiable.
In our initial report, we posited the inclusion of four prolines in the TC5b sequence as a significant factor contributing to the unexpected fold stability of this 20-residue sequence. The importance of the entropic effect of the proline residues on Trp-cage fold stability was confirmed by mutations of the P17 residue. The substitution of P17 by either Ala or Gly resulted in less folded constructs, with 
GF280 estimated as 3.6 and 6.2 kJ/mol, respectively; the reduction in fold stability associated with mutations at this site is attributed solely to increases in
SU. From the protein stability data of Matthews et al. (1987)
, the expected magnitude of the
SU effects of a P
A and P
G mutation are 3.5–6 and 5.5–9 kJ/mol, respectively, at this temperature. The agreement between our mutational results at P17 and the expectation values for entropic effects in proteins is excellent. However, extension of this conclusion to the other proline sites in not straight-forward.
Does the Trp-cage provide evidence for uniquely stabilizing Pro/Trp or Pro/Tyr interactions?
The locations of the apolar ring portions of P12 and P18, in tight association with the central indole ring of the hydrophobic core, and some structural analogies to geometries observed in the complexes of polyPro peptides and protein domains that recognize such sequences led us to suggest (Neidigh et al., 2001
, 2002
) that these represented particularly favorable interactions. Thus, we were surprised to discover that mutations at these two prolines were tolerated. The first mutation examined was P18L, based on the notion that the aliphatic chain of Leu might be a reasonable replacement for the proline ring placed immediately above the indole ring (Fig. 4). The P18L mutation was only slightly more destabilizing (
GF = 5.7 kJ/mol) than the P17A effect attributed to the 
SU effect. However, Pro
Ala mutations at either (or both) Pro12,18 were distinctly less stabilizing than the P17A mutation. In both (P18A)-TC10b and the double mutant, the diagnostic ring-current shifts of the Trp-cage are retained; the CSD at 18H
(as large as –2.6 p.p.m.) was in fact greater in the P18A mutants than when this site is a proline (H
CSD = –2.3 in TC10b). The Ala18 alpha methine appears to be even closer to, and centered within the shielding cone of, the indole ring. The P19H
2 site also experiences a larger upfield shift. No other changes in ring-current shifts are apparent, for example, the CSDs of L7H
, G11H
2, G11H
3 and R16H
agree within 0.02 p.p.m. for TC10b and its P18A mutant. The effect of the P18A mutation (
GF280 = 2.3 kJ/mol) must, therefore, be rationalized as a minor, enthalpically favorable change in the docking of the residues 17–19 segment onto the indole ring that compensates for the 
SU effect. This result and the well-tolerated P12A mutation raise the possibility that the solvent shielding of the indole ring (and the resulting favorable enthalpic contribution of Trp-cage formation) may largely be due to interactions between the ring and the peptide backbone. [This is similar to aryl rescue of glycine at β-sheet sites (Kemmink et al., 1993
; Merkel and Regan, 1998
; Jackups and Liang, 2005
) which also requires an unsubstituted Gly.] It should be noted, however, that the 
SU effect could be muted for a PPP
PAP mutation due to a narrower unfolded state
/
distribution at the central alanine in a sequence with a significant preference for a polyProII conformation.
In distinct contrast, the P19A mutation reduces fold stability by
14 kJ/mol. Pro19 displays interactions with both Trp6 and Tyr3. The Y3/P19 interaction, evidenced by numerous side chain/side chain NOEs and a strong Y3
/P19
connectivity, defines the limits of the 18-residue Trp-cage motif and completes the encaging of the indole ring. Although we had not anticipated it, the observation that the favorable enthalpic interaction with the greatest contact order is also the interaction with the largest effect on thermodynamic stability seems entirely reasonable: it can be viewed as a hydrophobic staple holding the Trp-cage structure together. We also probed the Y3/P19 interaction by examining Y3L and Y3A mutations. The Y3L mutation was only modestly destabilizing (
GF280 = 4.4 kJ/mol), and a L3/P19 interaction was evidenced by stereospecific side chain/side chain NOEs. The Y3A mutation was nearly as destabilizing (
GF280
13 kJ/mol) as the P19A mutation. Taken together, these observations suggest that the Y3/P19 interaction is indeed hydrophobic in nature and the major contributor to the cage stability differences observed for both the Y3A and the P19A mutations. Y3 and P19 also make contacts with the W6 side chain, but the data do not allow the parsing of the mutational 
GF data into components reflecting three possible interactions (Y3/W6, Y3/P19 and W6/P19).
The contribution of buried H-bonds
The NH exchange protection due to two buried long-range H-bonds, Trp6 indole-NH
1 to O=C-Arg16 and Gly11-NH
(O=C-Trp6 and/or O=C-Leu7), is particularly diagnostic of Trp-cage formation; to these we can now add H-bonding interactions of the buried Ser14 side chain hydroxyl. Although the contribution of H-bonding to protein fold stability is generally viewed as rather small, on the order of 5 kJ/H-bond or less (Baldwin, 2003
), reflecting the compensating effect of H-bonding interactions with water in the unfolded state, it is clear that virtually all polar backbone group of proteins are H-bonded; one estimate of the cost of breaking a solvating H-bond without replacing it with an intermolecular one is
20 kJ (Fleming and Rose, 2005
). The mutations of S14 and R16 reported in Table III were, in part, based on observations made by computational chemists examining the Trp-cage motif. Pitera and Swope noted that the S14 hydroxyl was positioned for H-bonding (to the backbone carbonyls of G10 or G11) in their simulated folded state geometries. In the NMR structure ensemble of TC10b, based strictly on NOE connectivities, S14H
is within H-bonding distances of either the Asp9 or Gly11 backbone carbonyls in most of the structures. Simmerling and Roitberg (personal communication) brought another feature, an Arg-NH
to Ser-O
H-bond (Fig. S2, Supplementary data available at PEDS online) to our attention; such an interaction would be expected to increase the strength of any H-bond involving S14H
. We examined an R16K mutation to probe this region; the Lys side chain could still have a stabilizing Coulombic interaction with D9, but no side-chain/side-chain hydrogen bonding interaction with Ser would be expected. Consistent with an Arg16-NH
H-bonding interaction, the R16K mutation was destabilizing (3 kJ/mol), but other rationales are available.
The S14 mutational studies support an H-bonded status: both Thr and allo-Thr were well tolerated at this position, whereas the S14A mutation (
GF280 = 8.4 ± 0.4 kJ/mol) was the most destabilizing effect observed for any residue that was not an obvious, apolar contributor to the hydrophobic core. Furthermore, in aqueous pH 7 buffer, the diastereomeric S14T mutants display a side-chain hydroxyl proton in the 2D-NMR experiments [A search of the BMRB database reveals that Ser-OH groups are observed in only 2.1% of the proteins examined and that these observations are restricted to systems with large hydrophobic cores. The statistics for observing Thr-OH are only slightly more favorable (3.8%). Hevein, a tetra-disulfide 43-mer (Cao, 1993
; Asensio et al., 1995
), is, to our knowledge, the smallest protein to display side-chain OH signals in its NMR spectra under the conditions routinely used for protein resonance assignment. Alternative experiments that allow even random coil Ser and Thr OHs to be observed are available (e.g. Liepinsh et al., 1992
).], indicating an equivalent and significant degree of exchange protection for this proton. The internalized position of the Thr side chain in the S14T mutant is supported by close contact with the Asp9-CH2 evident as a very intense NOE with Thr methyl; the stereochemistry change at Cβ reduces the NOE intensity to the Asp9-CH2 and produces indole-CH to C
H3 NOEs that are not observed for the S14T mutant. Apparently, the Thr hydroxyl forms the same H-bond in both diastereomers, with the Cβ diastereotopic change only altering the location of the methyl group. Our estimate of the stabilizing effect of the Ser14 hydroxyl based on the S14A mutation includes the fold improvement expected for a Ser
Ala mutation within the 310-helix (1.8 kJ/mol for S13A); thus the measured buried H-bond effect is 10.2 ± 0.7 kJ/mol for Ser14 of the Trp-cage:
50% of the folding impairment expected for an internalized backbone amide lacking an H-bonding interaction (Fleming and Rose, 2005
).
The Thr and allo-Thr side-chain CSDs further corroborate the assignment and provided insights into the position of the Ser14 side chain in the Trp-cage. The Ser14 Hβ's have disparate CSD values (
Hβ3 = –0.393, 
Hβ2 = –0.012 in TC9b); CSDs as large as –0.45 have been observed for S14Hβ3 in more stable Trp-cage analogs. Within the IUPAC naming convention, Hβ of Thr corresponds to Ser Hβ2; Hβ3 in the case of allo-Thr: the Ser14 CSD values reflect the similar magnetic environments experienced by corresponding nuclei in allo-Thr14 Hβ (–0.490) and Thr14 Hβ (–0.005 p.p.m.). The methyl signals (H
2) show, as expected, the opposite trend - allo-Thr14 H
2 slightly downfield (+0.033) and that of Thr14 significantly upfield (–0.160 p.p.m.). These observations were used to test alternative computational models of the Trp-cage conformation.
The NMR ensemble of TC10b was calculated without H-bond restraints. Since neither the D9/R16 salt bridge nor a consistent pattern of H-bonds involving the Ser14 side chain appeared in this or the prior Trp-cage structure ensemble, we examined the extent to which specific H-bonding patterns are consistent with the NOE constraints and ring-current shift at the internalized S14 side chain (data not shown). With the exception of the S14H
O=C-Gly10 suggestion of Pitera and Swope, all
and 310-helical H-bonding networks as well as S14H
and R16H
H-bonding interactions could be accommodated. Added H-bond restraints (and distance restraints maintaining the Asp and Arg side chains within salt-bridging distance) did not introduce any significant increases in the violations of NOE-derived distance constraints; in fact, a number of modified CNS runs with a full complement of H-bonds specified resulted in decreased ENOE terms. The resulting models for TC10b are viewed as ones incorporating other NMR data besides the NOEs. The quality of ensembles calculated with alternative H-bond restraints was judged based on: ETOT and ENOE, the extent to which H-bonds displayed normal geometries, agreement with the 49 medium- and long-range NOEs, and compatibility with the S14 mutation data, see Supplementary data available at PEDS online. The lowest energy model resulted when salt bridging and 14H
-9O'/16H
-14O
/9HN-5O'/11HN-7O' H-bond restraints were applied. The resulting ensemble (intra-ensemble 3–19 backbone RMSD = 0.31 ± 0.12 Å) was still very similar to the ensemble generated with only NOE constraints (inter-ensemble RMSD = 0.47 ± 0.16 Å) but displayed improved energies versus the NOE-constraints-only ensemble and rationalizes all other NMR observations. However, distinguishing between a 14H
-9O' and 14H
-11O' H-bond is not really possible; as this corresponds to nothing more than a change in
2 of S14. Views of a representative member of this ensemble, as well as atomic coordinates of this model of the Trp-cage structure, appear in the Supplementary data available at PEDS online.
Cooperativity between helix formation and Trp-cage formation
The helix and cage melting temperatures of most of the Trp-cage constructs reported in this study are within 2–5°C of each other, implying cooperativity between helix formation and formation of the tertiary structure involving the hydrophobic cluster about the Trp side chain. The requirement of an efficient helix N-cap for cage formation also supports this hypothesis. Further evidence for this cooperativity came from the observation that stabilization of the helical part of the sequence resulted in improving overall fold stability (the L2/I4/K8
Ala and the A4
Aib mutations). Introduction of an alanine in the 310-helix (S13
Ala) also led to an increase in fold stability. The average stabilization effect, based on the changes in melting temperatures, was 1.5 ± 0.35 kJ/mol per alanine substitution.
All Trp-cage species bearing the D9/R16 unit with (Tmhelix–Tmcage) <7°C displayed two other consistent correlations: the Tm from the CD melt was closer to the Tmcage (rather than Tmhelix) value, and the (Tmhelix–Tmcage) difference increased by 1.5–2.5°C upon protonation of the Asp side chain. The better correlation between TmCD and Tmcage, rather than Tmhelix, can be rationalized as follows. NMR measures of helicity are sensitive only to local
/
values whereas the CD measure of helicity requires a longer contiguous string of helical
/
values. Apparently, when Asp protonation decreases the extent of cage formation; unfolded or partially folded states, which retain some local helicity, make a larger contribution to the NMR melts. A well-defined complete helix is present when the Trp-cage is folded; however, in the absence of Trp-cage formation (e.g. C-terminally truncated species and the W6F mutant of TC5b), only partial local helicity has been reported (Barua and Andersen, 2002
). To provide a measure of the intrinsic helicity of the Cap-AYAQWLADG unit in our better Trp-cage species in the absence cage formation, we compared the 13C shift melts of truncated species NAYA'QWLA'DGKK, A'=(13C=O)-Ala, in pH 7 buffer with and without 6 M urea added to that of fully folded DAYA'QWLA'DGGPSSGRPPPS. This indicated the maximal fH at the labeled alanines was 0.35 in the truncated species under the most favorable folding conditions: cage formation increases helix formation by at least a 12 kJ/mol 
G increment.
Turning to those Trp-cage analogs that display (Tmhelix–Tmcage)
7°C, these fall into two categories: (i) mutational or solvent effects that increase the intrinsic stability of the N-terminal
-helix with an opposite (or lesser) effect on the tendency for Trp-cage formation and (ii) mutation in the residues 14–19 span that destabilize the Trp-cage. Representatives of the first category are the introduction of Aib residues, N-terminal helix extensions (Lin et al., 2004
), the L7A mutation and the addition of 30% TFE. The TC10b mutants with the largest difference, (Tmhelix–Tmcage), were S14A-TC10b (
Tm = 11.5), L7A-TC10b (
Tm = 12) and P19A-TC10b (
Tm = 24°C). The helix and cage melts for L7A-TC10b are shown in Fig. 3. In contrast to the L2A substitution (
GF = –1.3 kJ/mol), mutation of the hydrophobic L7 side chain reduces the Trp-cage population to 0.56 at 280 K (
GF
+12 kJ/mol), whereas the helix H
sites report a Tm of 296 K. The latter represents partial formation of the N-terminal helix in states lacking the Trp-cage. Melting data comparisons for TC10b and its S14A and P19A mutants appear in Fig. 6. In the case of the P19A mutant at 280 K, the helix measures indicate a 60 ± 4%-fold population, even though the sum of the Pro18
/β3 CSDs indicate(s)
F = 0.17.
|
Partial formation of the N-terminal helix in unfolded or partially folded states that lack the complete Trp-cage (evident in Fig. 6a) also rationalizes the general observation that Tmhelix values increase more than Tmcage values on TFE addition. The extreme example of this is exendin-4 in 30% TFE: the PF of the backbone NHs of the residues in the helix C-terminus were as much as 500-fold larger than the PF obtained from the indole ring H
1 of Trp6; the latter, due to the W6H
1
O=C-R16 H-bond, reflects full Trp-cage formation. Figure 7a shows a melting (unfolding) scenario that rationalizes the data. With the long helix of exendin-4, there is essentially no (
U < 10–5) random coil state in 30% TFE at temperatures below 30°C and cage formation (Kcage = 260) can be viewed as the docking of the triPro unit onto the Trp and Tyr rings held in place on one side of a preformed helix. In systems with shorter helices, particularly those lacking helix stabilizing Aib or Ala substitutions, cage disruption (by melting or mutation) does not produce partially helical intermediates. In fact, TC5b in aqueous buffer has no additional protection for the backbone HN near the helix C-terminus and shows nearly two-state chemical shift and CD melting behavior. In optimized Trp-cage species with more stable helices, intermediate behavior is observed.
|
MD folding simulations for TC5b have indicated different sequences of events regarding the appearance of hydrophobic core features. Docking of the P17–19 unit onto the Trp side-chain figures in early hydrophobic collapse in some simulations (Snow et al., 2002
3 and P12H
3 protons, which is illustrated in the present cases by Fig. 6. This scenario can explain the persistence of helical populations as the C-terminal triPro unit undocks and provides a rationale for the effects of mutations in the residues 14–19 span that destabilize the Trp-cage but retain significant helicity.
Turning to the specific proton melting behavior that does not track the melting behavior of the proton shifts that are used as measures of Trp-cage folding, the upfield CSD of P12H
3 (for TC10b and other stable Trp-cage species) actually increases in magnitude as partial melting occurs (Fig. 6c) and G11H
3 displays little loss of its structuring CSD on partial melting. The P19A mutant provides a compelling example of similar behavior in response to a mutation that disfavors docking of the C-terminal segment; even though the Gly11
2 measure of cage formation and the sum of the Pro18
/β3 CSDs indicate
F = 0.39 and 0.17, respectively, the CSD at Gly11
3 is fully 72% of that observed for TC10b. These chemical shift melts and mutational effects can be rationalized as the consequence of the population, in the unfolded state, of a residual hydrophobic cluster consisting of the W6 and the G11/P12 residues interacting in a non-native orientation, the half-cage motif, that reduces the ring-current shift of G11H
2 (but not G11H
3) and places the P12-
CH2 further into the shielding region than in the full Trp-cage structure. The P12-
CH2 protons (particularly P12H
3) in many destabilized mutants display negative CSDs of greater magnitude than those observed for TC10b itself, suggesting that the population of the half-cage increases with the destabilization of the full Trp-cage.
Formation of the half-cage requires at least partial formation of the C-terminal portion of the helix with the W6/P12 interaction acting as a helix C-cap: both the (S14A)- and (P19A)-mutants display upfield shifts at G11H
3 that are larger than expected based on the CSDs observed at G11
2 and P18
. Further, when large P12H
3 CSDs are observed in mutants, they display a temperature dependence that correlates with CSDs that monitor helicity rather than cage formation. Evidence of the population of a discreet half-cage state in C-terminally truncated species and by the P19A mutant of TC10b (as well as other mutants designed to reduce the tendency toward complete Trp-cage formation) will be presented elsewhere.
An alternative rationale for the unusual melting behavior of the structuring shifts at P12H
3 and the two G11-CH2 resonances—increased G10–P12 loop fluxionality within the folded state at higher temperatures—deserves mention. Segmental motion of this type would, if the region with increased fluxionality in the high temperature folded ensemble extends to P18, rationalize much of the melting data. Mutational effects on cage stability imply that the P19/Y3 (andW6) interaction is particularly stabilizing; this may also be the more persistent hydrophobic interactions at higher temperature as we consistently find that P19
' CSD melts suggest a higher
F than the P18
/β3 and G11
2 CSDs at higher temperatures. More detailed studies of additional mutants will be required to support such a rationale. In their absence, the nearly constant CSD ratios for sites in L7, G10, G11, P12, R16 and P18 in the numerous analogs examined to date support a common non-fluxional fold and a nearly two-state folding process. Recent studies (Mok et al., 2007
) reveal retention of the W6/P12 (but not the W6/P18) interaction in the urea-denatured state and suggest partial hydrophobic collapse even in this unfolded state.
It is now established that the Trp-cage is an 18-residue motif. The NMR ensemble generated for the more stable TC10b analog displays dihedral angles (and H-bonding networks) that correspond more closely to secondary structure norms; in particular, the
angle values for residues 13–14 are closer to the standard 310-helix value (–26°), and the residues in the 16–19 span display
/
angles corresponding to a polyProII helix (–75°/ +145°). In the absence of salt-bridging distance restraints, the R16 side chain displays two
1 values and side chain arrangements only one of which is consistent with an R16/D9 ion-pairing interaction (Fig. 4a). Additional restrained MD trajectories (Supplementary data available at PEDS online) indicate that enforcing an R16/D9 ion-pairing interaction as well as a defined set of H-bonding interactions including the Ser14 hydroxyl and regularizing the H-bonding pattern within the
-helical segment yields a structural model that is fully consistent with all of the NMR observations. Our best model of TC10b appears in the Supplementary data available at PEDS online, with coordinates included so that these can serve as starting structures for initiating computational unfolding trajectories. We urge the use of the TC10b sequence for any further folding simulations and suggest that a backbone RMSDs <1 Å should be used to define the folded state; we have been unable to generate any structures that are consistent with the ring-current shifts observed for the folded state which differ from the NOE-derived structure by >0.9 Å.
Although the burial of the Trp side chain is clearly the dominant design feature of the Trp-cage motif, the present mutational study suggests that indole/backbone interactions rather than specific indole/proline ring interactions may be the key component. The high proline content of the Trp-cage clearly improves fold stability. The entropic advantage of Pro versus Ala (
SU = 11 ± 2 J/mol K) was measured at the solvent-exposed P17 site. The study has also revealed two unexpected contributors to
GF: a buried H-bonding Ser side chain (
GF
10 kJ/mol) and a Y3/P19 hydrophobic staple interaction. Concerning the latter, both P19A and Y3A mutations are destabilizing by >13 kJ/mol, but folding can be rescued with a Y3L mutation (
GF = 4.4 kJ/mol).
The data presented herein support a cooperative melting scenario for Trp-cage species; the chemical shift melts serving as the best experimental measure: uniform structure loss over probes throughout the structure (Ferguson et al., 2005
; Fesinmeyer et al., 2005
; Olsen et al., 2005
) requires folding cooperativity. In the present case, the excellent agreement between the observed structuring chemical shifts and those calculated for structure ensembles that are tightly converged about a single conformer lends additional validation to the structure.
The two-state character of Trp-cage folding transition gains support from a recent correlation of CD melts and differential scanning calorimetry (Streicher and Makhatadze, 2007
). Apparently contrasting observations or conclusions, however, deserve some comment. A number of the folding simulations (vide supra) show initial hydrophobic collapse that could be equated with a molten globule folding intermediate. A UV-resonance Raman study (Ahmed et al., 2005
) of the thermal unfolding of TC5b has been interpreted as evidence for non-two-state behavior and posits that the folded state becomes more compact on warming from 4°C to 20°C followed by rather gradual partial melting from 25°C to 70°C with the latter corresponding to the loss of only four helical amide units. To be consistent with the observed temperature dependence of chemical shifts that include large ring-current components, the more compact state formed at 20°C would have to have a more fluxional structure in which most of the ring-current shifts are significantly, and similarly, diminished in magnitude: a rather unlikely set of coincidences.
The few deviations from uniform loss of structuring CSDs upon melting or the introduction of destabilizing single-site mutations are fully rationalized by the folding scenarios in Fig. 7, which allow for partial local structure in the unfolded ensemble. The appropriate description of denatured and unfolded states of Trp-cage species remains an area of active exploration (Mok et al., 2007
), as does a clarification of the details of the folding pathway(s) for this ultrafast folding system. Two Trp-cage constructs were included in the development of a new model of ultra-fast folding (Ghosh et al., 2007
) which attributes folding acceleration to the existence of a multiplicity of parallel pathways to the folded state geometry. We continue to view the Trp-cage motif as an ideal model for exploring the correlation between protein folding simulations and experimental data. For its size, the system displays a sharp melting transition and the Trp-cage has local structuring propensities in the unfolded state that are similar to those observed in a number of proteins of interest to the protein folding research community. Reproducing, in silico, the mutational effects elucidated in this study should provide a stringent test of folding computation methods.
| Methods |
|---|
|
|
|---|
Peptide synthesis and purification
Peptides were synthesized on an Applied Biosystem 433A peptide synthesizer using standard Fmoc solid-phase peptide synthesis methods. The resins used for the synthesis were Wang resins preloaded with the C-terminal amino acid. C-terminal amides were prepared similarly but using Rink resins. Peptides were cleaved from the resin using a 95:2.5:2.5 trifluoroacetic acid (TFA):triisopropylsilane:water mixture. The cleaved peptides were purified by reverse phase HPLC on a Varian C18 prep-scale column using gradients of water/acetonitrile (having 0.1% and 0.085% TFA, respectively). Collected fractions were lyophilized and their identity and molecular weight confirmed using a Bruker Esquire Ion Trap mass spectrometer.
CD spectroscopy and melting analysis
Stock peptide solutions for CD experiments were prepared in 10 mM phosphate buffer at the appropriate pH. The concentration of the stock solution was determined by the UV absorption of tyrosine and tryptophan (
= 1190 M–1 cm–1 and 5580 M–1 cm–1, respectively at 278 nm). CD samples with concentrations of 20–30 µM were prepared by dilution of the stock solution. Spectra were recorded on a Jasco J715 instrument in cells with a path-length of 0.1 cm. Typical spectral accumulation parameters were a scan rate of 100 nm/min with a 2 nm band-width, and a 0.1 nm step resolution over the wavelength range 185–270 nm with eight scans averaged for each spectrum at temperatures ranging from 0°C to 90°C. The spectrophotometer was equilibrated at each temperature for 5 min before acquisition. The raw ellipticity data were converted into mean residue molar ellipticity units, (deg cm2)/(residue dmol) using standard Jasco software.
Typical changes in the CD signature of a Trp-cage that undergoes nearly complete melting over the 0–90°C range are shown in Fig. 8a.
|
CD melts are presented as plots of mean residue molar ellipticity at 222.5–223.6 nm ([
]223) versus temperature. The melting curves for TC10b (and additional analogs) is shown in Fig. 5, which also includes an unfolded reference line. The 100%-folded baseline was obtained by assuming a temperature gradient (
[
]223/
T) that was 0.32% of the 100%-folded ellipticity. The latter was obtained from the lowest temperature [
]223 observation adjusting it to 100%-folded based on the NMR measures (CSDs) of
F. The temperature dependence of [
]223 in 7 M GdmCl was determined for three Trp-cage species, TC5b, TC10b and its less stable S14A mutant. For the less stable species, TC5b and (S14A)-TC10b, [
]223 versus T (°C) was linear to within experimental error: –661 – 29.6·T and –997 – 26.3·T, respectively. The corresponding plot for TC10b displayed slight curvature which could be rationalized as the melting of a residual 2% helicity; the linear fit over the low temperature region was –1430 – 18·T. For the calculation of CD Tm values and melting curves (expressed as
U versus T, Fig. 8b), a common equation was used for unfolded state reference, [
]223, U=–900 – 29·T. The interpolation for determining the Tm was a four-order polynomial fit of melting curve. For constructs with Y3 replaced by a non-aromatic residue, alternative baselines, [
]223, U = –2050 – 21·T and [
]223, H = –22 400 + 72·T, reflecting the known positive [
]223-contribution of Tyr in both the coil and helical state (chakrabatty et al., 1993) were also examined.
Unfolding thermodynamics were derived by the method previously applied to two-state hairpin unfolding transitions (Andersen et al., 2006
). We first determined the range of
Cp values that produced a linear relationship for Eq. (1), as measured by R2. We report the thermodynamic parameters for T* = 298 K.
|
| (1) |
For TC10b,
Cp values between –330 and –110 J/mol K displayed R2>0.998, consistent with apparent two-state behavior, over the 10–90°C range. This, in turn, provided the range of
S and
H values. A recent calorimetric study of TC5b (Streicher and Makhatadze, 2007
) also found a small
CpU value, but it was positive (0.3 ± 0.1 kJ/mol K).
All NMR experiments were performed on a Bruker DRX 500 MHz spectrometer on peptide samples of 1–1.5 mM concentration in 50 mM phosphate buffer at pH 7 and 2.5, with 10% D2O. DSS was used as the internal proton reference standard and set to 0 p.p.m. for all conditions. A combination of TOCSY and NOESY spectra 2D NMR spectra recorded at temperatures ranging from 280 to 325 K were used to assign all resonances. An MLEV-17 (Bax and Davis, 1985
) spinlock (12 or 60 ms) was employed for the TOCSY and mixing times of 60, 150 or 200 ms were used for the NOESY spectra.
The CSDs (CSD =
obs –
rc) for all the peptides were calculated using the reference random coil chemical shifts (
rc) for the specific residue with nearest neighbor corrections. With the exception of Gly11, all the H
random coil values were obtained from our automated CSD calculation program (Fesinmeyer et al., 2004
), which is available at http://andersenlab.chem.washington.edu/CSDb. The reference random coil values used for Gly11-H
2, Pro18-Hβ3, Pro19-H
2 and -H
3 were 4.02, 2.29, 3.59 and 3.74 p.p.m. respectively; the proline side chain values are used for prolines at other positions as well.
NMR melting temperatures reported herein rely on sums of CSDs. For each construct, two Tm-measures, cage and helix, were calculated. The cage Tm's are derived from the sum of the CSDs of five protons (L7
, P18
, P18β3, P19
3 and P19
2; when available G11
2 replaces or supplements L7
in the calculation) experiencing large upfield ring-current shifts associated with the indole ring of Trp6. The helix melt measure employs H
sites within the helix–Y3, Q5, W6, K8. (For species with N-terminal modifications or truncations, the measure of helix formation includes only the Q5/W6/K8 H
sites.)
NH exchange studies and PF calculations
The NH exchange experiments were performed on a Bruker DRX 500 MHz spectrometer at 280 K by adding pre-cooled D2O phosphate buffer (pD = 5.0, 5.5 or 6.6) to the lyophilized peptide sample in a pre-cooled NMR tube and then recording 1D proton spectra at various time intervals. The slowest exchanging NH's were used for calculating the exchange rates and PF (PF = krc/kobs). Exchange rates (kobs) were obtained from 1D spectra as the slopes of plots of ln (NH signal intensity) versus time. The reference random coil rate constant for exchange (krc) was calculated using the appropriate Molday factors (Bai et al., 1993
), as previously described for exendin-4 and TC5b (Neidigh et al., 2001
, Neidigh et al., 2002). The relationship,
F = 1 – (1/PF) (Fezoui and Braswell, 1999
), was employed to convert PF to an extent of folding measure. The mean
GU was calculated using the
F values obtained from the PFs of the three or four slowest exchanging protons measured at the three different pDs (5.0, 5.5 and 6.6), as described previously (Lin et al., 2004
).
NMR structure ensemble calculation
NOE intensities from NOESY spectra from all available experimental conditions were converted into distance constraints. The NOE intensities were obtained directly from the NOESY spectra and these were converted to distances by an automated program, DIS (Fesinmeyer, 2005
), which also calculates the allowed ranges (see Supplementary data available at PEDS online). The resulting constraints were then used in a previously reported (Neidigh et al., 2001
) simulated annealing protocol using the program CNS (Brünger et al., 1998
). Stereospecific assignments for the glycine H
's, W6-Hβ, L7-Hβ, K8-Hβ, D9-Hβ and S14-Hβ prochiral CH2 groups, the proline ring hydrogens and the L7 methyl signals were accomplished by pairwise trials accepting the assignment that gave a significantly lower ENOE. Resolution of the D9-CβH2 and S14-CβH2 assignments required trials for all four possibilities. The resulting assignments and the constraint list appear in the Supplementary data available at PEDS online.
In the present study, 40 structures were generated, out of which 28 structures were accepted. The acceptance criteria for these structures were no NOE violation >0.2 Å and agreement with idealized covalent geometry (the mean of the bond, angle and improper torsion violations could not exceed 0.01 Å, 0.55° and 0.2° in any structure with no individual values exceeding 0.02 Å, 3° and 2°, respectively). A structure ensemble was generated using the 28 accepted structures. The backbone and heavy atom RMSD measures of convergence are the mean ± s.e. for pairwise comparisons over the entire accepted ensemble. The RMSD calculations and figures depicting structure ensembles were generated using MolMol (Koradi et al., 1996
). Since some bond lengths and angles are parameterized differently by CNS and the structure analysis algorithm used by the Research Collaboratory for Structural Bioinformatics (RCSB-PDB), the structures submitted to the PDB (accession #2JOF) are the CNS-generated structures modified by 500 steps of steepest-descent minimization using the SANDER application of AMBER6. This minimization has no significant effect on the NOE–constraint deviations. The largest heavy atom position change observed during such minimizations was 0.32 Å.
| Supplementary Material |
|---|
|
|
|---|
Supplementary data are available at PEDS online.
| Funding |
|---|
|
|
|---|
US National Institutes of Health [grant GM050658 to NHA].
| Footnotes |
|---|
Edited by Dan Raleigh
| Acknowledgements |
|---|
|
|
|---|
We thank Prof. Carlos Simmerling (Dept of Chemistry and Center for Structural Biology, Stony Brook University) for continuing insights into Trp-cage folding from molecular dynamics that have been useful in designing and analyzing mutational studies. The US National Science Foundation provided funds for the upgrade of NMR instrumentation in the UW Chemistry Department facility.
| References |
|---|
|
|
|---|
Ahmed Z., Ilir A.B., Mikhonin A.V., Asher S.A. J. Am. Chem. Soc. (2005) 127:10943–10950.[CrossRef][Web of Science][Medline]
Andersen N.H., Tong H. Protein Sci. (1997) 6:1920–1936.[Web of Science][Medline]
Andersen N.H., Neidigh J.W., Harris S.M., Lee G.M., Liu Z., Tong H. J. Am. Chem. Soc. (1997) 119:8547–8561.[CrossRef][Web of Science]
Andersen N.H., Fesinmeyer R.M., Neidigh J.W., Barua B. The Trp-Cage: A Notably Stable Mini-Protein Fold in Peptides 2000—Martinez J., Fehrentz J-A., eds. (2001) Paris, France: EDK.
Andersen N.H., Olsen K.A., Fesinmeyer R.M., Tan X., Hudson F.M., Eidenschink L.A., Farazi S.R. J. Am. Chem. Soc. (2006) 128:6101–6110.[CrossRef][Web of Science][Medline]
Asensio J.L., Cañada F.J., Bruix M., Rodriguez-Romero A., Jimenez-Barbero J. Eur. J. Biochem. (1995) 230:621–633.[Web of Science][Medline]
Bai Y., Milne J.S., Mayne L., Englander S.W. Proteins (1993) 17:75–86.[CrossRef][Web of Science][Medline]
Baldwin R.L. J. Biol. Chem. (2003) 278:17581–17588.
Barua B., Andersen N.H. Lett. Pept. Sci. (2002) 8:221–226.
Bax A., Davis D.G. J. Magn. Reson. (1985) 65:355–360.[Web of Science]
Brünger A.T., et al. Acta Crystallogr. D Biol. Crystallogr. (1998) D54:905–921.[CrossRef][Medline]
Bunagan M.R., Yang X., Saven J.G., Gai F. J. Phys. Chem. B (2006) 110:3759–3763.[Medline]
Cao B. NMR structural studies of bound peptides and hevein, a small protein. (1993) PhD.
Carnevali P., Toth G., Toubassi G., Siavash N. J. Am. Chem. Soc. (2003) 125:14244–14245.[CrossRef][Web of Science][Medline]
Chen J., Im W., Brooks C.L. III. J. Am. Chem. Soc. (2006) 128:3728–3736.[CrossRef][Web of Science][Medline]
Chowdhury S., Lee M.C., Xiong G., Duan Y. J. Mol. Biol. (2003) 327:711–717.[CrossRef][Web of Science][Medline]
Ding F., Buldyrev S.V., Dokholyan N.V. Biophys. J. (2005) 88:147–155.[CrossRef][Web of Science][Medline]
Doig A.J., Baldwin R.L. Protein Sci. (1995) 4:1325–1336.[Web of Science][Medline]
Ferguson N., Sharpe T.D., Schartau P.J., Sato S., Allen M.D., Johnson C.M., Rutherford T.J., Fersht A.R. J. Mol. Biol. (2005) 353:427–446.[CrossRef][Web of Science][Medline]
Fesinmeyer R.M. Chemical shifts define the structure and folding thermodynamics of polypeptides. (2005) PhD.
Fesinmeyer R.M., Hudson F.M., Andersen N.H. J. Am. Chem. Soc. (2004) 126:7238–7243.[CrossRef][Web of Science][Medline]
Fesinmeyer R.M., Hudson F.M., White G.W.N., Olsen K.A., Euser A., Andersen N.H. J. Biomol. NMR (2005) 33:213–231.[CrossRef][Web of Science][Medline]
Fezoui Y., Braswell E.H. Biochemistry (1999) 38:2796–2804.[CrossRef][Web of Science][Medline]
Fleming P.J., Rose G.D. Protein Sci. (2005) 14:1911–1917.[CrossRef][Web of Science][Medline]
Ghosh K., Ozkan S.B., Dill K.A. J. Am. Chem. Soc. (2007) 129:11920–11927.[CrossRef][Web of Science][Medline]
Hudson F.M., Andersen N.H. Biopolymers (Pept. Sci.) (2004) 76:298–308.[CrossRef]
Huyghues-Despointes B.M., Scholtz J.M., Baldwin R.L. Protein Sci. (1993) 2:1604–1611.[Web of Science][Medline]
Huyghues-Despointes B.M., Klinger T.M., Baldwin R.L. Biochemistry (1995) 34:13267–13271.[CrossRef][Web of Science][Medline]
Ivankov D.M., Garbuzynskiy S.O., Alm E., Plaxco K.W., Baker D., Finkelstein A.V. Protein Sci. (2003) 12:2057–2062.[CrossRef][Web of Science][Medline]
Jackups R.J., Liang J. J. Mol. Biol. (2005) 354:979–993.[CrossRef][Web of Science][Medline]
Juraszek J., Bolhuis P.G. Proc. Natl Acad. Sci. USA (2006) 103:15859–15864.
Karle I.L., Balaram P. Biochemistry (1990) 29:6747–6756.[CrossRef][Web of Science][Medline]
Kemmink J., van Mierlo C.P.M., Scheek R.M., Creighton T.E. J. Mol. Biol. (1993) 230:312–322.[CrossRef][Web of Science][Medline]
Kentsis A., Gindin T., Mezei M., Osman R. (2007) PLoS ONE, 2, e446.
Koga N., Takada S. J. Mol. Biol. (2001) 313:171–180.[CrossRef][Web of Science][Medline]
Koradi R., Billeter M., Wüthrich K. J. Mol. Graph. (1996) 14:51–55.[CrossRef][Web of Science][Medline]
Kubelka J., Hofrichter J., Eaton W.A. Curr. Opin. Struct. Biol. (2004) 14:76–88.[CrossRef][Web of Science][Medline]
Liepinsh E., Otting G., Wüthrich K. J. Biomol. NMR (1992) 2:447–465.[CrossRef][Web of Science][Medline]
Lin J.C., Barua B., Andersen N.H. J. Am. Chem. Soc. (2004) 126:13679–13684.[CrossRef][Web of Science][Medline]
Linhananta A., Boer J., MacKay I. J. Chem. Phys. (2005) 122:114901/114901–114901/114915.
Liu Y., Liu Z., Androphy E., Chen J., Baleja J.D. Biochemistry (2004) 43:7421–7433.[CrossRef][Web of Science][Medline]
Matthews B.W., Nicholson H., Becktel W.J. Proc. Natl Acad. Sci. USA (1987) 84:6663–6667.
Merkel J.S., Regan L. Fold. Des. (1998) 3:449–455.[CrossRef][Web of Science][Medline]
Mok K.H., Kuhn L.T., Goez M., Day I.J., Lin J.C., Andersen N.H., Hore P.J. Nature (2007) 447:106–109.[CrossRef][Medline]
Naduthambi D., Zondlo N.J. J. Am. Chem. Soc. (2006) 128:12430–12431.[CrossRef][Web of Science][Medline]
Neidigh J.W. Chemical shift tools in peptide folding and miniature protein design. (1999) PhD.
Neidigh J.W., Andersen N.H. Biopolymers (2002) 65:354–361.[CrossRef][Web of Science][Medline]
Neidigh J.W., Fesinmeyer R.M., Prickett K.S., Andersen N.H. Biochemistry (2001) 40:13188–13200.[CrossRef][Web of Science][Medline]
Neidigh J.W., Fesinmeyer R.M., Andersen N.H. Nat. Struct. Biol. (2002) 9:425–430.[CrossRef][Web of Science][Medline]
Neuweiler H., Doose S., Sauer M. Proc. Natl Acad. Sci. USA (2005) 102:16650–16655.
Nikiforovich G.V., Andersen N.H., Fesinmeyer R.M., Frieden C. Proteins (2003) 52:292–302.[CrossRef][Web of Science][Medline]
Olsen K.A., Fesinmeyer R.M., Stewart J.M., Andersen N.H. Proc. Natl Acad. Sci. USA (2005) 102:15483–15487.
Ota M., Ikeguchi M., Kidera A. Proc. Natl Acad. Sci. USA (2004) 101:17658–17663.
Padmanabhan S., Baldwin R.L. J. Mol. Biol. (1994) 241:706–713.[CrossRef][Web of Science][Medline]
Pitera J.W., Swope W. Proc. Natl Acad. Sci. USA (2003) 100:7587–7592.
Plaxco K.W., Simons K.T., Ruczinski I., Baker D. Biochemistry (2000) 39:11177–11183.[CrossRef][Web of Science][Medline]
Qiu L., Pabit S.A., Roitberg A.E., Hagen S.J. J. Am. Chem. Soc. (2002) 124:12952–12953.[CrossRef][Web of Science][Medline]
Schug A., Herges T., Wenzel W. Phys. Rev. Lett. (2003) 91:158102/158101–158104.
Schug A., Wenzel W., Hansmann U.H.E. J. Chem. Phys. (2005) 122:194711/194711–194717.
Searle M.S., Ciani B. Curr. Opin. Struct. Biol. (2004) 14:1–7.[CrossRef][Web of Science]
Simmerling C.L., Strockbine B., Roitberg A.E. J. Am. Chem. Soc. (2002) 124:11258–11259.[CrossRef][Web of Science][Medline]
Snow C.D., Zagrovic B., Pande V.S. J. Am. Chem. Soc. (2002) 124:14548–14549.[CrossRef][Web of Science][Medline]
Stapley B.J., Doig A.J. J. Mol. Biol. (1997) 272:465–473.[CrossRef][Web of Science][Medline]
Streicher W.W., Makhatadze G.I. Biochemistry (2007) 46:2876–2880.[CrossRef][Web of Science][Medline]
Ulmschneider J.P., Ulmschneider M.B., Di Nola A. J. Phys. Chem. B (2006) 110:16733–16742.[Medline]
Wishart D.S., Sykes B.D. Methods Enzymol. (1994) 239:363–392.[Web of Science][Medline]
Zhou R. Proc. Natl Acad. Sci. USA (2003) 100:13280–13285.
Received November 15, 2007; revised November 27, 2007; accepted November 28, 2007.
![]()
CiteULike
Connotea
Del.icio.us What's this?
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||







