Protein Engineering, Vol. 14, No. 5, 321-328,
May 2001
© 2001 Oxford University Press
Side-chain conformations in 4-
-helical bundles
1 Department of Biology, University of Crete, P.O. Box 2208, GR-71409 Heraklion and 2 Foundation for Research and TechnologyHellas, Institute of Molecular Biology and Biotechnology (IMBB), P.O. Box 1527, GR-71110 Heraklion, Crete, Greece
| Abstract |
|---|
|
|
|---|
The distribution of the
1,
2 dihedral angles in a dataset consisting of 12 unrelated 4-
-helical bundle proteins was determined and qualitatively compared with that observed in globular proteins. The analysis suggests that the 4-
-helical bundle motif could occasionally impose steric constraints on side chains: (i) the side-chain conformations are limited to only a subset of the conformations observed in globular proteins and for some amino acids they are sterically more constrained than those in helical regions of globular proteins; (ii) aspartic acid and asparagine occasionally adopt rotamers that have not been previously reported for globular or helical proteins; (iii) some rotamers of tyrosine and isoleucine are predominantly or exclusively associated with hydrophobic core positions (a, d); (iv) mutations in the hydrophobic core occur preferentially between residue types which among other physicochemical properties also share a predominant rotamer.
Keywords: 4-
-helical bundles/
-dihedral angles/rotamer/side chain
| Introduction |
|---|
|
|
|---|
The conformation of side chains is an essential feature of protein architecture. Consequently, knowledge of the factors that affect the side-chain conformations is significant, both for the understanding of protein folding and for the successful design of mutated proteins.
Side chains in proteins prefer certain conformations as shown by the non-uniform distribution of the
-dihedral angles (Janin et al., 1978
). The preferred conformations correspond to energy minima that are generally represented by three regions of
1 (around 60°, 180° and 60°; Janin et al., 1978). Analyses of the distribution of
-dihedral angles by two groups (James and Sielecki, 1983
; Ponder and Richards, 1987
) led to the definition of a rotamer as a dense cluster of points in the
-angle space; furthermore, a rotamer library was developed based upon 19 well-determined protein structures. The relationship between secondary structure and side-chain conformations was subsequently investigated (McGregor et al., 1987
; Summers et al., 1987
). These studies revealed that the rotamer preferences of side chains are strongly affected by the secondary structure. A significant correlation between the backbone
,
values and the side-chain dihedral angles was also found (Dunbrack and Karplus, 1993
). Based on a significantly enlarged data set, Schrauber et al. (Schrauber et al., 1993
) further refined the original rotamer library for globular proteins introduced by Ponder and Richards (Ponder and Richards, 1987
). The influence of backbone conformations was taken into account by grouping the rotamer distributions for each amino acid according to several secondary structure-based classes. Furthermore, the term `rotamericity' of an amino acid was introduced (Schrauber et al., 1993
), defined as the ratio of the total number of occurrences of the specific amino acid in any of the possible rotamers to the total number of occurrences of this amino acid in the sample.
In the present study, the role of the geometric constraints posed by a specific topology to the side-chain dihedral angles was investigated. As a model for protein topology, the 4-
-helical bundle motif was used. This simple, recurrent tertiary motif consists of four
-helices packed against each other in an antiparallel manner at an angle of about 20° (Figure 1a
) (Weber and Salemme, 1980
; Cohen and Parry, 1986
). The
-helices are usually connected together with loop regions; alternatively, the bundle is formed as an assembly of helices belonging to different polypeptide chains, as is the case with the ColE1 Rop protein (Banner et al., 1987
; Presnell and Cohen, 1989
; Harris et al., 1994
). The amino acid sequences of the helices follow a specific pattern of hydrophilic and hydrophobic residues of the type (a,b,c,d,e,f,g)n (Crick, 1953
). This pattern is repeated every seven residues (heptads). Positions a and d form the core of the bundle and are generally occupied by hydrophobic amino acids (Figure 1b
).
|
The amino acid frequencies for the seven topologically distinct positions of the heptad repeat revealed highly specific relationships between topology and sequence preferences (Paliakasis and Kokkinidis, 1992
-helical bundle topology are reflected not only on the pattern of amino acid sequence but also in the conformations of side chains as expressed in terms of
-angles. In this study, the distribution of
1,
2 dihedral angles found in a sample of 12 4-
-helical bundles is compared with the conformational preferences of side chains observed in globular proteins or specifically in
-helices. However, this work can only provide qualitative information since the small size of the sample makes it difficult to draw definite conclusions. | Materials and methods |
|---|
|
|
|---|
To analyze the conformations of the
-dihedral angles, an initial sample of 443 4-
-helical bundle structures (326 of which were lysozyme molecules) was obtained from the Brookhaven Protein Data Bank (Bernstein et al., 1977; http://pdb-browsers.ebi.ac.uk//) and the FSSP database (Holm and Sander, 1996; http://www2.ebi.ac.uk/dali/fssp/fssp.html). To avoid bias due to sequence homologies, a subset of 12 structures with <25% sequence identity was used (Table I
-helical residues only (588 out of a total of 741 residues in the sample). Gly, Pro and Ala residues were excluded. Assignment of the heptad positions (ag) in the sequences of our sample was carried out manually by inspecting their structures with the program O (Jones et al., 1991
2, the analysis was restricted to
1,
2 only. For the classification of
1,
2 combinations, the conventions of Schrauber et al. (1993) were used, i.e. a side-chain conformation was assigned to a specific rotamer if the dihedral angles did not deviate by more than 20° from the values reported for this rotamer. To investigate whether the tightly packed hydrophobic cores impose special constraints to side chains, each amino acid type was analyzed according to its topological position in the bundle using a classification of residues into two groups. The first group consisted of internal residues (a and d positions) while the second group comprised more exposed residues (positions b, c, e, f and g).
|
Furthermore, the constraints imposed by 4-
-helical bundle cores on the evolution of their sequences were examined from the perspective of rotamer conservation. More specifically, the internal positions (a and d) in a series of aligned sequences of homologous 4-
-helical bundles were compared and the pattern of amino acid substitutions was examined from the aspect of rotamer conservation. To perform this analysis, representative sets of homologous sequences for eight of the proteins of our sample were identified using the Swiss Prot data bank (Bairoch and Apweiler, 1997
|
| Results |
|---|
|
|
|---|
Owing to the small size of the sample (see Materials and methods section), only qualitative information can be obtained by this work. Nevertheless, as far as we can assert by estimated standard deviations, our results (rotamer preferences, etc.) are generally consistent with the statistics of the much larger sample of globular proteins performed by other authors (Schrauber et al., 1993
Amino acid composition of the sample
The amino acid composition of the helical parts of the proteins used in this work is presented in Figure 2
. Compared with an earlier analysis (Paliakasis and Kokkinidis, 1992
) it shows only minor differences, i.e. (i) a higher occurrence of Leu and (ii) an increase in the Leu to Ala ratio. The composition of internal (a, d) positions (Figure 2
) shows a clear predominance of Leu, Ala, Ile and Val residues, which agrees well with the preferences found by Paliakasis and Kokkinidis.
|
Rotamer preferences
The clusters of
1,
2 dihedral angles found for the side chains of our sample are presented in Table III
. The most striking feature of this distribution for the majority of amino acid types is that a large fraction of the side chains belong to a single rotamer. Ile, Val, Thr and Cys are extreme examples, where the dominant cluster contains at least five times more members than the next one. On the other hand, Leu and Arg have two densely populated rotamers whereas Ser and Glu do not show pronounced preferences for a particular rotamer.
|
Table III
-helical regions of globular proteins (in bold in Table III
-helical bundles are limited to a small subset of the rotamers found in globular proteins. Furthermore, the side-chain conformations of Tyr, Met, Thr and Cys appear to be more constrained in our sample compared with
-helical regions of globular proteins because there is a strong preference for one out of several possible `helical rotamers'. As shown in Table III
-helical regions. A novel rotamer with
1 = 178° and
2 = 64° which to our knowledge has not been reported earlier, was found for Asp (see Table III
1 = 170° and
2 = 59° (Table III
|
A possible interpretation of these findings could be that the 4-
-helical bundle motif imposes special constraints on side-chain conformations. If this is indeed so, one would expect these constraints to be more pronounced for residues of the tightly packed hydrophobic core. For this reason, the residues occupying the hydrophobic core positions a and d of the heptad repeat were treated separately. For each amino acid type, the residues were distinguished in two groups according to their location in (a, d) and (b, c, e, f, g) positions and the
1,
2 plots were obtained. Inspection of these diagrams showed that for Ile and Tyr residues the two groups are segregated to different rotamers (Figure 4a and b
1 = 62°,
2 = 60° for Ile and
1 = 70°,
2 = 107° for Tyr, correspond to rotamers which are also present in the globular proteins, but which have not been reported yet in connection with protein cores.
|
Another useful indicator for assessing the influence of position-specific effects to the side-chain conformations is the rotamericity (Schrauber et al., 1993
Correlation between frequency of natural mutations and rotamer preferences in the hydrophobic core of the 4-
-helical bundles
Natural mutations in the hydrophobic core of 4-
-helical bundles were analyzed from the aspect of rotamer conservation [rotamer conservation in proteins has been reported (Summers et al., 1987
) and forms the basis of molecular modeling by homology]. In this analysis, the sequence alignments described in the Materials and methods section were examined and the pattern of sequence variation at the a and d positions was studied. From a total of 226 pairs of aligned amino acids in the core, 81 substitutions were found. In Table IV
the observed substitutions and the corresponding frequencies for the residues commonly found in a and d positions are presented. An interesting observation is the relatively high frequency of Val
Leu and Met
Leu substitutions. It is worth noting that only four types of substitution account for 40% of the total observed (Met
Leu, Val
Leu, Leu
Ile and Leu
Val). Comparison of Tables III and IV![]()
shows that substitutions occur overwhelmingly between residue types that have their major rotamers in common. For example, the main rotamer of Ile and Met coincides with one of the main rotamers of Leu. As an exception to the above observations, Phe
Ile substitution occurs between residues that do not share any common rotamer while some relatively rare substitutions (e.g. Met
Phe, Tyr
Asp and Tyr
Met) occur between residues which do not share a major rotamer but have at least one less populated
1,
2 cluster in common. For example, the dominant rotamer of Tyr and Phe (
1 = 176°,
2 = 78° and
1 = 180°,
2 = 75°, respectively) coincide with one of the `rare' rotamers of Met (
1 = 175°,
2 = 64°).
|
| Discussion |
|---|
|
|
|---|
The conclusions of this work can be summarized as follows. (i) In addition to the known influence of
-helical secondary structure on the side-chain conformations (Schrauber et al., 1993
-helical bundle topology probably imposes on some residues additional constraints and restricts the permitted conformations overwhelmingly to a unique rotamer. (ii) The side chains of aspartic acid and probably asparagine occasionally adopt a conformation that is associated with a novel rotamer. (iii) For isoleucine and tyrosine, rotamers were found which appear to be more compatible with residues in the hydrophobic core. (iv) Natural mutations in a and d positions of the 4-
-helical bundles tend to occur between amino acids which among other properties have their main rotamers in common.
Previous studies have demonstrated that the secondary structure significantly affects the side-chain conformations by restricting them to a subset of those found in globular proteins (McGregor et al., 1987
; Schrauber et al., 1993
). This is confirmed and reinforced by our analysis, which in addition showed that for some amino acids (Tyr, Met, Thr, Cys) the preference of specific side-chain conformations is much more pronounced even when compared to
-helices. This could be an effect of the 4-
-helical bundle topology, which imposes additional constraints to side chains limiting the permitted conformations to a subset of those adopted by
-helices.
Generally, the majority of side chains in the 4-
-helical bundles do not adopt novel or unusual conformations; side chains are clustered in some of the already known regions (from the globular proteins) of the
1,
2 space. A notable exception to this statement is the Asp residue, several occurrences of which have been classified as a novel rotamer (Figure 5a
). This result was unexpected given that Asp is not subject to the specific constraints of the motif, since it systematically occurs in external or semi-buried positions. Indications of a new rotamer are also present for Asn. An analysis of all Asp and Asn residues in our sample shows that their temperature factors are fairly low and very close to or even lower than the average temperature factor of the structure to which they belong. Thus, the observed behavior is probably not an artifact due to an increased side-chain flexibility. The novel rotamers are not associated with a particular position of Asp/Asn residues (e.g. capping residues) or a particular backbone conformation (all Asp/Asn residues studied have helical
,
angles). Detailed inspection and comparison with the known rotamers of the globular proteins, showed that the novel rotamer places the side chain in an optimal position relative to the protein backbone and the side chains of the neighboring
-helices, so that electrostatic repulsions between charged groups and steric hindrances between the side chains are minimized. However, the size of our sample is too small to go beyond a qualitative discussion of these novel rotamers.
|
A related observation is that Ile and Tyr exhibit a rotamer that is exclusively occupied by internal residues (positions a and d). Taking into account our limited data we could only assert that these conformations appear to be less well accepted by external residues; however, they are adopted by a significant fraction of the total number of internal residues (~50% for Tyr and 20% for Ile). For Ile, in particular, this position-specific conformation does not belong to the preferred rotamer of
-helices (Schrauber et al., 1993
Side-chain shape, volume, polarity, packing density and cavity volume have been reported as factors that affect the pattern of substitutions in the protein interior (Bordo and Argos, 1990
; Vlassi et al., 1999
). Conservation of these structural parameters is important for hydrophobic core mutations. Our present study provides evidence that the pattern of side-chain substitutions in 4-
-helix bundles is also consistent with the conservation of highly populated rotamers. In a previous study, Summers et al. (1987) concluded that for structurally and functionally homologous proteins, there is a high probability that the side-chain conformations are conserved. For amino acid substitutions, the conservation of orientation of both C
and C
atoms, was estimated to be in order of 3575% (Summers et al., 1987
). In the case of 4-
-helical bundles, the frequency of the amino acid substitutions that occur between residues with at least one rotamer in common was estimated to be ~75% and indicates an extensive conservation of rotamers in the core of homologous bundles. This is consistent with recent work by Vlassi et al. (1999), which found a pronounced tendency of 4-
-helical bundles to preserve hydrophobic core packing interactions upon mutations. Exceptions to this behavior (e.g. Phe
Ile substitutions), where rotamer conservation is not possible, occur at the end of the bundles. It is reasonable to assume that the constraints of the motif at those positions are weaker.
The aim of this work, as mentioned, was to examine qualitatively whether steric hindrances posed by the 4-
-helical bundle topology are reflected in the conformations of side chains. Within this framework, it has been shown that a tertiary motif can affect to some extent the conformations adopted by the side chains of some amino acids. This happens first through additional restrictions imposed to the side-chain conformations, second through the formation of novel
1,
2 clusters and third through some rotamers that appear to be more compatible with internal residues. A natural extension of this work would be a systematic analysis of additional known tertiary motifs. Such a study would provide valuable information for the understanding of protein folding and would find applications in the successful design of mutations and in the homology modeling of new structures.
| Notes |
|---|
3 To whom correspondence should be addressed. E-mail: kokkinid{at}imbb.forth.gr
| References |
|---|
|
|
|---|
Bairoch,A. and Apweiler,P. (1997) J. Mol. Med., 75, 312316.[Web of Science][Medline]
Banner,D.W., Kokkinidis,M. and Tsernoglou,D. (1987) J. Mol. Biol., 196, 657675.[Web of Science][Medline]
Bernstein,F.C., Koetzle,T.F., Williams,G.J.B., Meyer,E.F., Brice,M.D., Rogers,J.R., Kennard,O., Shimanouchi,T. and Tsumi,M. (1997) J. Mol. Biol., 112, 535542.
Bordo,D. and Argos,P. (1990) J. Mol. Biol., 211, 975988.[Web of Science][Medline]
Cohen,C. and Parry,D.A.D. (1986) Trends Biochem. Sci., 11, 245248.
Collaborative Computational Project Number 4 (1994) Acta Crystallogr., D50, 760763.
Crick,F.H.C. (1953) Acta Crystallogr., 6, 689697.
Dunbrack,L.R. and Karplus,M. (1993) J. Mol. Biol., 230, 543574.[Web of Science][Medline]
Harris,N.L., Presnell,S.R. and Cohen,F.E. (1994) J. Mol. Biol., 236, 13561368.[Web of Science][Medline]
Holm,L. and Sander,C. (1996) Science, 273, 595602.
James,M.N.G. and Sielecki,A.R. (1983) J. Mol. Biol., 183, 299361.
Janin,J., Wodak,S., Levitt,M. and Maigret,B. (1978) J. Mol. Biol., 125, 357386.[Web of Science][Medline]
Jones,T.A., Zou,J-Y., Cowan,S.W. and Kjeldgaard,M. (1991) Acta Crystallogr., A47, 110119.
McGregor,M.J., Islam,S.A. and Sternberg,M.J.E. (1987) J. Mol. Biol., 198, 295310.[Web of Science][Medline]
Paliakasis,C.D. and Kokkinidis,M. (1992) Protein Eng., 5, 739748.
Pearson,W.R. and Lipman,D.J. (1988) Proc. Natl Acad. Sci. USA, 85, 24442448.
Ponder,J.W. and Richards,M. (1987) J. Mol. Biol., 193, 775791.[Web of Science][Medline]
Presnell,S.R. and Cohen,F.E. (1989) Proc. Natl Acad. Sci. USA, 86, 65926596.
Schrauber,H., Eisenhaber,F. and Argos,P. (1993) J. Mol. Biol., 230, 592612.[Web of Science][Medline]
Summers,N.L., Carlson,W.D. and Karplus,M. (1987) J. Mol. Biol., 196, 175198.[Web of Science][Medline]
Vlassi,M., Cesareni,G. and Kokkinidis,M. (1999) J. Mol. Biol., 285, 817827.[Web of Science][Medline]
Weber,P.C. and Salemme,F.R. (1980) Nature, 287, 8284.[Medline]
Received September 14, 2000; accepted February 15, 2001.
![]()
CiteULike
Connotea
Del.icio.us What's this?
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||




