Protein Engineering, Vol. 14, No. 1, 7-15,
January 2001
© 2001 Oxford University Press
Assessing the role of tryptophan residues in the binding site
Department of Biochemistry and Bioinformatics Centre, Bose Institute, P-1/12, CIT Scheme VIIM, Calcutta 700 054, India
| Abstract |
|---|
Instead of looking at the interfacial area as a measure of the extent of a protein–protein recognition site, a new procedure has been developed to identify the importance of a specific residue, namely tryptophan, in the binding process. Trp residues which contribute more towards the free energy of binding have their accessible surface area reduced, on complex formation, for both the main-chain and side-chain atoms, whereas for the less important residues the reduction is restricted only to the aromatic ring of the side chain. The two categories of residues are also distinguished by the presence or absence of hydrogen bonds involving the Trp residue in the complex. A comparison of the observed change in the accessible surface area with the value calculated using an analytical expression provides another way of characterizing the Trp residues critical for binding and this has been used to identify such residues involved in binding non-proteinaceous molecules in protein structures.
Keywords: accessible surface area/molecular recognition/protein-protein complexes/substrate binding/tryptophan
| Introduction |
|---|
The recognition and association between macromolecules are fundamental to the functioning of biological systems. The affinity between two molecules for the formation of non-covalent complexes can be quantified on a structural basis (Janin, 1995a

ΔΔG) has been measured. They found that the free energy of binding is not evenly distributed across interfaces; instead, there are hotspots (ΔΔG ≥ 2 kcal/mol) of binding energy made up of a small subset of residues in the dimer interface. Of all the amino acid residues found in the interface, tryptophan (Trp) has the highest likelihood of being in a hotspot. In this context, it would be of interest to see if there is any structural or binding feature in the three-dimensional structure of a complex that one can use to distinguish a Trp residue in the hotspot from another which is energetically less important. Such characteristics can then be used to assess the importance of Trp in a protein in the binding of other non-proteinaceous molecules, such as carbohydrate, cofactor, substrate or drug.
We have recently analyzed the environment of Trp residues (the aromatic part of the side chain, in particular) in protein structures, the nature of the interacting residues (partners) and the exponential dependence of the accessible surface area of the Trp residue on its number of partners (other protein residues in contact with Trp) (Samanta et al., 2000). As atoms buried at protein–protein interfaces are close-packed like the protein interior (Lo Conte et al., 1999), the aforementioned features of Trp residues in proteins should also be transferable to the residues in the interface region. Consequently, one should be able to assess the role of Trp in the binding by finding the change in the number of its partner residues on complex formation and the associated loss in its accessible surface area and by looking at other elements of its environment and comparing the results with those found within protein structures. This paper is an anatomy of Trp residues in energetic hotspots and in other, less important regions of protein–protein interfaces, as well as of those involved in the binding of other small molecules.
| Materials and methods |
|---|
Information on the Trp residues which are at the protein–protein interface, as revealed by the crystallographic analysis of the heterodimeric complex, was obtained from the file interface.xls in http://motorhead.ucsf.edu/~thorn/hotspot (Bogan and Thorn, 1998
For the analysis of the role of Trp in binding small molecules (termed substrates in this paper), all non-proteinaceous molecules (excluding water) in contact with Trp residues were identified for a selected set of 180 protein structures from the Protein Data Bank (PDB) (Sussman et al., 1998
| Results and discussion |
|---|
Trp residues in protein interface
Table I lists Trp residues that are or are not in hotspots, as elucidated by Bogan and Thorn (1998). The number of partner residues in contact with the Trp residue, considering either the whole residue or just the aromatic ring, before and after complex formation, and the names of the partner residues are also provided. Figure 1 depicts a Trp in the interface and how its partners are disposed in the two subunits.
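The partner counts before and after complex formation can be illustrated with a minimal sketch. It assumes that a partner is any residue with at least one atom within a fixed distance of a Trp atom; the 4.5 Å cutoff, the function name `count_partners`, the residue labels and the toy coordinates are all hypothetical and are not taken from this paper.

```python
import math

def count_partners(trp_atoms, residues, cutoff=4.5):
    """Return labels of residues with any atom within `cutoff` (Å) of any Trp atom.

    trp_atoms: list of (x, y, z) coordinates for the Trp atoms considered
               (whole residue, or only the indole ring).
    residues:  dict mapping a residue label to its list of (x, y, z) atoms.
    cutoff:    ASSUMED contact distance; the criterion used in the paper
               is not stated in this text.
    """
    partners = []
    for label, atoms in residues.items():
        if any(math.dist(a, b) <= cutoff for a in trp_atoms for b in atoms):
            partners.append(label)
    return partners

# Toy coordinates (hypothetical, for illustration only)
trp = [(0.0, 0.0, 0.0), (1.4, 0.0, 0.0)]
own_subunit = {"A:Leu10": [(3.8, 0.0, 0.0)], "A:Gly50": [(10.0, 0.0, 0.0)]}
other_subunit = {"B:Arg43": [(0.0, 3.8, 0.0)], "B:Glu44": [(0.0, 9.0, 0.0)]}

# Before complex formation: only residues of the Trp's own subunit are present.
before = count_partners(trp, own_subunit)
# After complex formation: residues of both subunits can be partners.
after = count_partners(trp, {**own_subunit, **other_subunit})

print(len(before), "partner(s) before:", before)
print(len(after), "partner(s) after:", after)
```

Passing only the indole ring atoms as `trp_atoms` would give the ring-based count, and passing all atoms of the residue the whole-residue count.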
An analysis of the environment of the aromatic ring of Trp showed that the peak in the distribution of the number of protein residues in contact with the ring (the so-called partners) occurs at six (Samanta et al., 2000
ΔASAw and ΔASAr in Table II
Using the analytical expression relating the accessible surface area of a Trp residue and its number of partners (Samanta et al., 2000
ΔASAw and ΔASAr, can be calculated. These values are, in general, smaller than the observed values irrespective of whether or not the Trp residue is in a hotspot (Table II
ΔASAw is greater than that of ΔASAr for Trp residues in a hotspot and the calculated values also reflect the same trend. Conversely, for residues not in a hotspot, the trend is just the opposite (with one exception), i.e. ΔASAw (calc.) < ΔASAr (calc.); in one case both the values are 0.0, as there is no change in the number of partners on complex formation.
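How a calculated ΔASA follows from a change in the number of partners can be sketched as below. The analytical expression itself is given in Samanta et al. (2000) and is not reproduced in this text, so the sketch assumes a generic exponential decay, ASA(n) = a0 · exp(−k · n); the functional form, the function names and the parameter values are placeholders, not the published ones.

```python
import math

def asa_from_partners(n_partners, a0, k):
    """ASSUMED form: ASA(n) = a0 * exp(-k * n), in Å^2."""
    return a0 * math.exp(-k * n_partners)

def delta_asa_calc(n_before, n_after, a0, k):
    """Expected loss of ASA when the partner count rises from n_before to n_after."""
    return asa_from_partners(n_before, a0, k) - asa_from_partners(n_after, a0, k)

# Placeholder parameters (hypothetical; one set for the whole residue, one for the ring)
WHOLE = {"a0": 250.0, "k": 0.30}
RING = {"a0": 150.0, "k": 0.35}

d_asa_w = delta_asa_calc(4, 6, **WHOLE)    # whole residue gains two partners
d_asa_r = delta_asa_calc(3, 5, **RING)     # ring gains two partners
unchanged = delta_asa_calc(5, 5, **WHOLE)  # no new partners on complex formation

print(f"dASAw(calc) = {d_asa_w:.1f}, dASAr(calc) = {d_asa_r:.1f}")
print(f"no change in partners -> dASA(calc) = {unchanged:.1f}")
```

Under any such model an unchanged partner count gives a calculated ΔASA of exactly zero, which matches the case noted in the text where both calculated values are 0.0.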
Trp residues in substrate-binding site
Based on the above observations on Trp residues in the protein interface, we wanted to see if it is possible to assess the importance of Trp residues in the binding site of non-proteinaceous molecules in protein structures. For a residue to be important the following two conditions have to be satisfied: ΔASAw (obs.) ≥ 2ΔASAw (calc.) and ΔASAr (obs.) ≥ 2ΔASAr (calc.). These conditions are only approximate: when applied to the residues in Table II, they would have missed one hotspot residue and would have identified one non-hotspot residue as important. However, in the case of substrate binding these conditions should be more appropriate. As the substrates are usually much larger than the average size of an amino acid residue and the ΔASA values are calculated assuming an increase in the number of partners by just one owing to the substrate binding, the calculated values are expected to be smaller than the values actually observed if the Trp residue is crucial for the binding of the substrate. The other criterion for an important residue is the existence of a hydrogen bond between Trp and the substrate molecule.
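The two ASA conditions and the hydrogen-bond criterion can be combined in a small sketch. It reads the conditions as observed ≥ 2 × calculated (the comparison operator is an interpretation of the text), and the three output labels, the class `TrpSite`, the zero-change guard and the example numbers are illustrative assumptions rather than part of the published procedure.

```python
from dataclasses import dataclass

@dataclass
class TrpSite:
    d_asa_w_obs: float   # observed ΔASA, whole residue (Å^2)
    d_asa_w_calc: float  # calculated ΔASA, whole residue (Å^2)
    d_asa_r_obs: float   # observed ΔASA, aromatic ring (Å^2)
    d_asa_r_calc: float  # calculated ΔASA, aromatic ring (Å^2)
    has_hbond: bool      # hydrogen bond between Trp and the substrate

def meets_asa_conditions(site, factor=2.0):
    """Both observed changes must be at least `factor` times the calculated ones."""
    # Guard added for this sketch (not from the text): a site with no
    # observed change in ASA is never flagged, even if calc is also zero.
    if site.d_asa_w_obs <= 0 or site.d_asa_r_obs <= 0:
        return False
    return (site.d_asa_w_obs >= factor * site.d_asa_w_calc
            and site.d_asa_r_obs >= factor * site.d_asa_r_calc)

def assess(site):
    """Illustrative three-way label; the tiers are an assumption of this sketch."""
    asa_ok = meets_asa_conditions(site)
    if asa_ok and site.has_hbond:
        return "likely important"
    if asa_ok or site.has_hbond:
        return "possibly important"
    return "not flagged"

# Hypothetical values, not taken from Table III
print(assess(TrpSite(60.0, 20.0, 40.0, 15.0, True)))   # both criteria met
print(assess(TrpSite(25.0, 20.0, 18.0, 15.0, False)))  # neither criterion met
print(assess(TrpSite(1.0, 5.0, 0.5, 4.0, True)))       # hydrogen bond only
```

The middle tier is there for cases in which only one criterion holds, for example a small ligand that barely changes the ASA but is hydrogen bonded to the Trp.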
The formulae of all the substrate molecules used in our analysis and their atoms which are found in contact with the indole ring of Trp residues in different PDB files are shown in Figure 3. Information on Trp residues, their partners, accessible surface areas and how these change on substrate binding is provided in Table III. In one respect these Trp residues are different from those in the protein interface. Whereas the latter residues have 2–5 partners (around the aromatic ring) in the parent molecule (Table II), the majority of the former residues have a value of 6. The substrate molecules are of different shapes and sizes. Trp residues which are deemed to be important in substrate binding using the conditions on ΔASA are marked with dots in the last column in Table III. If in addition there is a hydrogen bond between the Trp residue and the substrate, the residue is likely to be important in substrate binding. One example is the binding of FMN by Trp57 in the structure 1rcf. The structure 1stp is that of streptavidin, which binds biotin with exceptionally high affinity (Kd = 10⁻¹⁵ M) (Green, 1975). There are three Trp residues in the binding site (Weber et al., 1989) and all are shown to be important, thus lending credence to the predictive power of our methodology. Moreover, aromatic-sugar stacking is a typical feature of protein–carbohydrate interactions (Vyas, 1991; Kadziola et al., 1998). In all the structures (1byb, 1cel, 1slt and 2gbp) where a carbohydrate molecule is bound, there is at least one Trp residue which is shown to be important. However, the decrease in the accessible surface area on substrate binding may not be the best criterion for judging the role of a residue in every case. For example, in the binding of the small sulfate ion (structure 1sbp), there is hardly any change in ASA and the formation of the hydrogen bond could be the deciding factor. Another situation where the comparison of the observed and calculated values of ΔASA may not yield the right result is when the number of partners is atypically small, e.g. 0. In 4fxn, although the observed value is one of the highest in the table, the calculated value is also large and their difference is very small. Nevertheless, this procedure provides some guidelines as to the importance of a Trp residue in binding, which can then be corroborated by protein engineering experiments.
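To put the streptavidin–biotin affinity on the same energy scale as the hotspot threshold, a dissociation constant of the order of 10⁻¹⁵ M can be converted to a standard free energy of binding with the textbook relation ΔG° = RT ln Kd (1 M standard state). The temperature of 298.15 K and the function name are assumptions of this sketch; neither is stated in the paper.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol·K)

def binding_free_energy(kd_molar, temperature_k=298.15):
    """Standard free energy of binding (kcal/mol) from Kd in mol/L.

    Uses ΔG° = R * T * ln(Kd) with a 1 M standard state;
    the default temperature is an assumption.
    """
    return R_KCAL * temperature_k * math.log(kd_molar)

dg = binding_free_energy(1e-15)
print(f"Kd = 1e-15 M corresponds to ΔG° of about {dg:.1f} kcal/mol")
```

The result is roughly ten times the 2 kcal/mol used to define a hotspot residue, which gives a sense of how much binding energy is shared among the residues of the biotin site.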
Conclusion
Depending on the magnitude of contribution towards the binding energy, interface residues have been classified as being or not being in a hotspot (Bogan and Thorn, 1998). In this paper we analyzed whether it is possible to distinguish Trp residues in hotspots from those which are not, on the basis of crystal structure data. We find that for Trp residues not in hotspots, the change in accessible surface area of the Trp residue on complex formation is restricted to only the indole ring, whereas for hotspot residues the change involves the whole residue. Although the former residues do not form hydrogen bonds with the physiological partner molecule, a hydrogen bond is usually formed for the latter residues. Depending on the change in the number of partner residues, it is possible to calculate the expected change in the accessible surface area of a Trp residue due to complex formation. The observed values are, in general, found to be greater than the calculated values. Similar comparisons between the observed and calculated values and the identification of any hydrogen bond linking Trp to the substrate molecule provide a way to assess the importance of Trp residues in the substrate-binding sites. Based on these encouraging results involving Trp, we are now in the process of extending the methodology to other residues.
| Notes |
|---|
1 To whom correspondence should be addressed. E-mail: pinak{at}boseinst.ernet.in
| Acknowledgments |
|---|
The authors are grateful to the Department of Biotechnology and the Council of Scientific and Industrial Research for financial support.
| References |
|---|
Bogan,A.A. and Thorn,K.S. (1998) J. Mol. Biol., 280, 1–9.
Chothia,C. and Janin,J. (1975) Nature, 256, 705–708.
Green,N.M. (1975) Adv. Protein Chem., 29, 85–133.
Hubbard,S.J. (1991) ACCESS, a Program for Calculating Accessibilities. Department of Biochemistry and Molecular Biology, University College London, London.
Janin,J. (1995a) Biochimie, 77, 497–505.
Janin,J. (1995b) Proteins: Struct. Funct. Genet., 21, 30–39.
Janin,J. and Chothia,C. (1990) J. Biol. Chem., 265, 16027–16030.
Jones,S. and Thornton,J.M. (1996) Proc. Natl Acad. Sci. USA, 93, 13–20.
Kadziola,A., Sogaard,M., Svensson,B. and Haser,R. (1998) J. Mol. Biol., 278, 205–217.
Lee,B. and Richards,F.M. (1971) J. Mol. Biol., 55, 379–400.
Lo Conte,L., Chothia,C. and Janin,J. (1999) J. Mol. Biol., 285, 2177–2198.
McCoy,A.J., Epa,V.A. and Colman,P.M. (1997) J. Mol. Biol., 268, 570–584.
Norel,R., Lin,S.L., Wolfson,H.J. and Nussinov,R. (1994) Biopolymers, 34, 933–940.
Samanta,U., Pal,D. and Chakrabarti,P. (2000) Proteins: Struct. Funct. Genet., 38, 288–300.
Sayle,R.A. and Milner-White,E.J. (1995) Trends Biochem. Sci., 20, 374.
Sussman,J.L., Lin,D., Jiang,J., Manning,N.O., Prilusky,J., Ritter,O. and Abola,E.E. (1998) Acta Crystallogr., D54, 1078–1084.
Vyas,N.K. (1991) Curr. Opin. Struct. Biol., 1, 732–740.
Weber,P.C., Ohlendorf,D.H., Wendoloski,J.J. and Salemme,F.R. (1989) Science, 243, 85–88.
Wells,J.A. (1991) Methods Enzymol., 202, 390–411.
Received May 9, 2000; revised October 31, 2000; accepted November 9, 2000.

positions indicated) in contact with Trp169B in the PDB file 3hhr (details are available in Table I



