Protein Engineering, Vol. 12, No. 12, 1021-1024,
December 1999
© 1999 Oxford University Press
COMMUNICATIONS |
Correlation between occupancy and B factor of water molecules in protein crystal structures
European Molecular Biology Laboratory, Meyerhofstrasse 1,69012 Heidelberg, Germany and Department of General Chemistry,Pavia University, via Taramelli 12, 27100 Pavia, Italy.E-mail: carugo{at} embl-heidelberg.de
| Abstract |
|---|
|
|
|---|
An empirical relationship between occupancy and the atomic displacement parameter of water molecules in protein crystal structures has been found by comparing a set of well refined sperm whale myoglobin crystal structures. The relationship agrees with a series of independent structural features whose impact on water occupancy can easily be predicted as well as with other known data and is independent of the protein fold. The estimation of the water occupancy in protein crystal structures may help in understanding the physico-chemical properties of the proteinsolvent interface and can allow the monitoring of the accuracy of the protein crystal structure refinement.
Keywords: atomic displacement parameter/occupancy/protein hydration/protein structure/water structure
| Introduction |
|---|
|
|
|---|
The importance of the proteinwater interface can hardly be overemphasized since it mediates any physiological protein behaviour. Most of what we know about protein hydration results from crystallographic studies. Despite recent advances in the treatment and refinement of water posional parameters at the proteinsolvent interface (Schoenborn et al., 1995
| Materials and methods |
|---|
|
|
|---|
Twenty-six sperm whale myoglobin crystal structures determined at room temperature (room temperature was assumed if not otherwise explicitly specified), in the same space group (P21), with identical sequence (disregarding the first and last residues and without mutations, deletions and gaps in the electron density maps) and resolution
2.0 Å were found in the Protein Data Bank (Bernstein et al., 1977|
|
All equivalenced C
atoms were superposed on those of the structure 104m by the method of Kabsch and McLachan (Kabsch, 1978
; McLachan, 1979
) and the rotation and translation matrices were applied to the coordinates of the water molecules. The root-mean-square distances between equivalent C
atoms varied from 0.148 to 0.507 Å. A cluster of water molecules was then built around each water molecule by grouping water molecules no more distant than a given threshold distance of identity (t.d.i.) from the central water molecule. The appropriate symmetry operations were applied to each water molecule in each structure to detect equivalences when water molecules were positioned in distant parts of the asymmetric unit. Threshold distances of identity were varied from 0.25 to 3.25 Å in steps of 0.25 Å. Care was taken to avoid redundancies so that a cluster found several times (i.e. by considering different central water molecules with identical surrounding water molecules) was included only once in the analysis. The mean normalized a.d.p. was computed for each cluster and also the mean number of hydrogen bonds formed by its water molecules. The hydrogen bonds, computed with HBPLUS (McDonald and Thornton, 1994
), were classified into three types, those with other water molecules, those with main-chain atoms and those with side-chain atoms. The occupancy of each cluster was computed as the ratio between the number of its water molecules and the number of myoglobin structures. Hence a cluster containing 26 (or 13) water molecules has occupancy 1.0 (or 0.5).
| Results and discussion |
|---|
|
|
|---|
The relationship between normalized a.d.p. and occupancy of a cluster can be evaluated by a least-squares fit of the function
|
|
2.75 Å, the regression coefficients were statistically independent of the t.d.i., as shown in Figure 1|
|
|
|
The mean number of hydrogen bonds increases with the cluster occupancy nearly linearly from 1.5 (occupancy lower than 0.1) to 3.8 (occupancy higher than 0.9). The latter value nearly corresponds to the saturation of the hydrogen bond capability of a water molecule. In detail, the mean number of hydrogen bonds with other water molecules, with main-chain atoms and with side-chain atoms increases from 1.1, 0.2 or 0.2 for cluster occupancy lower than 0.1 to 2.6, 0.5 or 0.7 for cluster occupancy higher than 0.9, respectively. This agrees with the observation that the occupancy of a hydration site is strictly proportional to the total number of possible hydrogen bonds (Lounnas and Pettit, 1994a
|
Equation 3 was used to estimate the occupancies of water molecules in a set of 257 protein structures selected with PDBSELECT (Hobohm and Sander, 1994
|
It must be remembered that the correlation between occupancy and a.d.p. value should be applied within each protein monomer in the case of structures containing several monomers in the asymmetric unit. For example, in a dimeric protein it may happen that one monomer has an overall mean a.d.p. of 15 Å2 (in B units) and the other of 30 Å2 due to a different crystal packing. A well conserved water molecule located in both monomers is expected to have the same occupancy despite the fact that it will probably be associated with a higher a.d.p. value in the second monomer. The normalization of the a.d.p. values (Equation 1) within each monomer should give the water molecule the same normalized a.d.p. value in both monomers and thus should allow the computing of the same occupancy.
In conclusion, an empirical relationship between observed atomic displacement parameters and occupancies of water molecules has been derived by analysing known myoglobin crystal structures. The predicted occupancies agree with other structural features whose trends are clearly predictable. High occupancy hydration sites are more probable within the first protein hydration layer, cluster together and are strictly associated with the formation of many hydrogen bonds. It has also been shown that the predicted occupancies do not depend on the fold type, in agreement with the currently accepted model that protein hydration mostly depends on the local chemical features of the protein surface (Pettit et al., 1998
).
These results can have a large range of applications. They allow one to monitor the progress in the protein crystal structure refinement by avoiding possible overfitting due to the inclusion of too low an occupancy of water molecules in the model. Moreover, water molecules with too high an occupancy (much larger than 1.0) can be detected and should be considered as possible atoms other than water oxygens, for example metal cations or heavy anions. These results also should allow for more detailed physico-chemical characterizations of the proteinbulk water interface which plays a relevant role in most protein functions such as recognition of cofactors, substrates and other proteins and could be included in docking predictions (Sternberg et al., 1998
). A more precise evaluation of the proteinbulk water interface is also desirable when physical and mechanical properties of the protein are of interest, since the flexibility of a protein fragment is always directly connected to the flexibility of the interface.
| Notes |
|---|
1 Correspondence should be sent to the Pavia address
| References |
|---|
|
|
|---|
Bernstein,F.C., Koetzle,T.F., Williams,G.J.B., Meyer,E.F., Brice,M.D., Rodgers,J.R., Kennard,O., Shimanouchi,T. and Tasumi,M. (1977) J. Mol. Biol., 112, 535542.[Web of Science][Medline]
Bryant,R.G. (1996) Annu. Rev. Biophys. Biol. Struct., 25, 2953.[Web of Science][Medline]
Carugo,O. and Argos,P. (1998) Proteins: Struct. Funct. Genet., 31, 201213.[Web of Science][Medline]
Carugo,O. and Bordo,D. (1999) Acta Crystallogr., D55, 479483.
Dowdy,S. and Wearden,S. (1991) Statistics for Research. Wiley, New York, pp. 229281.
Ehrlich,L., Reczko,M., Bohr,H. and Wade R.C. (1998) Protein Engng, 11, 1119.
Ernst,J.A., Clubb,R.T., Zhou H.-X., Gronenborn,A.M. and Clore,G.M. (1996) Science, 267, 18131817.
Feher,V.A., Baldwin,E.P. and Dhalquist,W. (1996) Nature Struct. Biol., 3, 516521.[Web of Science][Medline]
Frauenfelder,H. and Petsko, GA. (1980) Biophys. J., 32, 465478.[Web of Science][Medline]
Frey,M. (1993) Top. Mol. Struct. Biol., 17, 100146.
Gerstein,M. and Chothia,C. (1996) Proc. Natl Acad. Sci. USA, 93, 1016710172.
Hobohm,U. and Sander,C. (1994) Protein Sci., 3, 522530.[Web of Science][Medline]
Kabsch,W.A. (1978) Acta Crystallogr., A34, 828828.
Karplus,P.A. and Faerman,C. (1994) Curr. Opin. Struct. Biol., 4, 770776.
Karplus,P.A. and Schulz,G.E. (1985) Naturwissenschaften, 72, 212213.[Web of Science]
Kraulis,P.J. (1991) J. Appl. Crystallogr., 24, 946950.
Lamzin,V.S. and Wilson,K.S. (1993) Acta Crystallogr., D49, 129147.
Lounnas,V. and Pettit,B.M. (1994a) Proteins: Struct. Funct. Genet., 18, 133147.[Web of Science][Medline]
Lounnas V. and Pettit,B.M. (1994b) Proteins: Struct. Funct. Genet., 18, 148160.[Web of Science][Medline]
Mak,T.C.W. and Zhou,G.-D. (1997). Crystallography in Modern Chemistry. Wiley, New York.
Makarov,V.A., Andrews,B.K. and Pettit,B.M. (1998) Biopolymers, 45, 469478.[Web of Science][Medline]
McDonald,I.K. and Thornton,J.M. (1994) J. Mol. Biol., 238, 777793.[Web of Science][Medline]
McLachan,A.D. (1979) J. Mol. Biol., 128, 4867.
Parak,F., Hartmann,H., Schmidt,M., Corongiu,G. and Clementi,E. (1992) Eur. Biophys. J., 21, 313320.[Web of Science][Medline]
Parthasarathy,S. and Murthy,M.R.N. (1997) Protein Sci., 6, 25612567.[Web of Science][Medline]
Pettit,B.M., Makarov,V.A. and Andrews,B.K. (1998) Curr. Opin. Struct. Biol., 8, 218221.[Web of Science][Medline]
Phillips,G.N. and Pettit,B.M. (1995) Protein Sci., 4, 149158.[Web of Science][Medline]
Podjarny,A.D., Howard,E.I., Urzhumetsev,A. and Grigera,J.R. (1997) Proteins: Struct. Funct. Genet., 28, 303312.[Web of Science][Medline]
Ringe,D. and Petsko,G.A. (1986) Methods Enzymol., 131, 389433.[Medline]
Schoenborn,B.P., Garcia,A. and Knott,R. (1995) Prog. Biophys. Mol. Biol., 64, 105119.[Web of Science][Medline]
Sternberg,M.J.E., Gabb,H.A. and Jackson,R.M. (1998) Curr. Opin. Struct. Biol., 8, 250256.[Web of Science][Medline]
Svergun,D.I., Richard,S., Koch,M.H.J., Sayers,Z., Kuprin,S. and Zaccai,G. (1998) Proc. Natl Acad. Sci. USA, 95, 22672272.
Thanki,N., Thornton,J.M. and Goodfellow,J.M. (1998) J. Mol. Biol., 202, 637657.
Tronrud,D.E. (1996) J. Appl. Crystallogr., 29, 100104.
Received May 28, 1999; revised September 23, 1999; accepted September 23, 1999.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. D. J. van Dijk and A. M. J. J. Bonvin Solvated docking: introducing water into the modelling of biomolecular complexes Bioinformatics, October 1, 2006; 22(19): 2340 - 2347. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Ogata and S. J. Wodak Conserved water molecules in MHC class-I molecules and their putative structural and functional roles Protein Eng. Des. Sel., August 1, 2002; 15(8): 697 - 705. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||




