Protein Engineering vol. 16 no. 6 pp. 397-406, 2003
© 2003 Oxford University Press
Using multiple sequence correlation analysis to characterize functionally important protein regions
Department of Chemical Engineering, The Pennsylvania State University, 112 Fenske Laboratory, University Park, PA 16802, USA
1 To whom correspondence should be addressed. e-mail: costas{at}psu.edu
Protein co-evolution under structural and functional constraints necessitates the preservation of important interactions. Identifying functionally important regions poses many obstacles in protein engineering efforts. In this paper, we present a bioinformatics-inspired approach (residue correlation analysis, RCA) for predicting functionally important domains from protein family sequence data. RCA is comprised of two major steps: (i) identifying pairs of residue positions that mutate in a coordinated manner, and (ii) using these results to identify protein regions that interact with an uncommonly high number of other residues. We hypothesize that strongly correlated pairs result not only from contacting pairs, but also from residues that participate in conformational changes involved during catalysis or important interactions necessary for retaining functionality. The results show that highly mobile loops that assist in ligand association/dissociation tend to exhibit high correlation. RCA results exhibit good agreement with the findings of experimental and molecular dynamics studies for the three protein families that are analyzed: (i) DHFR (dihydrofolate reductase), (ii) cyclophilin, and (iii) formyl-transferase. Specifically, the specificity (percentage of correct predictions) in all three cases is substantially higher than those obtained by entropic measures or contacting residue pairs. In addition, we use our approach in a predictive fashion to identify important regions of a transmembrane amino acid transporter protein for which there is limited structural and functional information available.
Received December 1, 2002; revised February 7, 2003; accepted April 15, 2003.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. Fuchs, A. J. Martin-Galiano, M. Kalman, S. Fleishman, N. Ben-Tal, and D. Frishman Co-evolving residues in membrane proteins Bioinformatics, December 15, 2007; 23(24): 3312 - 3319. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. C. Saraf, A. R. Horswill, S. J. Benkovic, and C. D. Maranas From the Cover: FamClash: A method for ranking the activity of engineered enzymes PNAS, March 23, 2004; 101(12): 4142 - 4147. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Shen and M. Vihinen Conservation and covariance in PH domain sequences: physicochemical profile and information theoretical analysis of XLA-causing mutations in the Btk PH domain Protein Eng. Des. Sel., March 1, 2004; 17(3): 267 - 276. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. C. Saraf and C. D. Maranas Using a residue clash map to functionally characterize protein recombination hybrids Protein Eng. Des. Sel., December 1, 2003; 16(12): 1025 - 1034. [Abstract] [Full Text] [PDF] |
||||


