Protein Engineering, Vol. 12, No. 11, 919-925,
November 1999
© 1999 Oxford University Press
Class-defining characteristics in the mouse heavy chains of variable domains
Department of Mathematics, Rutgers University, Piscataway, NJ 08854, USA
Analysis of residue correlation in over 2700 mouse heavy chains of the VH domains was carried out on three hierarchical levels. At the position level, statistical analysis revealed 45 positions that conserve similar residues in almost all chains. At the fragment level, the focus of investigation shifted to the study of combinations of amino acids in strands and loops. It was found that no more than 10 patterns were sufficient for describing strands and loops in the chains. At the sequence level, we determined all possible combinations of these patterns and classified the mouse heavy chains. Comparison of the sequences in the eight classes revealed residues at the class-determining positions that were unique to each class. Because a strong correlation of residues was found, one only needs several residues to classify a sequence. It follows that no all residue alignment procedure is necessary to divide sequences into classes. An important corollary of our approach is the possibility of predicting residues in an incomplete sequence from a small sequence fragment. On the basis of our analysis of mouse heavy chains we hypothesize about the presently unknown mouse VH germline repertoire.
Keywords: immunoglobulin variable domain/protein sequence classification/residue classification/residue prediction in sequences
1 To whom correspondence should be addressed; email: akister{at}math.rutgers.edu
Dedicated to the memory of Oleg Ptitsyn
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
B. Reva, A. Kister, S. Topiol, and I. Gelfand Determining the roles of different chain fragments in recognition of immunoglobulin fold Protein Eng. Des. Sel., January 1, 2002; 15(1): 13 - 19. [Abstract] [Full Text] [PDF] |
||||
