Protein Engineering, Vol. 15, No. 3, 169-183,
March 2002
© 2002 Oxford University Press
Knowledge-based selection of targets for structural genomics
Institute for Bioinformatics, GSFNational Research Center for Environment and Health, Ingolstädter Landstrasse 1, 85764 Neuherberg, Germany
E-mail: d.frishman{at}gsf.de
The problem of rational target selection for protein structure determination in structural genomics projects on microbes is addressed. A flexible computational procedure is described that directly incorporates the whole body of annotation available in the PEDANT genome database into the sequence clustering and selection process in order to identify proteins that are likely to possess currently unknown structural domains. Filtering out gene products based on predicted structural features, such as known three-dimensional structures and transmembrane regions, allows one to reduce the complexity of neighbor relationships between sequences and all but eliminates the need for further partitioning of single-linkage clusters into disjoint protein groups corresponding to homologous families. The results of a large-scale computation experiment in which exemplary target selection for 32 prokaryotic genomes was conducted are presented.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. Hou, G. E. Sims, C. Zhang, and S.-H. Kim A global representation of the protein fold space PNAS, March 4, 2003; 100(5): 2386 - 2390. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Frishman, M. Mokrejs, D. Kosykh, G. Kastenmuller, G. Kolesov, I. Zubrzycki, C. Gruber, B. Geier, A. Kaps, K. Albermann, et al. The PEDANT genome database Nucleic Acids Res., January 1, 2003; 31(1): 207 - 211. [Abstract] [Full Text] [PDF] |
||||

