Skip Navigation

This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (17)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by King, R. D.
Right arrow Articles by Page, D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by King, R. D.
Right arrow Articles by Page, D.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Protein Engineering, Vol. 13, No. 1, 15-19, January 2000
© 2000 Oxford University Press

Is it better to combine predictions?

Ross D. King1, Mohammed Ouali, Arbra T. Strong2, Alaaeldin Aly2, Adel Elmaghraby2, Mehmed Kantardzic2 and David Page3

Department of Computer Science, University of Wales, Aberystwyth Penglais, Aberystwyth, Ceredigion, SY23 3DB, Wales, UK, 2 Department of Engineering Mathematics and Computer Science, Speed Scientific School, University of Louisville, Louisville, KY 40292, USA and 3 Department of Biostatistics and Medical Informatics and Department of Computer Sciences, University of Wisconsin, 1300 University Avenue, Room 5795 Medical Sciences, Madison, WI 53706, USA.Email: page{at}biostac.wisc.edu

We have compared the accuracy of the individual protein secondary structure prediction methods: PHD, DSC, NNSSP and Predator against the accuracy obtained by combing the predictions of the methods. A range of ways of combing predictions were tested: voting, biased voting, linear discrimination, neural networks and decision trees. The combined methods that involve `learning' (the non-voting methods) were trained using a set of 496 non-homologous domains; this dataset was biased as some of the secondary structure prediction methods had used them for training. We used two independent test sets to compare predictions: the first consisted of 17 non-homologous domains from CASP3 (Third Community Wide Experiment on the Critical Assessment of Techniques for Protein Structure Prediction); the second set consisted of 405 domains that were selected in the same way as the training set, and were non-homologous to each other and the training set. On both test datasets the most accurate individual method was NNSSP, then PHD, DSC and the least accurate was Predator; however, it was not possible to conclusively show a significant difference between the individual methods. Comparing the accuracy of the single methods with that obtained by combing predictions it was found that it was better to use a combination of predictions. On both test datasets it was possible to obtain a ~3% improvement in accuracy by combing predictions. In most cases the combined methods were statistically significantly better (at P = 0.05 on the CASP3 test set, and P = 0.01 on the EBI test set). On the CASP3 test dataset there was no significant difference in accuracy between any of the combined method of prediction: on the EBI test dataset, linear discrimination and neural networks significantly outperformed voting techniques. We conclude that it is better to combine predictions.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Protein Eng Des SelHome page
M. Albrecht, S. C.E. Tosatto, T. Lengauer, and G. Valle
Simple consensus procedures are effective and sufficient in secondary structure prediction
Protein Eng. Des. Sel., July 1, 2003; 16(7): 459 - 462.
[Abstract] [Full Text] [PDF]


Home page
Protein Eng Des SelHome page
C. L. Wilson, S. J. Hubbard, and A. J. Doig
A critical assessment of the secondary structure {alpha}-helices and their termini in proteins
Protein Eng. Des. Sel., July 1, 2002; 15(7): 545 - 554.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
D. B. Kell, R. M. Darby, and J. Draper
Genomic Computing. Explanatory Analysis of Plant Expression Profiling Data Using Machine Learning
Plant Physiology, July 1, 2001; 126(3): 943 - 951.
[Full Text] [PDF]


Home page
Protein Eng Des SelHome page
A. C.R. Martin
The ups and downs of protein topology; rapid comparison of protein structure
Protein Eng. Des. Sel., December 1, 2000; 13(12): 829 - 837.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
R. Gijsbers, H. Ceulemans, W. Stalmans, and M. Bollen
Structural and Catalytic Similarities between Nucleotide Pyrophosphatases/Phosphodiesterases and Alkaline Phosphatases
J. Biol. Chem., January 5, 2001; 276(2): 1361 - 1368.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.