Skip Navigation


PEDS Advance Access originally published online on April 14, 2008
Protein Engineering Design and Selection 2008 21(6):369-377; doi:10.1093/protein/gzn012
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrowOA All Versions of this Article:
21/6/369    most recent
gzn012v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Simms, A. M.
Right arrow Articles by Daggett, V.
PubMed
Right arrow PubMed Citation
Right arrow Articles by Simms, A. M.
Right arrow Articles by Daggett, V.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2008 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Dynameomics: design of a computational lab workflow and scientific data repository for protein simulations

Andrew M. Simms1, Rudesh D. Toofanny2, Catherine Kehl1, Noah C. Benson1 and Valerie Daggett1,2,3

1Biomedical and Health Informatics Program, University of Washington, Seattle, WA 98195, USA 2Department of Bioengineering, University of Washington, Box 355013, Seattle, WA 98195-5013, USA

3 To whom correspondence should be addressed. E-mail: daggett{at}u.washington.edu

Dynameomics is a project to investigate and catalog the native-state dynamics and thermal unfolding pathways of representatives of all protein folds using solvated molecular dynamics simulations, as described in the preceding paper. Here we introduce the design of the molecular dynamics data warehouse, a scalable, reliable repository that houses simulation data that vastly simplifies management and access. In the succeeding paper, we describe the development of a complementary multidimensional database. A single protein unfolding or native-state simulation can take weeks to months to complete, and produces gigabytes of coordinate and analysis data. Mining information from over 3000 completed simulations is complicated and time-consuming. Even the simplest queries involve writing intricate programs that must be built from low-level file system access primitives and include significant logic to correctly locate and parse data of interest. As a result, programs to answer questions that require data from hundreds of simulations are very difficult to write. Thus, organization and access to simulation data have been major obstacles to the discovery of new knowledge in the Dynameomics project. This repository is used internally and is the foundation of the Dynameomics portal site http://www.dynameomics.org. By organizing simulation data into a scalable, manageable and accessible form, we can begin to address substantial questions that move us closer to solving biomedical and bioengineering problems.

Keywords: data warehouse/database/Dynameomics/OLAP/protein dynamics

Received March 3, 2008; revised March 3, 2008; accepted March 4, 2008.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Proc. Natl. Acad. Sci. USAHome page
D. A. C. Beck, D. O. V. Alonso, D. Inoyama, and V. Daggett
The intrinsic conformational propensities of the 20 naturally occurring amino acids and reflection of these propensities in proteins
PNAS, August 26, 2008; 105(34): 12259 - 12264.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.