Cargando…

Bookshelf: a simple curation system for the storage of biomolecular simulation data

Molecular dynamics simulations can now routinely generate data sets of several hundreds of gigabytes in size. The ability to generate this data has become easier over recent years and the rate of data production is likely to increase rapidly in the near future. One major problem associated with this...

Descripción completa

Detalles Bibliográficos
Autores principales: Vohra, Shabana, Hall, Benjamin A., Holdbrook, Daniel A., Khalid, Syma, Biggin, Philip C.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3004466/
https://www.ncbi.nlm.nih.gov/pubmed/21169341
http://dx.doi.org/10.1093/database/baq033
_version_ 1782193982139269120
author Vohra, Shabana
Hall, Benjamin A.
Holdbrook, Daniel A.
Khalid, Syma
Biggin, Philip C.
author_facet Vohra, Shabana
Hall, Benjamin A.
Holdbrook, Daniel A.
Khalid, Syma
Biggin, Philip C.
author_sort Vohra, Shabana
collection PubMed
description Molecular dynamics simulations can now routinely generate data sets of several hundreds of gigabytes in size. The ability to generate this data has become easier over recent years and the rate of data production is likely to increase rapidly in the near future. One major problem associated with this vast amount of data is how to store it in a way that it can be easily retrieved at a later date. The obvious answer to this problem is a database. However, a key issue in the development and maintenance of such a database is its sustainability, which in turn depends on the ease of the deposition and retrieval process. Encouraging users to care about meta-data is difficult and thus the success of any storage system will ultimately depend on how well used by end-users the system is. In this respect we suggest that even a minimal amount of metadata if stored in a sensible fashion is useful, if only at the level of individual research groups. We discuss here, a simple database system which we call ‘Bookshelf’, that uses python in conjunction with a mysql database to provide an extremely simple system for curating and keeping track of molecular simulation data. It provides a user-friendly, scriptable solution to the common problem amongst biomolecular simulation laboratories; the storage, logging and subsequent retrieval of large numbers of simulations. Download URL: http://sbcb.bioch.ox.ac.uk/bookshelf/
format Text
id pubmed-3004466
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-30044662010-12-20 Bookshelf: a simple curation system for the storage of biomolecular simulation data Vohra, Shabana Hall, Benjamin A. Holdbrook, Daniel A. Khalid, Syma Biggin, Philip C. Database (Oxford) Database Tool Molecular dynamics simulations can now routinely generate data sets of several hundreds of gigabytes in size. The ability to generate this data has become easier over recent years and the rate of data production is likely to increase rapidly in the near future. One major problem associated with this vast amount of data is how to store it in a way that it can be easily retrieved at a later date. The obvious answer to this problem is a database. However, a key issue in the development and maintenance of such a database is its sustainability, which in turn depends on the ease of the deposition and retrieval process. Encouraging users to care about meta-data is difficult and thus the success of any storage system will ultimately depend on how well used by end-users the system is. In this respect we suggest that even a minimal amount of metadata if stored in a sensible fashion is useful, if only at the level of individual research groups. We discuss here, a simple database system which we call ‘Bookshelf’, that uses python in conjunction with a mysql database to provide an extremely simple system for curating and keeping track of molecular simulation data. It provides a user-friendly, scriptable solution to the common problem amongst biomolecular simulation laboratories; the storage, logging and subsequent retrieval of large numbers of simulations. Download URL: http://sbcb.bioch.ox.ac.uk/bookshelf/ Oxford University Press 2010-12-17 /pmc/articles/PMC3004466/ /pubmed/21169341 http://dx.doi.org/10.1093/database/baq033 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database Tool
Vohra, Shabana
Hall, Benjamin A.
Holdbrook, Daniel A.
Khalid, Syma
Biggin, Philip C.
Bookshelf: a simple curation system for the storage of biomolecular simulation data
title Bookshelf: a simple curation system for the storage of biomolecular simulation data
title_full Bookshelf: a simple curation system for the storage of biomolecular simulation data
title_fullStr Bookshelf: a simple curation system for the storage of biomolecular simulation data
title_full_unstemmed Bookshelf: a simple curation system for the storage of biomolecular simulation data
title_short Bookshelf: a simple curation system for the storage of biomolecular simulation data
title_sort bookshelf: a simple curation system for the storage of biomolecular simulation data
topic Database Tool
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3004466/
https://www.ncbi.nlm.nih.gov/pubmed/21169341
http://dx.doi.org/10.1093/database/baq033
work_keys_str_mv AT vohrashabana bookshelfasimplecurationsystemforthestorageofbiomolecularsimulationdata
AT hallbenjamina bookshelfasimplecurationsystemforthestorageofbiomolecularsimulationdata
AT holdbrookdaniela bookshelfasimplecurationsystemforthestorageofbiomolecularsimulationdata
AT khalidsyma bookshelfasimplecurationsystemforthestorageofbiomolecularsimulationdata
AT bigginphilipc bookshelfasimplecurationsystemforthestorageofbiomolecularsimulationdata