
MDverse: Shedding Light on the Dark Matter of Molecular Dynamics Simulations

The rise of open science and the absence of a global dedicated data repository for molecular dynamics (MD) simulations has led to the accumulation of MD files in generalist data repositories, constituting the dark matter of MD - data that is technically accessible, but neither indexed, curated, or e...


Bibliographic Details
Main Authors: Tiemann, Johanna K. S., Szczuka, Magdalena, Bouarroudj, Lisa, Oussaren, Mohamed, Garcia, Steven, Howard, Rebecca J., Delemotte, Lucie, Lindahl, Erik, Baaden, Marc, Lindorff-Larsen, Kresten, Chavent, Matthieu, Poulain, Pierre
Format: Online Article Text
Language: English
Published: Cold Spring Harbor Laboratory 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10187166/
https://www.ncbi.nlm.nih.gov/pubmed/37205542
http://dx.doi.org/10.1101/2023.05.02.538537
_version_ 1785042696778285056
author Tiemann, Johanna K. S.
Szczuka, Magdalena
Bouarroudj, Lisa
Oussaren, Mohamed
Garcia, Steven
Howard, Rebecca J.
Delemotte, Lucie
Lindahl, Erik
Baaden, Marc
Lindorff-Larsen, Kresten
Chavent, Matthieu
Poulain, Pierre
author_facet Tiemann, Johanna K. S.
Szczuka, Magdalena
Bouarroudj, Lisa
Oussaren, Mohamed
Garcia, Steven
Howard, Rebecca J.
Delemotte, Lucie
Lindahl, Erik
Baaden, Marc
Lindorff-Larsen, Kresten
Chavent, Matthieu
Poulain, Pierre
author_sort Tiemann, Johanna K. S.
collection PubMed
description The rise of open science and the absence of a global dedicated data repository for molecular dynamics (MD) simulations has led to the accumulation of MD files in generalist data repositories, constituting the dark matter of MD - data that is technically accessible, but neither indexed, curated, or easily searchable. Leveraging an original search strategy, we found and indexed about 250,000 files and 2,000 datasets from Zenodo, Figshare and Open Science Framework. With a focus on files produced by the Gromacs MD software, we illustrate the potential offered by the mining of publicly available MD data. We identified systems with specific molecular composition and were able to characterize essential parameters of MD simulation, such as temperature and simulation length, and identify model resolution, such as all-atom and coarse-grain. Based on this analysis, we inferred metadata to propose a search engine prototype to explore collected MD data. To continue in this direction, we call on the community to pursue the effort of sharing MD data, and increase populating and standardizing metadata to reuse this valuable matter.
format Online
Article
Text
id pubmed-10187166
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-101871662023-05-17 MDverse: Shedding Light on the Dark Matter of Molecular Dynamics Simulations Tiemann, Johanna K. S. Szczuka, Magdalena Bouarroudj, Lisa Oussaren, Mohamed Garcia, Steven Howard, Rebecca J. Delemotte, Lucie Lindahl, Erik Baaden, Marc Lindorff-Larsen, Kresten Chavent, Matthieu Poulain, Pierre bioRxiv Article The rise of open science and the absence of a global dedicated data repository for molecular dynamics (MD) simulations has led to the accumulation of MD files in generalist data repositories, constituting the dark matter of MD - data that is technically accessible, but neither indexed, curated, or easily searchable. Leveraging an original search strategy, we found and indexed about 250,000 files and 2,000 datasets from Zenodo, Figshare and Open Science Framework. With a focus on files produced by the Gromacs MD software, we illustrate the potential offered by the mining of publicly available MD data. We identified systems with specific molecular composition and were able to characterize essential parameters of MD simulation, such as temperature and simulation length, and identify model resolution, such as all-atom and coarse-grain. Based on this analysis, we inferred metadata to propose a search engine prototype to explore collected MD data. To continue in this direction, we call on the community to pursue the effort of sharing MD data, and increase populating and standardizing metadata to reuse this valuable matter. Cold Spring Harbor Laboratory 2023-05-02 /pmc/articles/PMC10187166/ /pubmed/37205542 http://dx.doi.org/10.1101/2023.05.02.538537 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Tiemann, Johanna K. S.
Szczuka, Magdalena
Bouarroudj, Lisa
Oussaren, Mohamed
Garcia, Steven
Howard, Rebecca J.
Delemotte, Lucie
Lindahl, Erik
Baaden, Marc
Lindorff-Larsen, Kresten
Chavent, Matthieu
Poulain, Pierre
MDverse: Shedding Light on the Dark Matter of Molecular Dynamics Simulations
title MDverse: Shedding Light on the Dark Matter of Molecular Dynamics Simulations
title_full MDverse: Shedding Light on the Dark Matter of Molecular Dynamics Simulations
title_fullStr MDverse: Shedding Light on the Dark Matter of Molecular Dynamics Simulations
title_full_unstemmed MDverse: Shedding Light on the Dark Matter of Molecular Dynamics Simulations
title_short MDverse: Shedding Light on the Dark Matter of Molecular Dynamics Simulations
title_sort mdverse: shedding light on the dark matter of molecular dynamics simulations
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10187166/
https://www.ncbi.nlm.nih.gov/pubmed/37205542
http://dx.doi.org/10.1101/2023.05.02.538537
work_keys_str_mv AT tiemannjohannaks mdversesheddinglightonthedarkmatterofmoleculardynamicssimulations
AT szczukamagdalena mdversesheddinglightonthedarkmatterofmoleculardynamicssimulations
AT bouarroudjlisa mdversesheddinglightonthedarkmatterofmoleculardynamicssimulations
AT oussarenmohamed mdversesheddinglightonthedarkmatterofmoleculardynamicssimulations
AT garciasteven mdversesheddinglightonthedarkmatterofmoleculardynamicssimulations
AT howardrebeccaj mdversesheddinglightonthedarkmatterofmoleculardynamicssimulations
AT delemottelucie mdversesheddinglightonthedarkmatterofmoleculardynamicssimulations
AT lindahlerik mdversesheddinglightonthedarkmatterofmoleculardynamicssimulations
AT baadenmarc mdversesheddinglightonthedarkmatterofmoleculardynamicssimulations
AT lindorfflarsenkresten mdversesheddinglightonthedarkmatterofmoleculardynamicssimulations
AT chaventmatthieu mdversesheddinglightonthedarkmatterofmoleculardynamicssimulations
AT poulainpierre mdversesheddinglightonthedarkmatterofmoleculardynamicssimulations