Cargando…

Toward collaborative open data science in metabolomics using Jupyter Notebooks and cloud computing

BACKGROUND: A lack of transparency and reporting standards in the scientific community has led to increasing and widespread concerns relating to reproduction and integrity of results. As an omics science, which generates vast amounts of data and relies heavily on data science for deriving biological...

Descripción completa

Detalles Bibliográficos
Autores principales: Mendez, Kevin M., Pritchard, Leighton, Reinke, Stacey N., Broadhurst, David I.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6745024/
https://www.ncbi.nlm.nih.gov/pubmed/31522294
http://dx.doi.org/10.1007/s11306-019-1588-0
_version_ 1783451468306579456
author Mendez, Kevin M.
Pritchard, Leighton
Reinke, Stacey N.
Broadhurst, David I.
author_facet Mendez, Kevin M.
Pritchard, Leighton
Reinke, Stacey N.
Broadhurst, David I.
author_sort Mendez, Kevin M.
collection PubMed
description BACKGROUND: A lack of transparency and reporting standards in the scientific community has led to increasing and widespread concerns relating to reproduction and integrity of results. As an omics science, which generates vast amounts of data and relies heavily on data science for deriving biological meaning, metabolomics is highly vulnerable to irreproducibility. The metabolomics community has made substantial efforts to align with FAIR data standards by promoting open data formats, data repositories, online spectral libraries, and metabolite databases. Open data analysis platforms also exist; however, they tend to be inflexible and rely on the user to adequately report their methods and results. To enable FAIR data science in metabolomics, methods and results need to be transparently disseminated in a manner that is rapid, reusable, and fully integrated with the published work. To ensure broad use within the community such a framework also needs to be inclusive and intuitive for both computational novices and experts alike. AIM OF REVIEW: To encourage metabolomics researchers from all backgrounds to take control of their own data science, mould it to their personal requirements, and enthusiastically share resources through open science. KEY SCIENTIFIC CONCEPTS OF REVIEW: This tutorial introduces the concept of interactive web-based computational laboratory notebooks. The reader is guided through a set of experiential tutorials specifically targeted at metabolomics researchers, based around the Jupyter Notebook web application, GitHub data repository, and Binder cloud computing platform. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s11306-019-1588-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6745024
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-67450242019-09-27 Toward collaborative open data science in metabolomics using Jupyter Notebooks and cloud computing Mendez, Kevin M. Pritchard, Leighton Reinke, Stacey N. Broadhurst, David I. Metabolomics Review Article BACKGROUND: A lack of transparency and reporting standards in the scientific community has led to increasing and widespread concerns relating to reproduction and integrity of results. As an omics science, which generates vast amounts of data and relies heavily on data science for deriving biological meaning, metabolomics is highly vulnerable to irreproducibility. The metabolomics community has made substantial efforts to align with FAIR data standards by promoting open data formats, data repositories, online spectral libraries, and metabolite databases. Open data analysis platforms also exist; however, they tend to be inflexible and rely on the user to adequately report their methods and results. To enable FAIR data science in metabolomics, methods and results need to be transparently disseminated in a manner that is rapid, reusable, and fully integrated with the published work. To ensure broad use within the community such a framework also needs to be inclusive and intuitive for both computational novices and experts alike. AIM OF REVIEW: To encourage metabolomics researchers from all backgrounds to take control of their own data science, mould it to their personal requirements, and enthusiastically share resources through open science. KEY SCIENTIFIC CONCEPTS OF REVIEW: This tutorial introduces the concept of interactive web-based computational laboratory notebooks. The reader is guided through a set of experiential tutorials specifically targeted at metabolomics researchers, based around the Jupyter Notebook web application, GitHub data repository, and Binder cloud computing platform. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s11306-019-1588-0) contains supplementary material, which is available to authorized users. Springer US 2019-09-14 2019 /pmc/articles/PMC6745024/ /pubmed/31522294 http://dx.doi.org/10.1007/s11306-019-1588-0 Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Review Article
Mendez, Kevin M.
Pritchard, Leighton
Reinke, Stacey N.
Broadhurst, David I.
Toward collaborative open data science in metabolomics using Jupyter Notebooks and cloud computing
title Toward collaborative open data science in metabolomics using Jupyter Notebooks and cloud computing
title_full Toward collaborative open data science in metabolomics using Jupyter Notebooks and cloud computing
title_fullStr Toward collaborative open data science in metabolomics using Jupyter Notebooks and cloud computing
title_full_unstemmed Toward collaborative open data science in metabolomics using Jupyter Notebooks and cloud computing
title_short Toward collaborative open data science in metabolomics using Jupyter Notebooks and cloud computing
title_sort toward collaborative open data science in metabolomics using jupyter notebooks and cloud computing
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6745024/
https://www.ncbi.nlm.nih.gov/pubmed/31522294
http://dx.doi.org/10.1007/s11306-019-1588-0
work_keys_str_mv AT mendezkevinm towardcollaborativeopendatascienceinmetabolomicsusingjupyternotebooksandcloudcomputing
AT pritchardleighton towardcollaborativeopendatascienceinmetabolomicsusingjupyternotebooksandcloudcomputing
AT reinkestaceyn towardcollaborativeopendatascienceinmetabolomicsusingjupyternotebooksandcloudcomputing
AT broadhurstdavidi towardcollaborativeopendatascienceinmetabolomicsusingjupyternotebooksandcloudcomputing