Cargando…

If these data could talk

In the last few decades, data-driven methods have come to dominate many fields of scientific inquiry. Open data and open-source software have enabled the rapid implementation of novel methods to manage and analyze the growing flood of data. However, it has become apparent that many scientific fields...

Descripción completa

Detalles Bibliográficos
Autores principales: Pasquier, Thomas, Lau, Matthew K., Trisovic, Ana, Boose, Emery R., Couturier, Ben, Crosas, Mercè, Ellison, Aaron M., Gibson, Valerie, Jones, Chris R., Seltzer, Margo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5584398/
https://www.ncbi.nlm.nih.gov/pubmed/28872630
http://dx.doi.org/10.1038/sdata.2017.114
_version_ 1783261469388832768
author Pasquier, Thomas
Lau, Matthew K.
Trisovic, Ana
Boose, Emery R.
Couturier, Ben
Crosas, Mercè
Ellison, Aaron M.
Gibson, Valerie
Jones, Chris R.
Seltzer, Margo
author_facet Pasquier, Thomas
Lau, Matthew K.
Trisovic, Ana
Boose, Emery R.
Couturier, Ben
Crosas, Mercè
Ellison, Aaron M.
Gibson, Valerie
Jones, Chris R.
Seltzer, Margo
author_sort Pasquier, Thomas
collection PubMed
description In the last few decades, data-driven methods have come to dominate many fields of scientific inquiry. Open data and open-source software have enabled the rapid implementation of novel methods to manage and analyze the growing flood of data. However, it has become apparent that many scientific fields exhibit distressingly low rates of reproducibility. Although there are many dimensions to this issue, we believe that there is a lack of formalism used when describing end-to-end published results, from the data source to the analysis to the final published results. Even when authors do their best to make their research and data accessible, this lack of formalism reduces the clarity and efficiency of reporting, which contributes to issues of reproducibility. Data provenance aids both reproducibility through systematic and formal records of the relationships among data sources, processes, datasets, publications and researchers.
format Online
Article
Text
id pubmed-5584398
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-55843982017-09-12 If these data could talk Pasquier, Thomas Lau, Matthew K. Trisovic, Ana Boose, Emery R. Couturier, Ben Crosas, Mercè Ellison, Aaron M. Gibson, Valerie Jones, Chris R. Seltzer, Margo Sci Data Comment In the last few decades, data-driven methods have come to dominate many fields of scientific inquiry. Open data and open-source software have enabled the rapid implementation of novel methods to manage and analyze the growing flood of data. However, it has become apparent that many scientific fields exhibit distressingly low rates of reproducibility. Although there are many dimensions to this issue, we believe that there is a lack of formalism used when describing end-to-end published results, from the data source to the analysis to the final published results. Even when authors do their best to make their research and data accessible, this lack of formalism reduces the clarity and efficiency of reporting, which contributes to issues of reproducibility. Data provenance aids both reproducibility through systematic and formal records of the relationships among data sources, processes, datasets, publications and researchers. Nature Publishing Group 2017-09-05 /pmc/articles/PMC5584398/ /pubmed/28872630 http://dx.doi.org/10.1038/sdata.2017.114 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Comment
Pasquier, Thomas
Lau, Matthew K.
Trisovic, Ana
Boose, Emery R.
Couturier, Ben
Crosas, Mercè
Ellison, Aaron M.
Gibson, Valerie
Jones, Chris R.
Seltzer, Margo
If these data could talk
title If these data could talk
title_full If these data could talk
title_fullStr If these data could talk
title_full_unstemmed If these data could talk
title_short If these data could talk
title_sort if these data could talk
topic Comment
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5584398/
https://www.ncbi.nlm.nih.gov/pubmed/28872630
http://dx.doi.org/10.1038/sdata.2017.114
work_keys_str_mv AT pasquierthomas ifthesedatacouldtalk
AT laumatthewk ifthesedatacouldtalk
AT trisovicana ifthesedatacouldtalk
AT booseemeryr ifthesedatacouldtalk
AT couturierben ifthesedatacouldtalk
AT crosasmerce ifthesedatacouldtalk
AT ellisonaaronm ifthesedatacouldtalk
AT gibsonvalerie ifthesedatacouldtalk
AT joneschrisr ifthesedatacouldtalk
AT seltzermargo ifthesedatacouldtalk