Cargando…
If these data could talk
In the last few decades, data-driven methods have come to dominate many fields of scientific inquiry. Open data and open-source software have enabled the rapid implementation of novel methods to manage and analyze the growing flood of data. However, it has become apparent that many scientific fields...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5584398/ https://www.ncbi.nlm.nih.gov/pubmed/28872630 http://dx.doi.org/10.1038/sdata.2017.114 |
_version_ | 1783261469388832768 |
---|---|
author | Pasquier, Thomas Lau, Matthew K. Trisovic, Ana Boose, Emery R. Couturier, Ben Crosas, Mercè Ellison, Aaron M. Gibson, Valerie Jones, Chris R. Seltzer, Margo |
author_facet | Pasquier, Thomas Lau, Matthew K. Trisovic, Ana Boose, Emery R. Couturier, Ben Crosas, Mercè Ellison, Aaron M. Gibson, Valerie Jones, Chris R. Seltzer, Margo |
author_sort | Pasquier, Thomas |
collection | PubMed |
description | In the last few decades, data-driven methods have come to dominate many fields of scientific inquiry. Open data and open-source software have enabled the rapid implementation of novel methods to manage and analyze the growing flood of data. However, it has become apparent that many scientific fields exhibit distressingly low rates of reproducibility. Although there are many dimensions to this issue, we believe that there is a lack of formalism used when describing end-to-end published results, from the data source to the analysis to the final published results. Even when authors do their best to make their research and data accessible, this lack of formalism reduces the clarity and efficiency of reporting, which contributes to issues of reproducibility. Data provenance aids both reproducibility through systematic and formal records of the relationships among data sources, processes, datasets, publications and researchers. |
format | Online Article Text |
id | pubmed-5584398 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-55843982017-09-12 If these data could talk Pasquier, Thomas Lau, Matthew K. Trisovic, Ana Boose, Emery R. Couturier, Ben Crosas, Mercè Ellison, Aaron M. Gibson, Valerie Jones, Chris R. Seltzer, Margo Sci Data Comment In the last few decades, data-driven methods have come to dominate many fields of scientific inquiry. Open data and open-source software have enabled the rapid implementation of novel methods to manage and analyze the growing flood of data. However, it has become apparent that many scientific fields exhibit distressingly low rates of reproducibility. Although there are many dimensions to this issue, we believe that there is a lack of formalism used when describing end-to-end published results, from the data source to the analysis to the final published results. Even when authors do their best to make their research and data accessible, this lack of formalism reduces the clarity and efficiency of reporting, which contributes to issues of reproducibility. Data provenance aids both reproducibility through systematic and formal records of the relationships among data sources, processes, datasets, publications and researchers. Nature Publishing Group 2017-09-05 /pmc/articles/PMC5584398/ /pubmed/28872630 http://dx.doi.org/10.1038/sdata.2017.114 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Comment Pasquier, Thomas Lau, Matthew K. Trisovic, Ana Boose, Emery R. Couturier, Ben Crosas, Mercè Ellison, Aaron M. Gibson, Valerie Jones, Chris R. Seltzer, Margo If these data could talk |
title | If these data could talk |
title_full | If these data could talk |
title_fullStr | If these data could talk |
title_full_unstemmed | If these data could talk |
title_short | If these data could talk |
title_sort | if these data could talk |
topic | Comment |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5584398/ https://www.ncbi.nlm.nih.gov/pubmed/28872630 http://dx.doi.org/10.1038/sdata.2017.114 |
work_keys_str_mv | AT pasquierthomas ifthesedatacouldtalk AT laumatthewk ifthesedatacouldtalk AT trisovicana ifthesedatacouldtalk AT booseemeryr ifthesedatacouldtalk AT couturierben ifthesedatacouldtalk AT crosasmerce ifthesedatacouldtalk AT ellisonaaronm ifthesedatacouldtalk AT gibsonvalerie ifthesedatacouldtalk AT joneschrisr ifthesedatacouldtalk AT seltzermargo ifthesedatacouldtalk |