Cargando…
More Than Preservation: A Researcher-Centered Approach to Reproducibility in Data Science
As data volumes and complexity in data science work increase, reusability of experimental studies becomes increasingly important. Our research in data-intensive High Energy Physics shows that supporting and motivating data workers in preserving and sharing their resources is a key challenge for futu...
Autores principales: | , |
---|---|
Lenguaje: | eng |
Publicado: |
2019
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/2677268 |
_version_ | 1780962760804270080 |
---|---|
author | Feger, Sebastian Stefan Woźniak, Paweł W |
author_facet | Feger, Sebastian Stefan Woźniak, Paweł W |
author_sort | Feger, Sebastian Stefan |
collection | CERN |
description | As data volumes and complexity in data science work increase, reusability of experimental studies becomes increasingly important. Our research in data-intensive High Energy Physics shows that supporting and motivating data workers in preserving and sharing their resources is a key challenge for future data science work. We report on our studies of practices around preservation that show how secondary uses of preservation technology and non-conventional design tools incentivize best practices and reshape analysts' perceptions of research tools. We expect that our work will impact sustainability and reusability in data science well beyond experimental physics. |
id | cern-2677268 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2019 |
record_format | invenio |
spelling | cern-26772682019-09-30T06:29:59Zhttp://cds.cern.ch/record/2677268engFeger, Sebastian StefanWoźniak, Paweł WMore Than Preservation: A Researcher-Centered Approach to Reproducibility in Data ScienceComputing and ComputersAs data volumes and complexity in data science work increase, reusability of experimental studies becomes increasingly important. Our research in data-intensive High Energy Physics shows that supporting and motivating data workers in preserving and sharing their resources is a key challenge for future data science work. We report on our studies of practices around preservation that show how secondary uses of preservation technology and non-conventional design tools incentivize best practices and reshape analysts' perceptions of research tools. We expect that our work will impact sustainability and reusability in data science well beyond experimental physics.CERN-OPEN-2019-003oai:cds.cern.ch:26772682019-01-12 |
spellingShingle | Computing and Computers Feger, Sebastian Stefan Woźniak, Paweł W More Than Preservation: A Researcher-Centered Approach to Reproducibility in Data Science |
title | More Than Preservation: A Researcher-Centered Approach to Reproducibility in Data Science |
title_full | More Than Preservation: A Researcher-Centered Approach to Reproducibility in Data Science |
title_fullStr | More Than Preservation: A Researcher-Centered Approach to Reproducibility in Data Science |
title_full_unstemmed | More Than Preservation: A Researcher-Centered Approach to Reproducibility in Data Science |
title_short | More Than Preservation: A Researcher-Centered Approach to Reproducibility in Data Science |
title_sort | more than preservation: a researcher-centered approach to reproducibility in data science |
topic | Computing and Computers |
url | http://cds.cern.ch/record/2677268 |
work_keys_str_mv | AT fegersebastianstefan morethanpreservationaresearchercenteredapproachtoreproducibilityindatascience AT wozniakpawełw morethanpreservationaresearchercenteredapproachtoreproducibilityindatascience |