More Than Preservation: A Researcher-Centered Approach to Reproducibility in Data Science

As data volumes and complexity in data science work increase, reusability of experimental studies becomes increasingly important. Our research in data-intensive High Energy Physics shows that supporting and motivating data workers in preserving and sharing their resources is a key challenge for futu...

Bibliographic Details
Main authors: Feger, Sebastian Stefan; Woźniak, Paweł W
Language: eng
Published: 2019
Subjects: Computing and Computers
Online access: http://cds.cern.ch/record/2677268
_version_ 1780962760804270080
author Feger, Sebastian Stefan
Woźniak, Paweł W
author_facet Feger, Sebastian Stefan
Woźniak, Paweł W
author_sort Feger, Sebastian Stefan
collection CERN
description As data volumes and complexity in data science work increase, reusability of experimental studies becomes increasingly important. Our research in data-intensive High Energy Physics shows that supporting and motivating data workers in preserving and sharing their resources is a key challenge for future data science work. We report on our studies of practices around preservation that show how secondary uses of preservation technology and non-conventional design tools incentivize best practices and reshape analysts' perceptions of research tools. We expect that our work will impact sustainability and reusability in data science well beyond experimental physics.
id cern-2677268
institution European Organization for Nuclear Research
language eng
publishDate 2019
record_format invenio
spelling cern-26772682019-09-30T06:29:59Zhttp://cds.cern.ch/record/2677268engFeger, Sebastian StefanWoźniak, Paweł WMore Than Preservation: A Researcher-Centered Approach to Reproducibility in Data ScienceComputing and ComputersAs data volumes and complexity in data science work increase, reusability of experimental studies becomes increasingly important. Our research in data-intensive High Energy Physics shows that supporting and motivating data workers in preserving and sharing their resources is a key challenge for future data science work. We report on our studies of practices around preservation that show how secondary uses of preservation technology and non-conventional design tools incentivize best practices and reshape analysts' perceptions of research tools. We expect that our work will impact sustainability and reusability in data science well beyond experimental physics.CERN-OPEN-2019-003oai:cds.cern.ch:26772682019-01-12
spellingShingle Computing and Computers
Feger, Sebastian Stefan
Woźniak, Paweł W
More Than Preservation: A Researcher-Centered Approach to Reproducibility in Data Science
title More Than Preservation: A Researcher-Centered Approach to Reproducibility in Data Science
title_full More Than Preservation: A Researcher-Centered Approach to Reproducibility in Data Science
title_fullStr More Than Preservation: A Researcher-Centered Approach to Reproducibility in Data Science
title_full_unstemmed More Than Preservation: A Researcher-Centered Approach to Reproducibility in Data Science
title_short More Than Preservation: A Researcher-Centered Approach to Reproducibility in Data Science
title_sort more than preservation: a researcher-centered approach to reproducibility in data science
topic Computing and Computers
url http://cds.cern.ch/record/2677268
work_keys_str_mv AT fegersebastianstefan morethanpreservationaresearchercenteredapproachtoreproducibilityindatascience
AT wozniakpawełw morethanpreservationaresearchercenteredapproachtoreproducibilityindatascience