Cargando…
Towards Reproducible Research Data Analyses in LHC Particle Physics
The reproducibility of the research data analysis requires having access not only to the original datasets, but also to the computing environment, the analysis software and the workflow used to produce the original results. We present the nascent CERN Analysis Preservation platform with a set of too...
Autor principal: | |
---|---|
Lenguaje: | eng |
Publicado: |
2017
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/2263664 |
_version_ | 1780954255651241984 |
---|---|
author | Simko, Tibor |
author_facet | Simko, Tibor |
author_sort | Simko, Tibor |
collection | CERN |
description | The reproducibility of the research data analysis requires having access not only to the original datasets, but also to the computing environment, the analysis software and the workflow used to produce the original results. We present the nascent CERN Analysis Preservation platform with a set of tools developed to support particle physics researchers in preserving the knowledge around analyses so that capturing, sharing, reusing and reinterpreting data becomes easier. The presentation will focus on three pillars: (i) capturing structured knowledge information about data analysis processes; (ii) capturing the computing environment, the software code, the datasets, the configuration and other information assets used in data analyses; (iii) re-instantiating of preserved analyses on a containerised computing cloud for the purposes of re-validation and re-interpretation. |
id | cern-2263664 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2017 |
record_format | invenio |
spelling | cern-22636642019-09-30T06:29:59Zhttp://cds.cern.ch/record/2263664engSimko, TiborTowards Reproducible Research Data Analyses in LHC Particle PhysicsILIDE2017TalkThe reproducibility of the research data analysis requires having access not only to the original datasets, but also to the computing environment, the analysis software and the workflow used to produce the original results. We present the nascent CERN Analysis Preservation platform with a set of tools developed to support particle physics researchers in preserving the knowledge around analyses so that capturing, sharing, reusing and reinterpreting data becomes easier. The presentation will focus on three pillars: (i) capturing structured knowledge information about data analysis processes; (ii) capturing the computing environment, the software code, the datasets, the configuration and other information assets used in data analyses; (iii) re-instantiating of preserved analyses on a containerised computing cloud for the purposes of re-validation and re-interpretation.IT-TALK-2017-004oai:cds.cern.ch:22636642017 |
spellingShingle | Talk Simko, Tibor Towards Reproducible Research Data Analyses in LHC Particle Physics |
title | Towards Reproducible Research Data Analyses in LHC Particle Physics |
title_full | Towards Reproducible Research Data Analyses in LHC Particle Physics |
title_fullStr | Towards Reproducible Research Data Analyses in LHC Particle Physics |
title_full_unstemmed | Towards Reproducible Research Data Analyses in LHC Particle Physics |
title_short | Towards Reproducible Research Data Analyses in LHC Particle Physics |
title_sort | towards reproducible research data analyses in lhc particle physics |
topic | Talk |
url | http://cds.cern.ch/record/2263664 |
work_keys_str_mv | AT simkotibor towardsreproducibleresearchdataanalysesinlhcparticlephysics AT simkotibor ilide2017 |