Towards Reproducible Research Data Analyses in LHC Particle Physics

The reproducibility of the research data analysis requires having access not only to the original datasets, but also to the computing environment, the analysis software and the workflow used to produce the original results. We present the nascent CERN Analysis Preservation platform with a set of too...

Bibliographic Details
Main author: Simko, Tibor
Language: eng
Published: 2017
Subjects:
Online access: http://cds.cern.ch/record/2263664
_version_ 1780954255651241984
author Simko, Tibor
author_facet Simko, Tibor
author_sort Simko, Tibor
collection CERN
description The reproducibility of a research data analysis requires access not only to the original datasets, but also to the computing environment, the analysis software and the workflow used to produce the original results. We present the nascent CERN Analysis Preservation platform with a set of tools developed to support particle physics researchers in preserving the knowledge around analyses so that capturing, sharing, reusing and reinterpreting data becomes easier. The presentation will focus on three pillars: (i) capturing structured knowledge information about data analysis processes; (ii) capturing the computing environment, the software code, the datasets, the configuration and other information assets used in data analyses; (iii) re-instantiating preserved analyses on a containerised computing cloud for the purposes of re-validation and re-interpretation.
id cern-2263664
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2017
record_format invenio
spelling cern-2263664 2019-09-30T06:29:59Z http://cds.cern.ch/record/2263664 eng Simko, Tibor Towards Reproducible Research Data Analyses in LHC Particle Physics ILIDE2017 Talk The reproducibility of the research data analysis requires having access not only to the original datasets, but also to the computing environment, the analysis software and the workflow used to produce the original results. We present the nascent CERN Analysis Preservation platform with a set of tools developed to support particle physics researchers in preserving the knowledge around analyses so that capturing, sharing, reusing and reinterpreting data becomes easier. The presentation will focus on three pillars: (i) capturing structured knowledge information about data analysis processes; (ii) capturing the computing environment, the software code, the datasets, the configuration and other information assets used in data analyses; (iii) re-instantiating of preserved analyses on a containerised computing cloud for the purposes of re-validation and re-interpretation. IT-TALK-2017-004 oai:cds.cern.ch:2263664 2017
spellingShingle Talk
Simko, Tibor
Towards Reproducible Research Data Analyses in LHC Particle Physics
title Towards Reproducible Research Data Analyses in LHC Particle Physics
title_full Towards Reproducible Research Data Analyses in LHC Particle Physics
title_fullStr Towards Reproducible Research Data Analyses in LHC Particle Physics
title_full_unstemmed Towards Reproducible Research Data Analyses in LHC Particle Physics
title_short Towards Reproducible Research Data Analyses in LHC Particle Physics
title_sort towards reproducible research data analyses in lhc particle physics
topic Talk
url http://cds.cern.ch/record/2263664
work_keys_str_mv AT simkotibor towardsreproducibleresearchdataanalysesinlhcparticlephysics
AT simkotibor ilide2017