Cargando…

RelMon: A general approach to QA, validation and physics analysis through comparison of large sets of histograms

The estimation of the compatibility of large amounts of histogram pairs is a recurrent problem in high energy physics. The issue is common to several different areas, from software quality monitoring to data certification, preservation and analysis. Given two sets of histograms, it is very important...

Descripción completa

Detalles Bibliográficos
Autor principal: Piparo, Danilo
Lenguaje:eng
Publicado: 2012
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/396/2/022011
http://cds.cern.ch/record/1505377
_version_ 1780927247395323904
author Piparo, Danilo
author_facet Piparo, Danilo
author_sort Piparo, Danilo
collection CERN
description The estimation of the compatibility of large amounts of histogram pairs is a recurrent problem in high energy physics. The issue is common to several different areas, from software quality monitoring to data certification, preservation and analysis. Given two sets of histograms, it is very important to be able to scrutinize the outcome of several goodness of fit tests, obtain a clear answer about the overall compatibility, easily spot the single anomalies and directly access the concerned histogram pairs. This procedure must be automated in order to reduce the human workload, therefore improving the process of identification of differences which is usually carried out by a trained human mind. Some solutions to this problem have been proposed, but they are experiment specific. RelMon depends only on ROOT and offers several goodness of fit tests (e.g. chi-squared or Kolmogorov-Smirnov). It produces highly readable web reports, in which aggregations of the comparisons rankings are available as well as all the plots of the single histogram overlays. The comparison procedure is fully automatic and scales smoothly towards ensembles of millions of histograms. Examples of RelMon utilisation within the regular workflows of the CMS collaboration and the advantages therewith obtained are described. Its interplay with the data quality monitoring infrastructure is illustrated as well as its role in the QA of the event reconstruction code, its integration in the CMS software release cycle process, CMS user data analysis and dataset validation.
id cern-1505377
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2012
record_format invenio
spelling cern-15053772022-08-17T13:31:54Zdoi:10.1088/1742-6596/396/2/022011http://cds.cern.ch/record/1505377engPiparo, DaniloRelMon: A general approach to QA, validation and physics analysis through comparison of large sets of histogramsComputing and ComputersThe estimation of the compatibility of large amounts of histogram pairs is a recurrent problem in high energy physics. The issue is common to several different areas, from software quality monitoring to data certification, preservation and analysis. Given two sets of histograms, it is very important to be able to scrutinize the outcome of several goodness of fit tests, obtain a clear answer about the overall compatibility, easily spot the single anomalies and directly access the concerned histogram pairs. This procedure must be automated in order to reduce the human workload, therefore improving the process of identification of differences which is usually carried out by a trained human mind. Some solutions to this problem have been proposed, but they are experiment specific. RelMon depends only on ROOT and offers several goodness of fit tests (e.g. chi-squared or Kolmogorov-Smirnov). It produces highly readable web reports, in which aggregations of the comparisons rankings are available as well as all the plots of the single histogram overlays. The comparison procedure is fully automatic and scales smoothly towards ensembles of millions of histograms. Examples of RelMon utilisation within the regular workflows of the CMS collaboration and the advantages therewith obtained are described. Its interplay with the data quality monitoring infrastructure is illustrated as well as its role in the QA of the event reconstruction code, its integration in the CMS software release cycle process, CMS user data analysis and dataset validation.oai:cds.cern.ch:15053772012
spellingShingle Computing and Computers
Piparo, Danilo
RelMon: A general approach to QA, validation and physics analysis through comparison of large sets of histograms
title RelMon: A general approach to QA, validation and physics analysis through comparison of large sets of histograms
title_full RelMon: A general approach to QA, validation and physics analysis through comparison of large sets of histograms
title_fullStr RelMon: A general approach to QA, validation and physics analysis through comparison of large sets of histograms
title_full_unstemmed RelMon: A general approach to QA, validation and physics analysis through comparison of large sets of histograms
title_short RelMon: A general approach to QA, validation and physics analysis through comparison of large sets of histograms
title_sort relmon: a general approach to qa, validation and physics analysis through comparison of large sets of histograms
topic Computing and Computers
url https://dx.doi.org/10.1088/1742-6596/396/2/022011
http://cds.cern.ch/record/1505377
work_keys_str_mv AT piparodanilo relmonageneralapproachtoqavalidationandphysicsanalysisthroughcomparisonoflargesetsofhistograms