Cargando…

Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment

The certification of the CMS experiment data as usable for physics analysis is a crucial task to ensure the quality of all physics results published by the collaboration. Currently, the certification conducted by human experts is labor intensive and based on the scrutiny of distributions integrated...

Descripción completa

Detalles Bibliográficos
Autores principales:	Pol, Adrian Alan, Azzolini, Virginia, Cerminara, Gianluca, De Guio, Federico, Franzoni, Giovanni, Pierini, Maurizio, Široký, Filip, Vlimant, Jean-Roch
Lenguaje:	eng
Publicado:	2018
Materias:	Detectors and Experimental Techniques
Acceso en línea:	https://dx.doi.org/10.1051/epjconf/201921406008 http://cds.cern.ch/record/2650715

_version_	1780960821254291456
author	Pol, Adrian Alan Azzolini, Virginia Cerminara, Gianluca De Guio, Federico Franzoni, Giovanni Pierini, Maurizio Široký, Filip Vlimant, Jean-Roch
author_facet	Pol, Adrian Alan Azzolini, Virginia Cerminara, Gianluca De Guio, Federico Franzoni, Giovanni Pierini, Maurizio Široký, Filip Vlimant, Jean-Roch
author_sort	Pol, Adrian Alan
collection	CERN
description	The certification of the CMS experiment data as usable for physics analysis is a crucial task to ensure the quality of all physics results published by the collaboration. Currently, the certification conducted by human experts is labor intensive and based on the scrutiny of distributions integrated on several hours of data taking. This contribution focuses on the design and prototype of an automated certification system assessing data quality on a per-luminosity section (i.e. 23 seconds of data taking) basis. Anomalies caused by detector malfunctioning or sub-optimal reconstruction are difficult to enumerate a priori and occur rarely, making it difficult to use classical supervised classification methods such as feedforward neural networks. We base our prototype on a semi-supervised approach which employs deep autoencoders. This approach has been qualified successfully on CMS data collected during the 2016 LHC run: we demonstrate its ability to detect anomalies with high accuracy and low false positive rate, when compared against the outcome of the manual certification by experts. A key advantage of this approach over other machine learning technologies is the great interpretability of the results, which can be further used to ascribe the origin of the problems in the data to a specific sub-detector or physics objects.
id	cern-2650715
institution	Organización Europea para la Investigación Nuclear
language	eng
publishDate	2018
record_format	invenio
spelling	cern-26507152022-08-10T12:22:38Zdoi:10.1051/epjconf/201921406008http://cds.cern.ch/record/2650715engPol, Adrian AlanAzzolini, VirginiaCerminara, GianlucaDe Guio, FedericoFranzoni, GiovanniPierini, MaurizioŠiroký, FilipVlimant, Jean-RochAnomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experimentDetectors and Experimental TechniquesThe certification of the CMS experiment data as usable for physics analysis is a crucial task to ensure the quality of all physics results published by the collaboration. Currently, the certification conducted by human experts is labor intensive and based on the scrutiny of distributions integrated on several hours of data taking. This contribution focuses on the design and prototype of an automated certification system assessing data quality on a per-luminosity section (i.e. 23 seconds of data taking) basis. Anomalies caused by detector malfunctioning or sub-optimal reconstruction are difficult to enumerate a priori and occur rarely, making it difficult to use classical supervised classification methods such as feedforward neural networks. We base our prototype on a semi-supervised approach which employs deep autoencoders. This approach has been qualified successfully on CMS data collected during the 2016 LHC run: we demonstrate its ability to detect anomalies with high accuracy and low false positive rate, when compared against the outcome of the manual certification by experts. A key advantage of this approach over other machine learning technologies is the great interpretability of the results, which can be further used to ascribe the origin of the problems in the data to a specific sub-detector or physics objects.CMS-CR-2018-202oai:cds.cern.ch:26507152018-09-07
spellingShingle	Detectors and Experimental Techniques Pol, Adrian Alan Azzolini, Virginia Cerminara, Gianluca De Guio, Federico Franzoni, Giovanni Pierini, Maurizio Široký, Filip Vlimant, Jean-Roch Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment
title	Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment
title_full	Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment
title_fullStr	Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment
title_full_unstemmed	Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment
title_short	Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment
title_sort	anomaly detection using deep autoencoders for the assessment of the quality of the data acquired by the cms experiment
topic	Detectors and Experimental Techniques
url	https://dx.doi.org/10.1051/epjconf/201921406008 http://cds.cern.ch/record/2650715
work_keys_str_mv	AT poladrianalan anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment AT azzolinivirginia anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment AT cerminaragianluca anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment AT deguiofederico anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment AT franzonigiovanni anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment AT pierinimaurizio anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment AT sirokyfilip anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment AT vlimantjeanroch anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment

Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment

Ejemplares similares