Cargando…

Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment

The certification of the CMS experiment data as usable for physics analysis is a crucial task to ensure the quality of all physics results published by the collaboration. Currently, the certification conducted by human experts is labor intensive and based on the scrutiny of distributions integrated...

Descripción completa

Detalles Bibliográficos
Autores principales: Pol, Adrian Alan, Azzolini, Virginia, Cerminara, Gianluca, De Guio, Federico, Franzoni, Giovanni, Pierini, Maurizio, Široký, Filip, Vlimant, Jean-Roch
Lenguaje:eng
Publicado: 2018
Materias:
Acceso en línea:https://dx.doi.org/10.1051/epjconf/201921406008
http://cds.cern.ch/record/2650715
_version_ 1780960821254291456
author Pol, Adrian Alan
Azzolini, Virginia
Cerminara, Gianluca
De Guio, Federico
Franzoni, Giovanni
Pierini, Maurizio
Široký, Filip
Vlimant, Jean-Roch
author_facet Pol, Adrian Alan
Azzolini, Virginia
Cerminara, Gianluca
De Guio, Federico
Franzoni, Giovanni
Pierini, Maurizio
Široký, Filip
Vlimant, Jean-Roch
author_sort Pol, Adrian Alan
collection CERN
description The certification of the CMS experiment data as usable for physics analysis is a crucial task to ensure the quality of all physics results published by the collaboration. Currently, the certification conducted by human experts is labor intensive and based on the scrutiny of distributions integrated on several hours of data taking. This contribution focuses on the design and prototype of an automated certification system assessing data quality on a per-luminosity section (i.e. 23 seconds of data taking) basis. Anomalies caused by detector malfunctioning or sub-optimal reconstruction are difficult to enumerate a priori and occur rarely, making it difficult to use classical supervised classification methods such as feedforward neural networks. We base our prototype on a semi-supervised approach which employs deep autoencoders. This approach has been qualified successfully on CMS data collected during the 2016 LHC run: we demonstrate its ability to detect anomalies with high accuracy and low false positive rate, when compared against the outcome of the manual certification by experts. A key advantage of this approach over other machine learning technologies is the great interpretability of the results, which can be further used to ascribe the origin of the problems in the data to a specific sub-detector or physics objects.
id cern-2650715
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2018
record_format invenio
spelling cern-26507152022-08-10T12:22:38Zdoi:10.1051/epjconf/201921406008http://cds.cern.ch/record/2650715engPol, Adrian AlanAzzolini, VirginiaCerminara, GianlucaDe Guio, FedericoFranzoni, GiovanniPierini, MaurizioŠiroký, FilipVlimant, Jean-RochAnomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experimentDetectors and Experimental TechniquesThe certification of the CMS experiment data as usable for physics analysis is a crucial task to ensure the quality of all physics results published by the collaboration. Currently, the certification conducted by human experts is labor intensive and based on the scrutiny of distributions integrated on several hours of data taking. This contribution focuses on the design and prototype of an automated certification system assessing data quality on a per-luminosity section (i.e. 23 seconds of data taking) basis. Anomalies caused by detector malfunctioning or sub-optimal reconstruction are difficult to enumerate a priori and occur rarely, making it difficult to use classical supervised classification methods such as feedforward neural networks. We base our prototype on a semi-supervised approach which employs deep autoencoders. This approach has been qualified successfully on CMS data collected during the 2016 LHC run: we demonstrate its ability to detect anomalies with high accuracy and low false positive rate, when compared against the outcome of the manual certification by experts. A key advantage of this approach over other machine learning technologies is the great interpretability of the results, which can be further used to ascribe the origin of the problems in the data to a specific sub-detector or physics objects.CMS-CR-2018-202oai:cds.cern.ch:26507152018-09-07
spellingShingle Detectors and Experimental Techniques
Pol, Adrian Alan
Azzolini, Virginia
Cerminara, Gianluca
De Guio, Federico
Franzoni, Giovanni
Pierini, Maurizio
Široký, Filip
Vlimant, Jean-Roch
Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment
title Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment
title_full Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment
title_fullStr Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment
title_full_unstemmed Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment
title_short Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment
title_sort anomaly detection using deep autoencoders for the assessment of the quality of the data acquired by the cms experiment
topic Detectors and Experimental Techniques
url https://dx.doi.org/10.1051/epjconf/201921406008
http://cds.cern.ch/record/2650715
work_keys_str_mv AT poladrianalan anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment
AT azzolinivirginia anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment
AT cerminaragianluca anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment
AT deguiofederico anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment
AT franzonigiovanni anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment
AT pierinimaurizio anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment
AT sirokyfilip anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment
AT vlimantjeanroch anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment