Cargando…
Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment
The certification of the CMS experiment data as usable for physics analysis is a crucial task to ensure the quality of all physics results published by the collaboration. Currently, the certification conducted by human experts is labor intensive and based on the scrutiny of distributions integrated...
Autores principales: | , , , , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2018
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1051/epjconf/201921406008 http://cds.cern.ch/record/2650715 |
_version_ | 1780960821254291456 |
---|---|
author | Pol, Adrian Alan Azzolini, Virginia Cerminara, Gianluca De Guio, Federico Franzoni, Giovanni Pierini, Maurizio Široký, Filip Vlimant, Jean-Roch |
author_facet | Pol, Adrian Alan Azzolini, Virginia Cerminara, Gianluca De Guio, Federico Franzoni, Giovanni Pierini, Maurizio Široký, Filip Vlimant, Jean-Roch |
author_sort | Pol, Adrian Alan |
collection | CERN |
description | The certification of the CMS experiment data as usable for physics analysis is a crucial task to ensure the quality of all physics results published by the collaboration. Currently, the certification conducted by human experts is labor intensive and based on the scrutiny of distributions integrated on several hours of data taking. This contribution focuses on the design and prototype of an automated certification system assessing data quality on a per-luminosity section (i.e. 23 seconds of data taking) basis. Anomalies caused by detector malfunctioning or sub-optimal reconstruction are difficult to enumerate a priori and occur rarely, making it difficult to use classical supervised classification methods such as feedforward neural networks. We base our prototype on a semi-supervised approach which employs deep autoencoders. This approach has been qualified successfully on CMS data collected during the 2016 LHC run: we demonstrate its ability to detect anomalies with high accuracy and low false positive rate, when compared against the outcome of the manual certification by experts. A key advantage of this approach over other machine learning technologies is the great interpretability of the results, which can be further used to ascribe the origin of the problems in the data to a specific sub-detector or physics objects. |
id | cern-2650715 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2018 |
record_format | invenio |
spelling | cern-26507152022-08-10T12:22:38Zdoi:10.1051/epjconf/201921406008http://cds.cern.ch/record/2650715engPol, Adrian AlanAzzolini, VirginiaCerminara, GianlucaDe Guio, FedericoFranzoni, GiovanniPierini, MaurizioŠiroký, FilipVlimant, Jean-RochAnomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experimentDetectors and Experimental TechniquesThe certification of the CMS experiment data as usable for physics analysis is a crucial task to ensure the quality of all physics results published by the collaboration. Currently, the certification conducted by human experts is labor intensive and based on the scrutiny of distributions integrated on several hours of data taking. This contribution focuses on the design and prototype of an automated certification system assessing data quality on a per-luminosity section (i.e. 23 seconds of data taking) basis. Anomalies caused by detector malfunctioning or sub-optimal reconstruction are difficult to enumerate a priori and occur rarely, making it difficult to use classical supervised classification methods such as feedforward neural networks. We base our prototype on a semi-supervised approach which employs deep autoencoders. This approach has been qualified successfully on CMS data collected during the 2016 LHC run: we demonstrate its ability to detect anomalies with high accuracy and low false positive rate, when compared against the outcome of the manual certification by experts. A key advantage of this approach over other machine learning technologies is the great interpretability of the results, which can be further used to ascribe the origin of the problems in the data to a specific sub-detector or physics objects.CMS-CR-2018-202oai:cds.cern.ch:26507152018-09-07 |
spellingShingle | Detectors and Experimental Techniques Pol, Adrian Alan Azzolini, Virginia Cerminara, Gianluca De Guio, Federico Franzoni, Giovanni Pierini, Maurizio Široký, Filip Vlimant, Jean-Roch Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment |
title | Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment |
title_full | Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment |
title_fullStr | Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment |
title_full_unstemmed | Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment |
title_short | Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment |
title_sort | anomaly detection using deep autoencoders for the assessment of the quality of the data acquired by the cms experiment |
topic | Detectors and Experimental Techniques |
url | https://dx.doi.org/10.1051/epjconf/201921406008 http://cds.cern.ch/record/2650715 |
work_keys_str_mv | AT poladrianalan anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment AT azzolinivirginia anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment AT cerminaragianluca anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment AT deguiofederico anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment AT franzonigiovanni anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment AT pierinimaurizio anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment AT sirokyfilip anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment AT vlimantjeanroch anomalydetectionusingdeepautoencodersfortheassessmentofthequalityofthedataacquiredbythecmsexperiment |