Cargando…

End-to-End Jet Classification of Boosted Top Quarks with CMS Open Data

We describe a novel application of the end-to-end deep learning technique to the task of discriminating top quark-initiated jets from those originating from the hadronization of a light quark or a gluon. The end-to-end deep learning technique combines deep learning algorithms and low-level detector...

Descripción completa

Detalles Bibliográficos
Autores principales: Andrews, Michael, Burkle, Bjorn, Chen, Yi-fan, DiCroce, Davide, Gleyzer, Sergei, Heintz, Ulrich, Narain, Meenakshi, Paulini, Manfred, Pervan, Nikolas, Shafi, Yusef, Sun, Wei, Usai, Emanuele, Yang, Kun
Lenguaje:eng
Publicado: 2021
Materias:
Acceso en línea:https://dx.doi.org/10.1051/epjconf/202125104030
https://dx.doi.org/10.1103/PhysRevD.105.052008
http://cds.cern.ch/record/2780774
_version_ 1780971894676127744
author Andrews, Michael
Burkle, Bjorn
Chen, Yi-fan
DiCroce, Davide
Gleyzer, Sergei
Heintz, Ulrich
Narain, Meenakshi
Paulini, Manfred
Pervan, Nikolas
Shafi, Yusef
Sun, Wei
Usai, Emanuele
Yang, Kun
author_facet Andrews, Michael
Burkle, Bjorn
Chen, Yi-fan
DiCroce, Davide
Gleyzer, Sergei
Heintz, Ulrich
Narain, Meenakshi
Paulini, Manfred
Pervan, Nikolas
Shafi, Yusef
Sun, Wei
Usai, Emanuele
Yang, Kun
author_sort Andrews, Michael
collection CERN
description We describe a novel application of the end-to-end deep learning technique to the task of discriminating top quark-initiated jets from those originating from the hadronization of a light quark or a gluon. The end-to-end deep learning technique combines deep learning algorithms and low-level detector representation of the high-energy collision event. In this study, we use lowlevel detector information from the simulated CMS Open Data samples to construct the top jet classifiers. To optimize classifier performance we progressively add low-level information from the CMS tracking detector, including pixel detector reconstructed hits and impact parameters, and demonstrate the value of additional tracking information even when no new spatial structures are added. Relying only on calorimeter energy deposits and reconstructed pixel detector hits, the end-to-end classifier achieves a ROC-AUC score of 0.975±0.002 for the task of classifying boosted top quark jets. After adding derived track quantities, the classifier ROC-AUC score increases to 0.9824±0.0013, serving as the first performance benchmark for these CMS Open Data samples.
id cern-2780774
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2021
record_format invenio
spelling cern-27807742023-03-12T04:56:13Zdoi:10.1051/epjconf/202125104030doi:10.1103/PhysRevD.105.052008http://cds.cern.ch/record/2780774engAndrews, MichaelBurkle, BjornChen, Yi-fanDiCroce, DavideGleyzer, SergeiHeintz, UlrichNarain, MeenakshiPaulini, ManfredPervan, NikolasShafi, YusefSun, WeiUsai, EmanueleYang, KunEnd-to-End Jet Classification of Boosted Top Quarks with CMS Open Datahep-exParticle Physics - Experimentcs.LGComputing and Computerscs.CVComputing and Computersphysics.data-anOther Fields of PhysicsWe describe a novel application of the end-to-end deep learning technique to the task of discriminating top quark-initiated jets from those originating from the hadronization of a light quark or a gluon. The end-to-end deep learning technique combines deep learning algorithms and low-level detector representation of the high-energy collision event. In this study, we use lowlevel detector information from the simulated CMS Open Data samples to construct the top jet classifiers. To optimize classifier performance we progressively add low-level information from the CMS tracking detector, including pixel detector reconstructed hits and impact parameters, and demonstrate the value of additional tracking information even when no new spatial structures are added. Relying only on calorimeter energy deposits and reconstructed pixel detector hits, the end-to-end classifier achieves a ROC-AUC score of 0.975±0.002 for the task of classifying boosted top quark jets. After adding derived track quantities, the classifier ROC-AUC score increases to 0.9824±0.0013, serving as the first performance benchmark for these CMS Open Data samples.We describe a novel application of the end-to-end deep learning technique to the task of discriminating top quark-initiated jets from those originating from the hadronization of a light quark or a gluon. The end-to-end deep learning technique combines deep learning algorithms and low-level detector representation of the high-energy collision event. In this study, we use low-level detector information from the simulated CMS Open Data samples to construct the top jet classifiers. To optimize classifier performance we progressively add low-level information from the CMS tracking detector, including pixel detector reconstructed hits and impact parameters, and demonstrate the value of additional tracking information even when no new spatial structures are added. Relying only on calorimeter energy deposits and reconstructed pixel detector hits, the end-to-end classifier achieves an AUC score of 0.975$\pm$0.002 for the task of classifying boosted top quark jets. After adding derived track quantities, the classifier AUC score increases to 0.9824$\pm$0.0013, serving as the first performance benchmark for these CMS Open Data samples. We additionally provide a timing performance comparison of different processor unit architectures for training the network.We describe a novel application of the end-to-end deep learning technique to the task of discriminating top quark-initiated jets from those originating from the hadronization of a light quark or a gluon. The end-to-end deep learning technique uses low-level detector representation of high-energy collision event as inputs to deep learning algorithms. In this study, we use low-level detector information from the simulated Compact Muon Solenoid (CMS) open data samples to construct the top jet classifiers. To optimize classifier performance we progressively add low-level information from the CMS tracking detector, including pixel detector reconstructed hits and impact parameters, and demonstrate the value of additional tracking information even when no new spatial structures are added. Relying only on calorimeter energy deposits and reconstructed pixel detector hits, the end-to-end classifier achieves an area under the receiver operator curve (AUC) score of <math display="inline"><mrow><mn>0.975</mn><mo>±</mo><mn>0.002</mn></mrow></math> for the task of classifying boosted top quark jets. After adding derived track quantities, the classifier AUC score increases to <math display="inline"><mrow><mn>0.9824</mn><mo>±</mo><mn>0.0013</mn></mrow></math>, serving as the first performance benchmark for these CMS open data samples.arXiv:2104.14659oai:cds.cern.ch:27807742021
spellingShingle hep-ex
Particle Physics - Experiment
cs.LG
Computing and Computers
cs.CV
Computing and Computers
physics.data-an
Other Fields of Physics
Andrews, Michael
Burkle, Bjorn
Chen, Yi-fan
DiCroce, Davide
Gleyzer, Sergei
Heintz, Ulrich
Narain, Meenakshi
Paulini, Manfred
Pervan, Nikolas
Shafi, Yusef
Sun, Wei
Usai, Emanuele
Yang, Kun
End-to-End Jet Classification of Boosted Top Quarks with CMS Open Data
title End-to-End Jet Classification of Boosted Top Quarks with CMS Open Data
title_full End-to-End Jet Classification of Boosted Top Quarks with CMS Open Data
title_fullStr End-to-End Jet Classification of Boosted Top Quarks with CMS Open Data
title_full_unstemmed End-to-End Jet Classification of Boosted Top Quarks with CMS Open Data
title_short End-to-End Jet Classification of Boosted Top Quarks with CMS Open Data
title_sort end-to-end jet classification of boosted top quarks with cms open data
topic hep-ex
Particle Physics - Experiment
cs.LG
Computing and Computers
cs.CV
Computing and Computers
physics.data-an
Other Fields of Physics
url https://dx.doi.org/10.1051/epjconf/202125104030
https://dx.doi.org/10.1103/PhysRevD.105.052008
http://cds.cern.ch/record/2780774
work_keys_str_mv AT andrewsmichael endtoendjetclassificationofboostedtopquarkswithcmsopendata
AT burklebjorn endtoendjetclassificationofboostedtopquarkswithcmsopendata
AT chenyifan endtoendjetclassificationofboostedtopquarkswithcmsopendata
AT dicrocedavide endtoendjetclassificationofboostedtopquarkswithcmsopendata
AT gleyzersergei endtoendjetclassificationofboostedtopquarkswithcmsopendata
AT heintzulrich endtoendjetclassificationofboostedtopquarkswithcmsopendata
AT narainmeenakshi endtoendjetclassificationofboostedtopquarkswithcmsopendata
AT paulinimanfred endtoendjetclassificationofboostedtopquarkswithcmsopendata
AT pervannikolas endtoendjetclassificationofboostedtopquarkswithcmsopendata
AT shafiyusef endtoendjetclassificationofboostedtopquarkswithcmsopendata
AT sunwei endtoendjetclassificationofboostedtopquarkswithcmsopendata
AT usaiemanuele endtoendjetclassificationofboostedtopquarkswithcmsopendata
AT yangkun endtoendjetclassificationofboostedtopquarkswithcmsopendata