Cargando…
End-to-End Jet Classification of Boosted Top Quarks with CMS Open Data
We describe a novel application of the end-to-end deep learning technique to the task of discriminating top quark-initiated jets from those originating from the hadronization of a light quark or a gluon. The end-to-end deep learning technique combines deep learning algorithms and low-level detector...
Autores principales: | , , , , , , , , , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2021
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1051/epjconf/202125104030 https://dx.doi.org/10.1103/PhysRevD.105.052008 http://cds.cern.ch/record/2780774 |
_version_ | 1780971894676127744 |
---|---|
author | Andrews, Michael Burkle, Bjorn Chen, Yi-fan DiCroce, Davide Gleyzer, Sergei Heintz, Ulrich Narain, Meenakshi Paulini, Manfred Pervan, Nikolas Shafi, Yusef Sun, Wei Usai, Emanuele Yang, Kun |
author_facet | Andrews, Michael Burkle, Bjorn Chen, Yi-fan DiCroce, Davide Gleyzer, Sergei Heintz, Ulrich Narain, Meenakshi Paulini, Manfred Pervan, Nikolas Shafi, Yusef Sun, Wei Usai, Emanuele Yang, Kun |
author_sort | Andrews, Michael |
collection | CERN |
description | We describe a novel application of the end-to-end deep learning technique to the task of discriminating top quark-initiated jets from those originating from the hadronization of a light quark or a gluon. The end-to-end deep learning technique combines deep learning algorithms and low-level detector representation of the high-energy collision event. In this study, we use lowlevel detector information from the simulated CMS Open Data samples to construct the top jet classifiers. To optimize classifier performance we progressively add low-level information from the CMS tracking detector, including pixel detector reconstructed hits and impact parameters, and demonstrate the value of additional tracking information even when no new spatial structures are added. Relying only on calorimeter energy deposits and reconstructed pixel detector hits, the end-to-end classifier achieves a ROC-AUC score of 0.975±0.002 for the task of classifying boosted top quark jets. After adding derived track quantities, the classifier ROC-AUC score increases to 0.9824±0.0013, serving as the first performance benchmark for these CMS Open Data samples. |
id | cern-2780774 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2021 |
record_format | invenio |
spelling | cern-27807742023-03-12T04:56:13Zdoi:10.1051/epjconf/202125104030doi:10.1103/PhysRevD.105.052008http://cds.cern.ch/record/2780774engAndrews, MichaelBurkle, BjornChen, Yi-fanDiCroce, DavideGleyzer, SergeiHeintz, UlrichNarain, MeenakshiPaulini, ManfredPervan, NikolasShafi, YusefSun, WeiUsai, EmanueleYang, KunEnd-to-End Jet Classification of Boosted Top Quarks with CMS Open Datahep-exParticle Physics - Experimentcs.LGComputing and Computerscs.CVComputing and Computersphysics.data-anOther Fields of PhysicsWe describe a novel application of the end-to-end deep learning technique to the task of discriminating top quark-initiated jets from those originating from the hadronization of a light quark or a gluon. The end-to-end deep learning technique combines deep learning algorithms and low-level detector representation of the high-energy collision event. In this study, we use lowlevel detector information from the simulated CMS Open Data samples to construct the top jet classifiers. To optimize classifier performance we progressively add low-level information from the CMS tracking detector, including pixel detector reconstructed hits and impact parameters, and demonstrate the value of additional tracking information even when no new spatial structures are added. Relying only on calorimeter energy deposits and reconstructed pixel detector hits, the end-to-end classifier achieves a ROC-AUC score of 0.975±0.002 for the task of classifying boosted top quark jets. After adding derived track quantities, the classifier ROC-AUC score increases to 0.9824±0.0013, serving as the first performance benchmark for these CMS Open Data samples.We describe a novel application of the end-to-end deep learning technique to the task of discriminating top quark-initiated jets from those originating from the hadronization of a light quark or a gluon. The end-to-end deep learning technique combines deep learning algorithms and low-level detector representation of the high-energy collision event. In this study, we use low-level detector information from the simulated CMS Open Data samples to construct the top jet classifiers. To optimize classifier performance we progressively add low-level information from the CMS tracking detector, including pixel detector reconstructed hits and impact parameters, and demonstrate the value of additional tracking information even when no new spatial structures are added. Relying only on calorimeter energy deposits and reconstructed pixel detector hits, the end-to-end classifier achieves an AUC score of 0.975$\pm$0.002 for the task of classifying boosted top quark jets. After adding derived track quantities, the classifier AUC score increases to 0.9824$\pm$0.0013, serving as the first performance benchmark for these CMS Open Data samples. We additionally provide a timing performance comparison of different processor unit architectures for training the network.We describe a novel application of the end-to-end deep learning technique to the task of discriminating top quark-initiated jets from those originating from the hadronization of a light quark or a gluon. The end-to-end deep learning technique uses low-level detector representation of high-energy collision event as inputs to deep learning algorithms. In this study, we use low-level detector information from the simulated Compact Muon Solenoid (CMS) open data samples to construct the top jet classifiers. To optimize classifier performance we progressively add low-level information from the CMS tracking detector, including pixel detector reconstructed hits and impact parameters, and demonstrate the value of additional tracking information even when no new spatial structures are added. Relying only on calorimeter energy deposits and reconstructed pixel detector hits, the end-to-end classifier achieves an area under the receiver operator curve (AUC) score of <math display="inline"><mrow><mn>0.975</mn><mo>±</mo><mn>0.002</mn></mrow></math> for the task of classifying boosted top quark jets. After adding derived track quantities, the classifier AUC score increases to <math display="inline"><mrow><mn>0.9824</mn><mo>±</mo><mn>0.0013</mn></mrow></math>, serving as the first performance benchmark for these CMS open data samples.arXiv:2104.14659oai:cds.cern.ch:27807742021 |
spellingShingle | hep-ex Particle Physics - Experiment cs.LG Computing and Computers cs.CV Computing and Computers physics.data-an Other Fields of Physics Andrews, Michael Burkle, Bjorn Chen, Yi-fan DiCroce, Davide Gleyzer, Sergei Heintz, Ulrich Narain, Meenakshi Paulini, Manfred Pervan, Nikolas Shafi, Yusef Sun, Wei Usai, Emanuele Yang, Kun End-to-End Jet Classification of Boosted Top Quarks with CMS Open Data |
title | End-to-End Jet Classification of Boosted Top Quarks with CMS Open Data |
title_full | End-to-End Jet Classification of Boosted Top Quarks with CMS Open Data |
title_fullStr | End-to-End Jet Classification of Boosted Top Quarks with CMS Open Data |
title_full_unstemmed | End-to-End Jet Classification of Boosted Top Quarks with CMS Open Data |
title_short | End-to-End Jet Classification of Boosted Top Quarks with CMS Open Data |
title_sort | end-to-end jet classification of boosted top quarks with cms open data |
topic | hep-ex Particle Physics - Experiment cs.LG Computing and Computers cs.CV Computing and Computers physics.data-an Other Fields of Physics |
url | https://dx.doi.org/10.1051/epjconf/202125104030 https://dx.doi.org/10.1103/PhysRevD.105.052008 http://cds.cern.ch/record/2780774 |
work_keys_str_mv | AT andrewsmichael endtoendjetclassificationofboostedtopquarkswithcmsopendata AT burklebjorn endtoendjetclassificationofboostedtopquarkswithcmsopendata AT chenyifan endtoendjetclassificationofboostedtopquarkswithcmsopendata AT dicrocedavide endtoendjetclassificationofboostedtopquarkswithcmsopendata AT gleyzersergei endtoendjetclassificationofboostedtopquarkswithcmsopendata AT heintzulrich endtoendjetclassificationofboostedtopquarkswithcmsopendata AT narainmeenakshi endtoendjetclassificationofboostedtopquarkswithcmsopendata AT paulinimanfred endtoendjetclassificationofboostedtopquarkswithcmsopendata AT pervannikolas endtoendjetclassificationofboostedtopquarkswithcmsopendata AT shafiyusef endtoendjetclassificationofboostedtopquarkswithcmsopendata AT sunwei endtoendjetclassificationofboostedtopquarkswithcmsopendata AT usaiemanuele endtoendjetclassificationofboostedtopquarkswithcmsopendata AT yangkun endtoendjetclassificationofboostedtopquarkswithcmsopendata |