Cargando…

Leveraging universality of jet taggers through transfer learning

A significant challenge in the tagging of boosted objects via machine-learning technology is the prohibitive computational cost associated with training sophisticated models. Nevertheless, the universality of QCD suggests that a large amount of the information learnt in the training is common to dif...

Descripción completa

Detalles Bibliográficos
Autores principales: Dreyer, Frédéric A., Grabarczyk, Radosław, Monni, Pier Francesco
Lenguaje:eng
Publicado: 2022
Materias:
Acceso en línea:https://dx.doi.org/10.1140/epjc/s10052-022-10469-9
http://cds.cern.ch/record/2804029
_version_ 1780972830686445568
author Dreyer, Frédéric A.
Grabarczyk, Radosław
Monni, Pier Francesco
author_facet Dreyer, Frédéric A.
Grabarczyk, Radosław
Monni, Pier Francesco
author_sort Dreyer, Frédéric A.
collection CERN
description A significant challenge in the tagging of boosted objects via machine-learning technology is the prohibitive computational cost associated with training sophisticated models. Nevertheless, the universality of QCD suggests that a large amount of the information learnt in the training is common to different physical signals and experimental setups. In this article, we explore the use of transfer learning techniques to develop fast and data-efficient jet taggers that leverage such universality. We consider the graph neural networks LundNet and ParticleNet, and introduce two prescriptions to transfer an existing tagger into a new signal based either on fine-tuning all the weights of a model or alternatively on freezing a fraction of them. In the case of W-boson and top-quark tagging, we find that one can obtain reliable taggers using an order of magnitude less data with a corresponding speed-up of the training process. Moreover, while keeping the size of the training data set fixed, we observe a speed-up of the training by up to a factor of three. This offers a promising avenue to facilitate the use of such tools in collider physics experiments.
id cern-2804029
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2022
record_format invenio
spelling cern-28040292023-08-09T12:35:14Zdoi:10.1140/epjc/s10052-022-10469-9http://cds.cern.ch/record/2804029engDreyer, Frédéric A.Grabarczyk, RadosławMonni, Pier FrancescoLeveraging universality of jet taggers through transfer learninghep-exParticle Physics - Experimentcs.LGComputing and Computerscs.CVComputing and Computershep-phParticle Physics - PhenomenologyA significant challenge in the tagging of boosted objects via machine-learning technology is the prohibitive computational cost associated with training sophisticated models. Nevertheless, the universality of QCD suggests that a large amount of the information learnt in the training is common to different physical signals and experimental setups. In this article, we explore the use of transfer learning techniques to develop fast and data-efficient jet taggers that leverage such universality. We consider the graph neural networks LundNet and ParticleNet, and introduce two prescriptions to transfer an existing tagger into a new signal based either on fine-tuning all the weights of a model or alternatively on freezing a fraction of them. In the case of W-boson and top-quark tagging, we find that one can obtain reliable taggers using an order of magnitude less data with a corresponding speed-up of the training process. Moreover, while keeping the size of the training data set fixed, we observe a speed-up of the training by up to a factor of three. This offers a promising avenue to facilitate the use of such tools in collider physics experiments.A significant challenge in the tagging of boosted objects via machine-learning technology is the prohibitive computational cost associated with training sophisticated models. Nevertheless, the universality of QCD suggests that a large amount of the information learnt in the training is common to different physical signals and experimental setups. In this article, we explore the use of transfer learning techniques to develop fast and data-efficient jet taggers that leverage such universality. We consider the graph neural networks LundNet and ParticleNet, and introduce two prescriptions to transfer an existing tagger into a new signal based either on fine-tuning all the weights of a model or alternatively on freezing a fraction of them. In the case of $W$-boson and top-quark tagging, we find that one can obtain reliable taggers using an order of magnitude less data with a corresponding speed-up of the training process. Moreover, while keeping the size of the training data set fixed, we observe a speed-up of the training by up to a factor of three. This offers a promising avenue to facilitate the use of such tools in collider physics experiments.arXiv:2203.06210OUTP-22-01PCERN-TH-2022-018oai:cds.cern.ch:28040292022-03-11
spellingShingle hep-ex
Particle Physics - Experiment
cs.LG
Computing and Computers
cs.CV
Computing and Computers
hep-ph
Particle Physics - Phenomenology
Dreyer, Frédéric A.
Grabarczyk, Radosław
Monni, Pier Francesco
Leveraging universality of jet taggers through transfer learning
title Leveraging universality of jet taggers through transfer learning
title_full Leveraging universality of jet taggers through transfer learning
title_fullStr Leveraging universality of jet taggers through transfer learning
title_full_unstemmed Leveraging universality of jet taggers through transfer learning
title_short Leveraging universality of jet taggers through transfer learning
title_sort leveraging universality of jet taggers through transfer learning
topic hep-ex
Particle Physics - Experiment
cs.LG
Computing and Computers
cs.CV
Computing and Computers
hep-ph
Particle Physics - Phenomenology
url https://dx.doi.org/10.1140/epjc/s10052-022-10469-9
http://cds.cern.ch/record/2804029
work_keys_str_mv AT dreyerfrederica leveraginguniversalityofjettaggersthroughtransferlearning
AT grabarczykradosław leveraginguniversalityofjettaggersthroughtransferlearning
AT monnipierfrancesco leveraginguniversalityofjettaggersthroughtransferlearning