Cargando…

Leveraging universality of jet taggers through transfer learning

A significant challenge in the tagging of boosted objects via machine-learning technology is the prohibitive computational cost associated with training sophisticated models. Nevertheless, the universality of QCD suggests that a large amount of the information learnt in the training is common to dif...

Descripción completa

Detalles Bibliográficos
Autores principales:	Dreyer, Frédéric A., Grabarczyk, Radosław, Monni, Pier Francesco
Lenguaje:	eng
Publicado:	2022
Materias:	hep-ex Particle Physics - Experiment cs.LG Computing and Computers cs.CV hep-ph Particle Physics - Phenomenology
Acceso en línea:	https://dx.doi.org/10.1140/epjc/s10052-022-10469-9 http://cds.cern.ch/record/2804029

_version_	1780972830686445568
author	Dreyer, Frédéric A. Grabarczyk, Radosław Monni, Pier Francesco
author_facet	Dreyer, Frédéric A. Grabarczyk, Radosław Monni, Pier Francesco
author_sort	Dreyer, Frédéric A.
collection	CERN
description	A significant challenge in the tagging of boosted objects via machine-learning technology is the prohibitive computational cost associated with training sophisticated models. Nevertheless, the universality of QCD suggests that a large amount of the information learnt in the training is common to different physical signals and experimental setups. In this article, we explore the use of transfer learning techniques to develop fast and data-efficient jet taggers that leverage such universality. We consider the graph neural networks LundNet and ParticleNet, and introduce two prescriptions to transfer an existing tagger into a new signal based either on fine-tuning all the weights of a model or alternatively on freezing a fraction of them. In the case of W-boson and top-quark tagging, we find that one can obtain reliable taggers using an order of magnitude less data with a corresponding speed-up of the training process. Moreover, while keeping the size of the training data set fixed, we observe a speed-up of the training by up to a factor of three. This offers a promising avenue to facilitate the use of such tools in collider physics experiments.
id	cern-2804029
institution	Organización Europea para la Investigación Nuclear
language	eng
publishDate	2022
record_format	invenio
spelling	cern-28040292023-08-09T12:35:14Zdoi:10.1140/epjc/s10052-022-10469-9http://cds.cern.ch/record/2804029engDreyer, Frédéric A.Grabarczyk, RadosławMonni, Pier FrancescoLeveraging universality of jet taggers through transfer learninghep-exParticle Physics - Experimentcs.LGComputing and Computerscs.CVComputing and Computershep-phParticle Physics - PhenomenologyA significant challenge in the tagging of boosted objects via machine-learning technology is the prohibitive computational cost associated with training sophisticated models. Nevertheless, the universality of QCD suggests that a large amount of the information learnt in the training is common to different physical signals and experimental setups. In this article, we explore the use of transfer learning techniques to develop fast and data-efficient jet taggers that leverage such universality. We consider the graph neural networks LundNet and ParticleNet, and introduce two prescriptions to transfer an existing tagger into a new signal based either on fine-tuning all the weights of a model or alternatively on freezing a fraction of them. In the case of W-boson and top-quark tagging, we find that one can obtain reliable taggers using an order of magnitude less data with a corresponding speed-up of the training process. Moreover, while keeping the size of the training data set fixed, we observe a speed-up of the training by up to a factor of three. This offers a promising avenue to facilitate the use of such tools in collider physics experiments.A significant challenge in the tagging of boosted objects via machine-learning technology is the prohibitive computational cost associated with training sophisticated models. Nevertheless, the universality of QCD suggests that a large amount of the information learnt in the training is common to different physical signals and experimental setups. In this article, we explore the use of transfer learning techniques to develop fast and data-efficient jet taggers that leverage such universality. We consider the graph neural networks LundNet and ParticleNet, and introduce two prescriptions to transfer an existing tagger into a new signal based either on fine-tuning all the weights of a model or alternatively on freezing a fraction of them. In the case of $W$-boson and top-quark tagging, we find that one can obtain reliable taggers using an order of magnitude less data with a corresponding speed-up of the training process. Moreover, while keeping the size of the training data set fixed, we observe a speed-up of the training by up to a factor of three. This offers a promising avenue to facilitate the use of such tools in collider physics experiments.arXiv:2203.06210OUTP-22-01PCERN-TH-2022-018oai:cds.cern.ch:28040292022-03-11
spellingShingle	hep-ex Particle Physics - Experiment cs.LG Computing and Computers cs.CV Computing and Computers hep-ph Particle Physics - Phenomenology Dreyer, Frédéric A. Grabarczyk, Radosław Monni, Pier Francesco Leveraging universality of jet taggers through transfer learning
title	Leveraging universality of jet taggers through transfer learning
title_full	Leveraging universality of jet taggers through transfer learning
title_fullStr	Leveraging universality of jet taggers through transfer learning
title_full_unstemmed	Leveraging universality of jet taggers through transfer learning
title_short	Leveraging universality of jet taggers through transfer learning
title_sort	leveraging universality of jet taggers through transfer learning
topic	hep-ex Particle Physics - Experiment cs.LG Computing and Computers cs.CV Computing and Computers hep-ph Particle Physics - Phenomenology
url	https://dx.doi.org/10.1140/epjc/s10052-022-10469-9 http://cds.cern.ch/record/2804029
work_keys_str_mv	AT dreyerfrederica leveraginguniversalityofjettaggersthroughtransferlearning AT grabarczykradosław leveraginguniversalityofjettaggersthroughtransferlearning AT monnipierfrancesco leveraginguniversalityofjettaggersthroughtransferlearning

Leveraging universality of jet taggers through transfer learning

Ejemplares similares