Cargando…

The ATLAS Higgs Machine Learning Challenge

High Energy Physics has been using Machine Learning techniques (commonly known as Multivariate Analysis) since the 1990s with Artificial Neural Net and more recently with Boosted Decision Trees, Random Forest etc. Meanwhile, Machine Learning has become a full blown field of computer science. With th...

Descripción completa

Detalles Bibliográficos
Autores principales: Cowan, Glen, Rousseau, David, Bourdarios, Claire
Lenguaje:eng
Publicado: 2015
Materias:
Acceso en línea:http://cds.cern.ch/record/2007301
_version_ 1780946355809681408
author Cowan, Glen
Rousseau, David
Bourdarios, Claire
author_facet Cowan, Glen
Rousseau, David
Bourdarios, Claire
author_sort Cowan, Glen
collection CERN
description High Energy Physics has been using Machine Learning techniques (commonly known as Multivariate Analysis) since the 1990s with Artificial Neural Net and more recently with Boosted Decision Trees, Random Forest etc. Meanwhile, Machine Learning has become a full blown field of computer science. With the emergence of Big Data, data scientists are developing new Machine Learning algorithms to extract meaning from large heterogeneous data. HEP has exciting and difficult problems like the extraction of the Higgs boson signal, and at the same time data scientists have advanced algorithms: the goal of the HiggsML project was to bring the two together by a “challenge”: participants from all over the world and any scientific background could compete online to obtain the best Higgs to tau tau signal significance on a set of ATLAS fully simulated Monte Carlo signal and background. Instead of HEP physicists browsing through machine learning papers and trying to infer which new algorithms might be useful for HEP, then coding and tuning them, the challenge has brought realistic HEP data to the data scientists on the Kaggle platform, which is well known in the Machine Learning community. The challenge has been organized by the ATLAS collaboration associated to data scientists, in partnership with the Paris Saclay Center for Data Science, CERN and Google. The challenge ran from May to September 2014, drawing considerable attention. 1785 teams participated, making it the most popular challenge ever on the Kaggle platform. New Machine Learning techniques have been used by the participants with significantly better results than usual HEP tools. This presentation has two parts: the first one describes how a HEP problem was simplified (not too much!) and wrapped up into an online challenge, the second what was learned from the challenge, in terms of new Machine Learning algorithms and techniques which could have an impact on future HEP analysis.
id cern-2007301
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2015
record_format invenio
spelling cern-20073012019-09-30T06:29:59Zhttp://cds.cern.ch/record/2007301engCowan, GlenRousseau, DavidBourdarios, ClaireThe ATLAS Higgs Machine Learning ChallengeParticle Physics - ExperimentHigh Energy Physics has been using Machine Learning techniques (commonly known as Multivariate Analysis) since the 1990s with Artificial Neural Net and more recently with Boosted Decision Trees, Random Forest etc. Meanwhile, Machine Learning has become a full blown field of computer science. With the emergence of Big Data, data scientists are developing new Machine Learning algorithms to extract meaning from large heterogeneous data. HEP has exciting and difficult problems like the extraction of the Higgs boson signal, and at the same time data scientists have advanced algorithms: the goal of the HiggsML project was to bring the two together by a “challenge”: participants from all over the world and any scientific background could compete online to obtain the best Higgs to tau tau signal significance on a set of ATLAS fully simulated Monte Carlo signal and background. Instead of HEP physicists browsing through machine learning papers and trying to infer which new algorithms might be useful for HEP, then coding and tuning them, the challenge has brought realistic HEP data to the data scientists on the Kaggle platform, which is well known in the Machine Learning community. The challenge has been organized by the ATLAS collaboration associated to data scientists, in partnership with the Paris Saclay Center for Data Science, CERN and Google. The challenge ran from May to September 2014, drawing considerable attention. 1785 teams participated, making it the most popular challenge ever on the Kaggle platform. New Machine Learning techniques have been used by the participants with significantly better results than usual HEP tools. This presentation has two parts: the first one describes how a HEP problem was simplified (not too much!) and wrapped up into an online challenge, the second what was learned from the challenge, in terms of new Machine Learning algorithms and techniques which could have an impact on future HEP analysis.ATL-SOFT-SLIDE-2015-152oai:cds.cern.ch:20073012015-04-08
spellingShingle Particle Physics - Experiment
Cowan, Glen
Rousseau, David
Bourdarios, Claire
The ATLAS Higgs Machine Learning Challenge
title The ATLAS Higgs Machine Learning Challenge
title_full The ATLAS Higgs Machine Learning Challenge
title_fullStr The ATLAS Higgs Machine Learning Challenge
title_full_unstemmed The ATLAS Higgs Machine Learning Challenge
title_short The ATLAS Higgs Machine Learning Challenge
title_sort atlas higgs machine learning challenge
topic Particle Physics - Experiment
url http://cds.cern.ch/record/2007301
work_keys_str_mv AT cowanglen theatlashiggsmachinelearningchallenge
AT rousseaudavid theatlashiggsmachinelearningchallenge
AT bourdariosclaire theatlashiggsmachinelearningchallenge
AT cowanglen atlashiggsmachinelearningchallenge
AT rousseaudavid atlashiggsmachinelearningchallenge
AT bourdariosclaire atlashiggsmachinelearningchallenge