Cargando…

Direct Learning of Systematics-Aware Summary Statistics

<!--HTML-->Complex machine learning tools, such as deep neural networks and gradient boosting algorithms, are increasingly being used to construct powerful discriminative features for High Energy Physics analyses. These methods are typically trained with simulated or auxiliary data samples by...

Descripción completa

Detalles Bibliográficos
Autor principal: De Castro Manzano, Pablo
Lenguaje:eng
Publicado: 2018
Materias:
Acceso en línea:http://cds.cern.ch/record/2312998
_version_ 1780958028684591104
author De Castro Manzano, Pablo
author_facet De Castro Manzano, Pablo
author_sort De Castro Manzano, Pablo
collection CERN
description <!--HTML-->Complex machine learning tools, such as deep neural networks and gradient boosting algorithms, are increasingly being used to construct powerful discriminative features for High Energy Physics analyses. These methods are typically trained with simulated or auxiliary data samples by optimising some classification or regression surrogate objective. The learned feature representations are then used to build a sample-based statistical model to perform inference (e.g. interval estimation or hypothesis testing) over a set of parameters of interest. However, the effectiveness of the mentioned approach can be reduced by the presence of known uncertainties that cause differences between training and experimental data, included in the statistical model via nuisance parameters. This work presents an end-to-end algorithm, which leverages on existing deep learning technologies but directly aims to produce inference-optimal sample-summary statistics. By including the statistical model and a differentiable approximation of the effect of nuisance parameters in the computational graph, loss functions derived form the observed Fisher information are directly optimised by stochastic gradient descent. This new technique leads to summary statistics that are aware of the known uncertainties and maximise the information that can be inferred about the parameters of interest object of a experimental measurement.
id cern-2312998
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2018
record_format invenio
spelling cern-23129982022-11-02T22:34:03Zhttp://cds.cern.ch/record/2312998engDe Castro Manzano, PabloDirect Learning of Systematics-Aware Summary Statistics2nd IML Machine Learning WorkshopMachine Learning<!--HTML-->Complex machine learning tools, such as deep neural networks and gradient boosting algorithms, are increasingly being used to construct powerful discriminative features for High Energy Physics analyses. These methods are typically trained with simulated or auxiliary data samples by optimising some classification or regression surrogate objective. The learned feature representations are then used to build a sample-based statistical model to perform inference (e.g. interval estimation or hypothesis testing) over a set of parameters of interest. However, the effectiveness of the mentioned approach can be reduced by the presence of known uncertainties that cause differences between training and experimental data, included in the statistical model via nuisance parameters. This work presents an end-to-end algorithm, which leverages on existing deep learning technologies but directly aims to produce inference-optimal sample-summary statistics. By including the statistical model and a differentiable approximation of the effect of nuisance parameters in the computational graph, loss functions derived form the observed Fisher information are directly optimised by stochastic gradient descent. This new technique leads to summary statistics that are aware of the known uncertainties and maximise the information that can be inferred about the parameters of interest object of a experimental measurement.oai:cds.cern.ch:23129982018
spellingShingle Machine Learning
De Castro Manzano, Pablo
Direct Learning of Systematics-Aware Summary Statistics
title Direct Learning of Systematics-Aware Summary Statistics
title_full Direct Learning of Systematics-Aware Summary Statistics
title_fullStr Direct Learning of Systematics-Aware Summary Statistics
title_full_unstemmed Direct Learning of Systematics-Aware Summary Statistics
title_short Direct Learning of Systematics-Aware Summary Statistics
title_sort direct learning of systematics-aware summary statistics
topic Machine Learning
url http://cds.cern.ch/record/2312998
work_keys_str_mv AT decastromanzanopablo directlearningofsystematicsawaresummarystatistics
AT decastromanzanopablo 2ndimlmachinelearningworkshop