Cargando…

Direct Learning of Systematics-Aware Summary Statistics

Complex machine learning tools, such as deep neural networks and gradient boosting algorithms, are increasingly being used to construct powerful discriminative features for High Energy Physics analyses. These methods are typically trained with simulated or auxiliary data samples by...

Descripción completa

Detalles Bibliográficos
Autor principal:	De Castro Manzano, Pablo
Lenguaje:	eng
Publicado:	2018
Materias:	Machine Learning
Acceso en línea:	http://cds.cern.ch/record/2312998

_version_	1780958028684591104
author	De Castro Manzano, Pablo
author_facet	De Castro Manzano, Pablo
author_sort	De Castro Manzano, Pablo
collection	CERN
description	<!--HTML-->Complex machine learning tools, such as deep neural networks and gradient boosting algorithms, are increasingly being used to construct powerful discriminative features for High Energy Physics analyses. These methods are typically trained with simulated or auxiliary data samples by optimising some classification or regression surrogate objective. The learned feature representations are then used to build a sample-based statistical model to perform inference (e.g. interval estimation or hypothesis testing) over a set of parameters of interest. However, the effectiveness of the mentioned approach can be reduced by the presence of known uncertainties that cause differences between training and experimental data, included in the statistical model via nuisance parameters. This work presents an end-to-end algorithm, which leverages on existing deep learning technologies but directly aims to produce inference-optimal sample-summary statistics. By including the statistical model and a differentiable approximation of the effect of nuisance parameters in the computational graph, loss functions derived form the observed Fisher information are directly optimised by stochastic gradient descent. This new technique leads to summary statistics that are aware of the known uncertainties and maximise the information that can be inferred about the parameters of interest object of a experimental measurement.
id	cern-2312998
institution	Organización Europea para la Investigación Nuclear
language	eng
publishDate	2018
record_format	invenio
spelling	cern-23129982022-11-02T22:34:03Zhttp://cds.cern.ch/record/2312998engDe Castro Manzano, PabloDirect Learning of Systematics-Aware Summary Statistics2nd IML Machine Learning WorkshopMachine Learning<!--HTML-->Complex machine learning tools, such as deep neural networks and gradient boosting algorithms, are increasingly being used to construct powerful discriminative features for High Energy Physics analyses. These methods are typically trained with simulated or auxiliary data samples by optimising some classification or regression surrogate objective. The learned feature representations are then used to build a sample-based statistical model to perform inference (e.g. interval estimation or hypothesis testing) over a set of parameters of interest. However, the effectiveness of the mentioned approach can be reduced by the presence of known uncertainties that cause differences between training and experimental data, included in the statistical model via nuisance parameters. This work presents an end-to-end algorithm, which leverages on existing deep learning technologies but directly aims to produce inference-optimal sample-summary statistics. By including the statistical model and a differentiable approximation of the effect of nuisance parameters in the computational graph, loss functions derived form the observed Fisher information are directly optimised by stochastic gradient descent. This new technique leads to summary statistics that are aware of the known uncertainties and maximise the information that can be inferred about the parameters of interest object of a experimental measurement.oai:cds.cern.ch:23129982018
spellingShingle	Machine Learning De Castro Manzano, Pablo Direct Learning of Systematics-Aware Summary Statistics
title	Direct Learning of Systematics-Aware Summary Statistics
title_full	Direct Learning of Systematics-Aware Summary Statistics
title_fullStr	Direct Learning of Systematics-Aware Summary Statistics
title_full_unstemmed	Direct Learning of Systematics-Aware Summary Statistics
title_short	Direct Learning of Systematics-Aware Summary Statistics
title_sort	direct learning of systematics-aware summary statistics
topic	Machine Learning
url	http://cds.cern.ch/record/2312998
work_keys_str_mv	AT decastromanzanopablo directlearningofsystematicsawaresummarystatistics AT decastromanzanopablo 2ndimlmachinelearningworkshop

Direct Learning of Systematics-Aware Summary Statistics

Ejemplares similares