
Invited talk: Deep Learning Meets Physics

Deep Learning has emerged as one of the most successful fields of machine learning and artificial intelligence, with overwhelming success on industrial speech, text, and vision benchmarks. Consequently, it has evolved into the central field of research for IT giants such as Google, Facebook, Microsoft, Baidu, and Amazon. Deep Learning is founded on novel neural network techniques, the recent availability of very fast computers, and massive data sets. At its core, Deep Learning discovers multiple levels of abstract representations of the input.

The main obstacle to learning deep neural networks is the vanishing gradient problem. The vanishing gradient impedes credit assignment to the first layers of a deep network or to early elements of a sequence, and therefore limits model selection. Major advances in Deep Learning can be traced to techniques that avoid the vanishing gradient, such as stacking, ReLUs, residual networks, highway networks, and the LSTM. We proposed self-normalizing neural networks (SNNs), which automatically avoid the vanishing gradient.

In unsupervised Deep Learning, generative adversarial networks (GANs) excel at generating realistic images, outperforming all previous approaches. We proved that a two time-scale update rule for training GANs converges under mild assumptions to a local Nash equilibrium. For deep reinforcement learning, we introduced a new approach to learning long-delayed rewards, for which value-function estimation methods such as temporal difference, Monte Carlo, or Monte Carlo Tree Search fail.

Current applications of Deep Learning in physics include the analysis of ATLAS data, e.g. to identify Higgs boson measurements; quantum chemistry; energy prediction without the Schrödinger equation and wave functions; and quantum state classification. Conversely, methods from physics are used to describe Deep Learning systems: the Fokker-Planck equation describes the behavior of stochastic gradient descent, which finds flat minima in error surfaces, and we use electric field equations to define a new GAN objective that can be proved, via the continuity equation, to have a single (global) Nash equilibrium.
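The vanishing gradient claim above can be made concrete with a standard back-propagation identity (a textbook sketch, not taken from the talk itself): the gradient that reaches the first layer is a product of per-layer Jacobians, so whenever those Jacobians have norm below one, the signal decays exponentially with depth T:

\[
\frac{\partial L}{\partial h_0}
  = \left( J_T \cdots J_1 \right)^{\top} \frac{\partial L}{\partial h_T},
\qquad
J_t = \frac{\partial h_t}{\partial h_{t-1}},
\qquad
\left\lVert \frac{\partial L}{\partial h_0} \right\rVert
  \le \Big( \max_t \lVert J_t \rVert \Big)^{T}
      \left\lVert \frac{\partial L}{\partial h_T} \right\rVert .
\]

With \(\max_t \lVert J_t \rVert < 1\) the bound shrinks geometrically in T, which is why the early layers of a deep network, or the early elements of a sequence in a recurrent network, receive almost no credit.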
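The self-normalizing neural networks mentioned in the abstract are built around the SELU activation (Klambauer et al., 2017, from the same group). A minimal NumPy sketch with the published constants, illustrating the self-normalizing effect:

import numpy as np

# SELU: the scaled exponential linear unit behind self-normalizing
# neural networks (SNNs). The two fixed constants are chosen so that
# activations are pushed toward zero mean and unit variance from layer
# to layer, which keeps gradients in a healthy range without explicit
# normalization layers.
ALPHA = 1.6732632423543772
SCALE = 1.0507009873554805

def selu(x):
    return SCALE * np.where(x > 0.0, x, ALPHA * (np.exp(x) - 1.0))

# Repeated application through random layers (weights with variance
# 1/n, as assumed by the SNN analysis) keeps activation statistics
# close to mean 0 and standard deviation 1.
rng = np.random.default_rng(0)
h = rng.standard_normal(1000)
for _ in range(10):
    w = rng.normal(0.0, 1.0 / np.sqrt(h.size), size=(h.size, h.size))
    h = selu(w @ h)
print(round(h.mean(), 2), round(h.std(), 2))  # stays near 0 and 1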
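The two time-scale update rule (TTUR) referenced for GANs amounts to training the discriminator on its own, faster time scale (a larger learning rate) than the generator; Heusel et al. (2017) prove convergence to a local Nash equilibrium under mild assumptions. A minimal PyTorch sketch on toy 1-D data (the tiny architectures and the specific rates are illustrative assumptions, not from the talk):

import torch
import torch.nn as nn

# Toy generator (8-D noise -> 1-D sample) and discriminator
# (1-D sample -> logit). Only the two distinct learning rates
# matter for the TTUR idea.
G = nn.Sequential(nn.Linear(8, 16), nn.ReLU(), nn.Linear(16, 1))
D = nn.Sequential(nn.Linear(1, 16), nn.ReLU(), nn.Linear(16, 1))

opt_D = torch.optim.Adam(D.parameters(), lr=4e-4)  # fast time scale
opt_G = torch.optim.Adam(G.parameters(), lr=1e-4)  # slow time scale
bce = nn.BCEWithLogitsLoss()

for step in range(200):
    real = 3.0 + 2.0 * torch.randn(32, 1)   # stand-in "real" data
    fake = G(torch.randn(32, 8))

    # Discriminator step (fake is detached so G is untouched here).
    loss_D = (bce(D(real), torch.ones(32, 1))
              + bce(D(fake.detach()), torch.zeros(32, 1)))
    opt_D.zero_grad()
    loss_D.backward()
    opt_D.step()

    # Generator step on the slower time scale.
    loss_G = bce(D(fake), torch.ones(32, 1))
    opt_G.zero_grad()
    loss_G.backward()
    opt_G.step()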


Bibliographic Details
Main author: Hochreiter, Sepp
Event: IML Machine Learning Working Group: sequential models
Institution: European Organization for Nuclear Research (CERN)
Language: English
Published: 2018
Subjects: Machine Learning
Online access: http://cds.cern.ch/record/2621728