Deep reinforcement learning hands-on: apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more

Bibliographic Details
Main author: Lapan, Maxim
Language: English
Published: Packt Publishing 2018
Subjects: Computing and Computers
Online access: http://cds.cern.ch/record/2634441
_version_ 1780959709858103296
author Lapan, Maxim
author_facet Lapan, Maxim
author_sort Lapan, Maxim
collection CERN
id cern-2634441
institution European Organization for Nuclear Research
language eng
publishDate 2018
publisher Packt Publishing
record_format invenio
spelling cern-2634441 2021-04-21T18:44:31Z http://cds.cern.ch/record/2634441 eng Lapan, Maxim Deep reinforcement learning hands-on: apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more Computing and Computers Packt Publishing oai:cds.cern.ch:2634441 2018
spellingShingle Computing and Computers
Lapan, Maxim
Deep reinforcement learning hands-on: apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more
title Deep reinforcement learning hands-on: apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more
title_full Deep reinforcement learning hands-on: apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more
title_fullStr Deep reinforcement learning hands-on: apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more
title_full_unstemmed Deep reinforcement learning hands-on: apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more
title_short Deep reinforcement learning hands-on: apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more
title_sort deep reinforcement learning hands-on: apply modern rl methods, with deep q-networks, value iteration, policy gradients, trpo, alphago zero and more
topic Computing and Computers
url http://cds.cern.ch/record/2634441
work_keys_str_mv AT lapanmaxim deepreinforcementlearninghandsonapplymodernrlmethodswithdeepqnetworksvalueiterationpolicygradientstrpoalphagozeroandmore