Deep reinforcement learning hands-on: apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more
Main author: | Lapan, Maxim |
---|---|
Language: | eng |
Published: | Packt Publishing, 2018 |
Subjects: | Computing and Computers |
Online access: | http://cds.cern.ch/record/2634441 |
_version_ | 1780959709858103296 |
---|---|
author | Lapan, Maxim |
author_facet | Lapan, Maxim |
author_sort | Lapan, Maxim |
collection | CERN |
id | cern-2634441 |
institution | European Organization for Nuclear Research |
language | eng |
publishDate | 2018 |
publisher | Packt Publishing |
record_format | invenio |
spelling | cern-2634441 2021-04-21T18:44:31Z http://cds.cern.ch/record/2634441 eng Lapan, Maxim Deep reinforcement learning hands-on: apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more Computing and Computers Packt Publishing oai:cds.cern.ch:2634441 2018 |
spellingShingle | Computing and Computers Lapan, Maxim Deep reinforcement learning hands-on: apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more |
title | Deep reinforcement learning hands-on: apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more |
title_full | Deep reinforcement learning hands-on: apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more |
title_fullStr | Deep reinforcement learning hands-on: apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more |
title_full_unstemmed | Deep reinforcement learning hands-on: apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more |
title_short | Deep reinforcement learning hands-on: apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more |
title_sort | deep reinforcement learning hands-on: apply modern rl methods, with deep q-networks, value iteration, policy gradients, trpo, alphago zero and more |
topic | Computing and Computers |
url | http://cds.cern.ch/record/2634441 |
work_keys_str_mv | AT lapanmaxim deepreinforcementlearninghandsonapplymodernrlmethodswithdeepqnetworksvalueiterationpolicygradientstrpoalphagozeroandmore |