Cargando…
A tutorial on linear function approximators for dynamic programming and reinforcement learning
This tutorial reviews techniques for planning and learning in Markov Decision Processes (MDPs) with linear function approximation of the value function. Two major paradigms for finding optimal policies were considered: dynamic programming (DP) techniques for planning and reinforcement learning (RL).
Autores principales: | Geramifard, Alborz, Walsh, Thomas J, Stefanie, Tellex, Chowdhary, Girish, Roy, Nicholas, How, Jonathan P |
---|---|
Lenguaje: | eng |
Publicado: |
Now Publishers
2013
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/2762208 |
Ejemplares similares
-
Reinforcement learning and approximate dynamic programming for feedback control
por: Lewis, F, et al.
Publicado: (2013) -
Tutorial on Linear Colliders
por: Zimmermann, Frank
Publicado: (2000) -
A tutorial on Thompson sampling
por: Russo, Daniel J, et al.
Publicado: (2018) -
Solution of Linear Integral Equations using Padé Approximants
por: Chisholm, J S R
Publicado: (1963) -
Approximate Entropy and Sample Entropy: A Comprehensive Tutorial
por: Delgado-Bonal, Alfonso, et al.
Publicado: (2019)