Cargando…
A tutorial on linear function approximators for dynamic programming and reinforcement learning
This tutorial reviews techniques for planning and learning in Markov Decision Processes (MDPs) with linear function approximation of the value function. Two major paradigms for finding optimal policies were considered: dynamic programming (DP) techniques for planning and reinforcement learning (RL).
Autores principales: | , , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
Now Publishers
2013
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/2762208 |