Cargando…

Evaluation of linearly solvable Markov decision process with dynamic model learning in a mobile robot navigation task

Linearly solvable Markov Decision Process (LMDP) is a class of optimal control problem in which the Bellman's equation can be converted into a linear equation by an exponential transformation of the state value function (Todorov, 2009b). In an LMDP, the optimal value function and the correspond...

Descripción completa

Detalles Bibliográficos
Autores principales: Kinjo, Ken, Uchibe, Eiji, Doya, Kenji
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3617398/
https://www.ncbi.nlm.nih.gov/pubmed/23576983
http://dx.doi.org/10.3389/fnbot.2013.00007