Cargando…

Constrained stochastic optimal control with learned importance sampling: A path integral approach

Modern robotic systems are expected to operate robustly in partially unknown environments. This article proposes an algorithm capable of controlling a wide range of high-dimensional robotic systems in such challenging scenarios. Our method is based on the path integral formulation of stochastic opti...

Descripción completa

Detalles Bibliográficos
Autores principales:	Carius, Jan, Ranftl, René, Farshidian, Farbod, Hutter, Marco
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	SAGE Publications 2021
Materias:	Articles
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9179940/ https://www.ncbi.nlm.nih.gov/pubmed/35694721 http://dx.doi.org/10.1177/02783649211047890

_version_	1784723394124578816
author	Carius, Jan Ranftl, René Farshidian, Farbod Hutter, Marco
author_facet	Carius, Jan Ranftl, René Farshidian, Farbod Hutter, Marco
author_sort	Carius, Jan
collection	PubMed
description	Modern robotic systems are expected to operate robustly in partially unknown environments. This article proposes an algorithm capable of controlling a wide range of high-dimensional robotic systems in such challenging scenarios. Our method is based on the path integral formulation of stochastic optimal control, which we extend with constraint-handling capabilities. Under our control law, the optimal input is inferred from a set of stochastic rollouts of the system dynamics. These rollouts are simulated by a physics engine, placing minimal restrictions on the types of systems and environments that can be modeled. Although sampling-based algorithms are typically not suitable for online control, we demonstrate in this work how importance sampling and constraints can be used to effectively curb the sampling complexity and enable real-time control applications. Furthermore, the path integral framework provides a natural way of incorporating existing control architectures as ancillary controllers for shaping the sampling distribution. Our results reveal that even in cases where the ancillary controller would fail, our stochastic control algorithm provides an additional safety and robustness layer. Moreover, in the absence of an existing ancillary controller, our method can be used to train a parametrized importance sampling policy using data from the stochastic rollouts. The algorithm may thereby bootstrap itself by learning an importance sampling policy offline and then refining it to unseen environments during online control. We validate our results on three robotic systems, including hardware experiments on a quadrupedal robot.
format	Online Article Text
id	pubmed-9179940
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	SAGE Publications
record_format	MEDLINE/PubMed
spelling	pubmed-91799402022-06-10 Constrained stochastic optimal control with learned importance sampling: A path integral approach Carius, Jan Ranftl, René Farshidian, Farbod Hutter, Marco Int J Rob Res Articles Modern robotic systems are expected to operate robustly in partially unknown environments. This article proposes an algorithm capable of controlling a wide range of high-dimensional robotic systems in such challenging scenarios. Our method is based on the path integral formulation of stochastic optimal control, which we extend with constraint-handling capabilities. Under our control law, the optimal input is inferred from a set of stochastic rollouts of the system dynamics. These rollouts are simulated by a physics engine, placing minimal restrictions on the types of systems and environments that can be modeled. Although sampling-based algorithms are typically not suitable for online control, we demonstrate in this work how importance sampling and constraints can be used to effectively curb the sampling complexity and enable real-time control applications. Furthermore, the path integral framework provides a natural way of incorporating existing control architectures as ancillary controllers for shaping the sampling distribution. Our results reveal that even in cases where the ancillary controller would fail, our stochastic control algorithm provides an additional safety and robustness layer. Moreover, in the absence of an existing ancillary controller, our method can be used to train a parametrized importance sampling policy using data from the stochastic rollouts. The algorithm may thereby bootstrap itself by learning an importance sampling policy offline and then refining it to unseen environments during online control. We validate our results on three robotic systems, including hardware experiments on a quadrupedal robot. SAGE Publications 2021-10-12 2022-02 /pmc/articles/PMC9179940/ /pubmed/35694721 http://dx.doi.org/10.1177/02783649211047890 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/This article is distributed under the terms of the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle	Articles Carius, Jan Ranftl, René Farshidian, Farbod Hutter, Marco Constrained stochastic optimal control with learned importance sampling: A path integral approach
title	Constrained stochastic optimal control with learned importance sampling: A path integral approach
title_full	Constrained stochastic optimal control with learned importance sampling: A path integral approach
title_fullStr	Constrained stochastic optimal control with learned importance sampling: A path integral approach
title_full_unstemmed	Constrained stochastic optimal control with learned importance sampling: A path integral approach
title_short	Constrained stochastic optimal control with learned importance sampling: A path integral approach
title_sort	constrained stochastic optimal control with learned importance sampling: a path integral approach
topic	Articles
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9179940/ https://www.ncbi.nlm.nih.gov/pubmed/35694721 http://dx.doi.org/10.1177/02783649211047890
work_keys_str_mv	AT cariusjan constrainedstochasticoptimalcontrolwithlearnedimportancesamplingapathintegralapproach AT ranftlrene constrainedstochasticoptimalcontrolwithlearnedimportancesamplingapathintegralapproach AT farshidianfarbod constrainedstochasticoptimalcontrolwithlearnedimportancesamplingapathintegralapproach AT huttermarco constrainedstochasticoptimalcontrolwithlearnedimportancesamplingapathintegralapproach

Constrained stochastic optimal control with learned importance sampling: A path integral approach

Ejemplares similares