Cargando…
Risk-Aware Model-Based Control
Model-Based Reinforcement Learning (MBRL) algorithms have been shown to have an advantage on data-efficiency, but often overshadowed by state-of-the-art model-free methods in performance, especially when facing high-dimensional and complex problems. In this work, a novel MBRL method is proposed, cal...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7990789/ https://www.ncbi.nlm.nih.gov/pubmed/33778013 http://dx.doi.org/10.3389/frobt.2021.617839 |
_version_ | 1783669122674982912 |
---|---|
author | Yu, Chen Rosendo, Andre |
author_facet | Yu, Chen Rosendo, Andre |
author_sort | Yu, Chen |
collection | PubMed |
description | Model-Based Reinforcement Learning (MBRL) algorithms have been shown to have an advantage on data-efficiency, but often overshadowed by state-of-the-art model-free methods in performance, especially when facing high-dimensional and complex problems. In this work, a novel MBRL method is proposed, called Risk-Aware Model-Based Control (RAMCO). It combines uncertainty-aware deep dynamics models and the risk assessment technique Conditional Value at Risk (CVaR). This mechanism is appropriate for real-world application since it takes epistemic risk into consideration. In addition, we use a model-free solver to produce warm-up training data, and this setting improves the performance in low-dimensional environments and covers the shortage of MBRL’s nature in the high-dimensional scenarios. In comparison with other state-of-the-art reinforcement learning algorithms, we show that it produces superior results on a walking robot model. We also evaluate the method with an Eidos environment, which is a novel experimental method with multi-dimensional randomly initialized deep neural networks to measure the performance of any reinforcement learning algorithm, and the advantages of RAMCO are highlighted. |
format | Online Article Text |
id | pubmed-7990789 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-79907892021-03-26 Risk-Aware Model-Based Control Yu, Chen Rosendo, Andre Front Robot AI Robotics and AI Model-Based Reinforcement Learning (MBRL) algorithms have been shown to have an advantage on data-efficiency, but often overshadowed by state-of-the-art model-free methods in performance, especially when facing high-dimensional and complex problems. In this work, a novel MBRL method is proposed, called Risk-Aware Model-Based Control (RAMCO). It combines uncertainty-aware deep dynamics models and the risk assessment technique Conditional Value at Risk (CVaR). This mechanism is appropriate for real-world application since it takes epistemic risk into consideration. In addition, we use a model-free solver to produce warm-up training data, and this setting improves the performance in low-dimensional environments and covers the shortage of MBRL’s nature in the high-dimensional scenarios. In comparison with other state-of-the-art reinforcement learning algorithms, we show that it produces superior results on a walking robot model. We also evaluate the method with an Eidos environment, which is a novel experimental method with multi-dimensional randomly initialized deep neural networks to measure the performance of any reinforcement learning algorithm, and the advantages of RAMCO are highlighted. Frontiers Media S.A. 2021-03-11 /pmc/articles/PMC7990789/ /pubmed/33778013 http://dx.doi.org/10.3389/frobt.2021.617839 Text en Copyright © 2021 Yu and Rosendo. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Robotics and AI Yu, Chen Rosendo, Andre Risk-Aware Model-Based Control |
title | Risk-Aware Model-Based Control |
title_full | Risk-Aware Model-Based Control |
title_fullStr | Risk-Aware Model-Based Control |
title_full_unstemmed | Risk-Aware Model-Based Control |
title_short | Risk-Aware Model-Based Control |
title_sort | risk-aware model-based control |
topic | Robotics and AI |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7990789/ https://www.ncbi.nlm.nih.gov/pubmed/33778013 http://dx.doi.org/10.3389/frobt.2021.617839 |
work_keys_str_mv | AT yuchen riskawaremodelbasedcontrol AT rosendoandre riskawaremodelbasedcontrol |