Cargando…

Customizing skills for assistive robotic manipulators, an inverse reinforcement learning approach with error-related potentials

Robotic assistance via motorized robotic arm manipulators can be of valuable assistance to individuals with upper-limb motor disabilities. Brain-computer interfaces (BCI) offer an intuitive means to control such assistive robotic manipulators. However, BCI performance may vary due to the non-station...

Descripción completa

Detalles Bibliográficos
Autores principales: Batzianoulis, Iason, Iwane, Fumiaki, Wei, Shupeng, Correia, Carolina Gaspar Pinto Ramos, Chavarriaga, Ricardo, Millán, José del R., Billard, Aude
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8677775/
https://www.ncbi.nlm.nih.gov/pubmed/34916587
http://dx.doi.org/10.1038/s42003-021-02891-8
_version_ 1784616212360069120
author Batzianoulis, Iason
Iwane, Fumiaki
Wei, Shupeng
Correia, Carolina Gaspar Pinto Ramos
Chavarriaga, Ricardo
Millán, José del R.
Billard, Aude
author_facet Batzianoulis, Iason
Iwane, Fumiaki
Wei, Shupeng
Correia, Carolina Gaspar Pinto Ramos
Chavarriaga, Ricardo
Millán, José del R.
Billard, Aude
author_sort Batzianoulis, Iason
collection PubMed
description Robotic assistance via motorized robotic arm manipulators can be of valuable assistance to individuals with upper-limb motor disabilities. Brain-computer interfaces (BCI) offer an intuitive means to control such assistive robotic manipulators. However, BCI performance may vary due to the non-stationary nature of the electroencephalogram (EEG) signals. It, hence, cannot be used safely for controlling tasks where errors may be detrimental to the user. Avoiding obstacles is one such task. As there exist many techniques to avoid obstacles in robotics, we propose to give the control to the robot to avoid obstacles and to leave to the user the choice of the robot behavior to do so a matter of personal preference as some users may be more daring while others more careful. We enable the users to train the robot controller to adapt its way to approach obstacles relying on BCI that detects error-related potentials (ErrP), indicative of the user’s error expectation of the robot’s current strategy to meet their preferences. Gaussian process-based inverse reinforcement learning, in combination with the ErrP-BCI, infers the user’s preference and updates the obstacle avoidance controller so as to generate personalized robot trajectories. We validate the approach in experiments with thirteen able-bodied subjects using a robotic arm that picks up, places and avoids real-life objects. Results show that the algorithm can learn user’s preference and adapt the robot behavior rapidly using less than five demonstrations not necessarily optimal.
format Online
Article
Text
id pubmed-8677775
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-86777752022-01-04 Customizing skills for assistive robotic manipulators, an inverse reinforcement learning approach with error-related potentials Batzianoulis, Iason Iwane, Fumiaki Wei, Shupeng Correia, Carolina Gaspar Pinto Ramos Chavarriaga, Ricardo Millán, José del R. Billard, Aude Commun Biol Article Robotic assistance via motorized robotic arm manipulators can be of valuable assistance to individuals with upper-limb motor disabilities. Brain-computer interfaces (BCI) offer an intuitive means to control such assistive robotic manipulators. However, BCI performance may vary due to the non-stationary nature of the electroencephalogram (EEG) signals. It, hence, cannot be used safely for controlling tasks where errors may be detrimental to the user. Avoiding obstacles is one such task. As there exist many techniques to avoid obstacles in robotics, we propose to give the control to the robot to avoid obstacles and to leave to the user the choice of the robot behavior to do so a matter of personal preference as some users may be more daring while others more careful. We enable the users to train the robot controller to adapt its way to approach obstacles relying on BCI that detects error-related potentials (ErrP), indicative of the user’s error expectation of the robot’s current strategy to meet their preferences. Gaussian process-based inverse reinforcement learning, in combination with the ErrP-BCI, infers the user’s preference and updates the obstacle avoidance controller so as to generate personalized robot trajectories. We validate the approach in experiments with thirteen able-bodied subjects using a robotic arm that picks up, places and avoids real-life objects. Results show that the algorithm can learn user’s preference and adapt the robot behavior rapidly using less than five demonstrations not necessarily optimal. Nature Publishing Group UK 2021-12-16 /pmc/articles/PMC8677775/ /pubmed/34916587 http://dx.doi.org/10.1038/s42003-021-02891-8 Text en © The Author(s) 2021, corrected publication 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Batzianoulis, Iason
Iwane, Fumiaki
Wei, Shupeng
Correia, Carolina Gaspar Pinto Ramos
Chavarriaga, Ricardo
Millán, José del R.
Billard, Aude
Customizing skills for assistive robotic manipulators, an inverse reinforcement learning approach with error-related potentials
title Customizing skills for assistive robotic manipulators, an inverse reinforcement learning approach with error-related potentials
title_full Customizing skills for assistive robotic manipulators, an inverse reinforcement learning approach with error-related potentials
title_fullStr Customizing skills for assistive robotic manipulators, an inverse reinforcement learning approach with error-related potentials
title_full_unstemmed Customizing skills for assistive robotic manipulators, an inverse reinforcement learning approach with error-related potentials
title_short Customizing skills for assistive robotic manipulators, an inverse reinforcement learning approach with error-related potentials
title_sort customizing skills for assistive robotic manipulators, an inverse reinforcement learning approach with error-related potentials
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8677775/
https://www.ncbi.nlm.nih.gov/pubmed/34916587
http://dx.doi.org/10.1038/s42003-021-02891-8
work_keys_str_mv AT batzianoulisiason customizingskillsforassistiveroboticmanipulatorsaninversereinforcementlearningapproachwitherrorrelatedpotentials
AT iwanefumiaki customizingskillsforassistiveroboticmanipulatorsaninversereinforcementlearningapproachwitherrorrelatedpotentials
AT weishupeng customizingskillsforassistiveroboticmanipulatorsaninversereinforcementlearningapproachwitherrorrelatedpotentials
AT correiacarolinagasparpintoramos customizingskillsforassistiveroboticmanipulatorsaninversereinforcementlearningapproachwitherrorrelatedpotentials
AT chavarriagaricardo customizingskillsforassistiveroboticmanipulatorsaninversereinforcementlearningapproachwitherrorrelatedpotentials
AT millanjosedelr customizingskillsforassistiveroboticmanipulatorsaninversereinforcementlearningapproachwitherrorrelatedpotentials
AT billardaude customizingskillsforassistiveroboticmanipulatorsaninversereinforcementlearningapproachwitherrorrelatedpotentials