Cargando…

Improving Robot Motor Learning with Negatively Valenced Reinforcement Signals

Both nociception and punishment signals have been used in robotics. However, the potential for using these negatively valenced types of reinforcement learning signals for robot learning has not been exploited in detail yet. Nociceptive signals are primarily used as triggers of preprogrammed action s...

Descripción completa

Detalles Bibliográficos
Autores principales:	Navarro-Guerrero, Nicolás, Lowe, Robert J., Wermter, Stefan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2017
Materias:	Neuroscience
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5376586/ https://www.ncbi.nlm.nih.gov/pubmed/28420976 http://dx.doi.org/10.3389/fnbot.2017.00010

_version_	1782519182956429312
author	Navarro-Guerrero, Nicolás Lowe, Robert J. Wermter, Stefan
author_facet	Navarro-Guerrero, Nicolás Lowe, Robert J. Wermter, Stefan
author_sort	Navarro-Guerrero, Nicolás
collection	PubMed
description	Both nociception and punishment signals have been used in robotics. However, the potential for using these negatively valenced types of reinforcement learning signals for robot learning has not been exploited in detail yet. Nociceptive signals are primarily used as triggers of preprogrammed action sequences. Punishment signals are typically disembodied, i.e., with no or little relation to the agent-intrinsic limitations, and they are often used to impose behavioral constraints. Here, we provide an alternative approach for nociceptive signals as drivers of learning rather than simple triggers of preprogrammed behavior. Explicitly, we use nociception to expand the state space while we use punishment as a negative reinforcement learning signal. We compare the performance—in terms of task error, the amount of perceived nociception, and length of learned action sequences—of different neural networks imbued with punishment-based reinforcement signals for inverse kinematic learning. We contrast the performance of a version of the neural network that receives nociceptive inputs to that without such a process. Furthermore, we provide evidence that nociception can improve learning—making the algorithm more robust against network initializations—as well as behavioral performance by reducing the task error, perceived nociception, and length of learned action sequences. Moreover, we provide evidence that punishment, at least as typically used within reinforcement learning applications, may be detrimental in all relevant metrics.
format	Online Article Text
id	pubmed-5376586
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-53765862017-04-18 Improving Robot Motor Learning with Negatively Valenced Reinforcement Signals Navarro-Guerrero, Nicolás Lowe, Robert J. Wermter, Stefan Front Neurorobot Neuroscience Both nociception and punishment signals have been used in robotics. However, the potential for using these negatively valenced types of reinforcement learning signals for robot learning has not been exploited in detail yet. Nociceptive signals are primarily used as triggers of preprogrammed action sequences. Punishment signals are typically disembodied, i.e., with no or little relation to the agent-intrinsic limitations, and they are often used to impose behavioral constraints. Here, we provide an alternative approach for nociceptive signals as drivers of learning rather than simple triggers of preprogrammed behavior. Explicitly, we use nociception to expand the state space while we use punishment as a negative reinforcement learning signal. We compare the performance—in terms of task error, the amount of perceived nociception, and length of learned action sequences—of different neural networks imbued with punishment-based reinforcement signals for inverse kinematic learning. We contrast the performance of a version of the neural network that receives nociceptive inputs to that without such a process. Furthermore, we provide evidence that nociception can improve learning—making the algorithm more robust against network initializations—as well as behavioral performance by reducing the task error, perceived nociception, and length of learned action sequences. Moreover, we provide evidence that punishment, at least as typically used within reinforcement learning applications, may be detrimental in all relevant metrics. Frontiers Media S.A. 2017-04-03 /pmc/articles/PMC5376586/ /pubmed/28420976 http://dx.doi.org/10.3389/fnbot.2017.00010 Text en Copyright © 2017 Navarro-Guerrero, Lowe and Wermter. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Neuroscience Navarro-Guerrero, Nicolás Lowe, Robert J. Wermter, Stefan Improving Robot Motor Learning with Negatively Valenced Reinforcement Signals
title	Improving Robot Motor Learning with Negatively Valenced Reinforcement Signals
title_full	Improving Robot Motor Learning with Negatively Valenced Reinforcement Signals
title_fullStr	Improving Robot Motor Learning with Negatively Valenced Reinforcement Signals
title_full_unstemmed	Improving Robot Motor Learning with Negatively Valenced Reinforcement Signals
title_short	Improving Robot Motor Learning with Negatively Valenced Reinforcement Signals
title_sort	improving robot motor learning with negatively valenced reinforcement signals
topic	Neuroscience
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5376586/ https://www.ncbi.nlm.nih.gov/pubmed/28420976 http://dx.doi.org/10.3389/fnbot.2017.00010
work_keys_str_mv	AT navarroguerreronicolas improvingrobotmotorlearningwithnegativelyvalencedreinforcementsignals AT lowerobertj improvingrobotmotorlearningwithnegativelyvalencedreinforcementsignals AT wermterstefan improvingrobotmotorlearningwithnegativelyvalencedreinforcementsignals

Improving Robot Motor Learning with Negatively Valenced Reinforcement Signals

Ejemplares similares