Cargando…
Grounding the Meanings in Sensorimotor Behavior using Reinforcement Learning
The recent outburst of interest in cognitive developmental robotics is fueled by the ambition to propose ecologically plausible mechanisms of how, among other things, a learning agent/robot could ground linguistic meanings in its sensorimotor behavior. Along this stream, we propose a model that allo...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Research Foundation
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3289932/ https://www.ncbi.nlm.nih.gov/pubmed/22393319 http://dx.doi.org/10.3389/fnbot.2012.00001 |
_version_ | 1782224919285727232 |
---|---|
author | Farkaš, Igor Malík, Tomáš Rebrová, Kristína |
author_facet | Farkaš, Igor Malík, Tomáš Rebrová, Kristína |
author_sort | Farkaš, Igor |
collection | PubMed |
description | The recent outburst of interest in cognitive developmental robotics is fueled by the ambition to propose ecologically plausible mechanisms of how, among other things, a learning agent/robot could ground linguistic meanings in its sensorimotor behavior. Along this stream, we propose a model that allows the simulated iCub robot to learn the meanings of actions (point, touch, and push) oriented toward objects in robot’s peripersonal space. In our experiments, the iCub learns to execute motor actions and comment on them. Architecturally, the model is composed of three neural-network-based modules that are trained in different ways. The first module, a two-layer perceptron, is trained by back-propagation to attend to the target position in the visual scene, given the low-level visual information and the feature-based target information. The second module, having the form of an actor-critic architecture, is the most distinguishing part of our model, and is trained by a continuous version of reinforcement learning to execute actions as sequences, based on a linguistic command. The third module, an echo-state network, is trained to provide the linguistic description of the executed actions. The trained model generalizes well in case of novel action-target combinations with randomized initial arm positions. It can also promptly adapt its behavior if the action/target suddenly changes during motor execution. |
format | Online Article Text |
id | pubmed-3289932 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Frontiers Research Foundation |
record_format | MEDLINE/PubMed |
spelling | pubmed-32899322012-03-05 Grounding the Meanings in Sensorimotor Behavior using Reinforcement Learning Farkaš, Igor Malík, Tomáš Rebrová, Kristína Front Neurorobot Neuroscience The recent outburst of interest in cognitive developmental robotics is fueled by the ambition to propose ecologically plausible mechanisms of how, among other things, a learning agent/robot could ground linguistic meanings in its sensorimotor behavior. Along this stream, we propose a model that allows the simulated iCub robot to learn the meanings of actions (point, touch, and push) oriented toward objects in robot’s peripersonal space. In our experiments, the iCub learns to execute motor actions and comment on them. Architecturally, the model is composed of three neural-network-based modules that are trained in different ways. The first module, a two-layer perceptron, is trained by back-propagation to attend to the target position in the visual scene, given the low-level visual information and the feature-based target information. The second module, having the form of an actor-critic architecture, is the most distinguishing part of our model, and is trained by a continuous version of reinforcement learning to execute actions as sequences, based on a linguistic command. The third module, an echo-state network, is trained to provide the linguistic description of the executed actions. The trained model generalizes well in case of novel action-target combinations with randomized initial arm positions. It can also promptly adapt its behavior if the action/target suddenly changes during motor execution. Frontiers Research Foundation 2012-02-29 /pmc/articles/PMC3289932/ /pubmed/22393319 http://dx.doi.org/10.3389/fnbot.2012.00001 Text en Copyright © 2012 Farkaš, Malík and Rebrová. http://www.frontiersin.org/licenseagreement This is an open-access article distributed under the terms of the Creative Commons Attribution Non Commercial License, which permits non-commercial use, distribution, and reproduction in other forums, provided the original authors and source are credited. |
spellingShingle | Neuroscience Farkaš, Igor Malík, Tomáš Rebrová, Kristína Grounding the Meanings in Sensorimotor Behavior using Reinforcement Learning |
title | Grounding the Meanings in Sensorimotor Behavior using Reinforcement Learning |
title_full | Grounding the Meanings in Sensorimotor Behavior using Reinforcement Learning |
title_fullStr | Grounding the Meanings in Sensorimotor Behavior using Reinforcement Learning |
title_full_unstemmed | Grounding the Meanings in Sensorimotor Behavior using Reinforcement Learning |
title_short | Grounding the Meanings in Sensorimotor Behavior using Reinforcement Learning |
title_sort | grounding the meanings in sensorimotor behavior using reinforcement learning |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3289932/ https://www.ncbi.nlm.nih.gov/pubmed/22393319 http://dx.doi.org/10.3389/fnbot.2012.00001 |
work_keys_str_mv | AT farkasigor groundingthemeaningsinsensorimotorbehaviorusingreinforcementlearning AT maliktomas groundingthemeaningsinsensorimotorbehaviorusingreinforcementlearning AT rebrovakristina groundingthemeaningsinsensorimotorbehaviorusingreinforcementlearning |