Cargando…

A Reinforcement Learning Model Equipped with Sensors for Generating Perception Patterns: Implementation of a Simulated Air Navigation System Using ADS-B (Automatic Dependent Surveillance-Broadcast) Technology

Over the last few decades, a number of reinforcement learning techniques have emerged, and different reinforcement learning-based applications have proliferated. However, such techniques tend to specialize in a particular field. This is an obstacle to their generalization and extrapolation to other...

Descripción completa

Detalles Bibliográficos
Autores principales:	Álvarez de Toledo, Santiago, Anguera, Aurea, Barreiro, José M., Lara, Juan A., Lizcano, David
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2017
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5298761/ https://www.ncbi.nlm.nih.gov/pubmed/28106849 http://dx.doi.org/10.3390/s17010188

_version_	1782505926661505024
author	Álvarez de Toledo, Santiago Anguera, Aurea Barreiro, José M. Lara, Juan A. Lizcano, David
author_facet	Álvarez de Toledo, Santiago Anguera, Aurea Barreiro, José M. Lara, Juan A. Lizcano, David
author_sort	Álvarez de Toledo, Santiago
collection	PubMed
description	Over the last few decades, a number of reinforcement learning techniques have emerged, and different reinforcement learning-based applications have proliferated. However, such techniques tend to specialize in a particular field. This is an obstacle to their generalization and extrapolation to other areas. Besides, neither the reward-punishment (r-p) learning process nor the convergence of results is fast and efficient enough. To address these obstacles, this research proposes a general reinforcement learning model. This model is independent of input and output types and based on general bioinspired principles that help to speed up the learning process. The model is composed of a perception module based on sensors whose specific perceptions are mapped as perception patterns. In this manner, similar perceptions (even if perceived at different positions in the environment) are accounted for by the same perception pattern. Additionally, the model includes a procedure that statistically associates perception-action pattern pairs depending on the positive or negative results output by executing the respective action in response to a particular perception during the learning process. To do this, the model is fitted with a mechanism that reacts positively or negatively to particular sensory stimuli in order to rate results. The model is supplemented by an action module that can be configured depending on the maneuverability of each specific agent. The model has been applied in the air navigation domain, a field with strong safety restrictions, which led us to implement a simulated system equipped with the proposed model. Accordingly, the perception sensors were based on Automatic Dependent Surveillance-Broadcast (ADS-B) technology, which is described in this paper. The results were quite satisfactory, and it outperformed traditional methods existing in the literature with respect to learning reliability and efficiency.
format	Online Article Text
id	pubmed-5298761
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-52987612017-02-10 A Reinforcement Learning Model Equipped with Sensors for Generating Perception Patterns: Implementation of a Simulated Air Navigation System Using ADS-B (Automatic Dependent Surveillance-Broadcast) Technology Álvarez de Toledo, Santiago Anguera, Aurea Barreiro, José M. Lara, Juan A. Lizcano, David Sensors (Basel) Article Over the last few decades, a number of reinforcement learning techniques have emerged, and different reinforcement learning-based applications have proliferated. However, such techniques tend to specialize in a particular field. This is an obstacle to their generalization and extrapolation to other areas. Besides, neither the reward-punishment (r-p) learning process nor the convergence of results is fast and efficient enough. To address these obstacles, this research proposes a general reinforcement learning model. This model is independent of input and output types and based on general bioinspired principles that help to speed up the learning process. The model is composed of a perception module based on sensors whose specific perceptions are mapped as perception patterns. In this manner, similar perceptions (even if perceived at different positions in the environment) are accounted for by the same perception pattern. Additionally, the model includes a procedure that statistically associates perception-action pattern pairs depending on the positive or negative results output by executing the respective action in response to a particular perception during the learning process. To do this, the model is fitted with a mechanism that reacts positively or negatively to particular sensory stimuli in order to rate results. The model is supplemented by an action module that can be configured depending on the maneuverability of each specific agent. The model has been applied in the air navigation domain, a field with strong safety restrictions, which led us to implement a simulated system equipped with the proposed model. Accordingly, the perception sensors were based on Automatic Dependent Surveillance-Broadcast (ADS-B) technology, which is described in this paper. The results were quite satisfactory, and it outperformed traditional methods existing in the literature with respect to learning reliability and efficiency. MDPI 2017-01-19 /pmc/articles/PMC5298761/ /pubmed/28106849 http://dx.doi.org/10.3390/s17010188 Text en © 2017 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Álvarez de Toledo, Santiago Anguera, Aurea Barreiro, José M. Lara, Juan A. Lizcano, David A Reinforcement Learning Model Equipped with Sensors for Generating Perception Patterns: Implementation of a Simulated Air Navigation System Using ADS-B (Automatic Dependent Surveillance-Broadcast) Technology
title	A Reinforcement Learning Model Equipped with Sensors for Generating Perception Patterns: Implementation of a Simulated Air Navigation System Using ADS-B (Automatic Dependent Surveillance-Broadcast) Technology
title_full	A Reinforcement Learning Model Equipped with Sensors for Generating Perception Patterns: Implementation of a Simulated Air Navigation System Using ADS-B (Automatic Dependent Surveillance-Broadcast) Technology
title_fullStr	A Reinforcement Learning Model Equipped with Sensors for Generating Perception Patterns: Implementation of a Simulated Air Navigation System Using ADS-B (Automatic Dependent Surveillance-Broadcast) Technology
title_full_unstemmed	A Reinforcement Learning Model Equipped with Sensors for Generating Perception Patterns: Implementation of a Simulated Air Navigation System Using ADS-B (Automatic Dependent Surveillance-Broadcast) Technology
title_short	A Reinforcement Learning Model Equipped with Sensors for Generating Perception Patterns: Implementation of a Simulated Air Navigation System Using ADS-B (Automatic Dependent Surveillance-Broadcast) Technology
title_sort	reinforcement learning model equipped with sensors for generating perception patterns: implementation of a simulated air navigation system using ads-b (automatic dependent surveillance-broadcast) technology
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5298761/ https://www.ncbi.nlm.nih.gov/pubmed/28106849 http://dx.doi.org/10.3390/s17010188
work_keys_str_mv	AT alvarezdetoledosantiago areinforcementlearningmodelequippedwithsensorsforgeneratingperceptionpatternsimplementationofasimulatedairnavigationsystemusingadsbautomaticdependentsurveillancebroadcasttechnology AT angueraaurea areinforcementlearningmodelequippedwithsensorsforgeneratingperceptionpatternsimplementationofasimulatedairnavigationsystemusingadsbautomaticdependentsurveillancebroadcasttechnology AT barreirojosem areinforcementlearningmodelequippedwithsensorsforgeneratingperceptionpatternsimplementationofasimulatedairnavigationsystemusingadsbautomaticdependentsurveillancebroadcasttechnology AT larajuana areinforcementlearningmodelequippedwithsensorsforgeneratingperceptionpatternsimplementationofasimulatedairnavigationsystemusingadsbautomaticdependentsurveillancebroadcasttechnology AT lizcanodavid areinforcementlearningmodelequippedwithsensorsforgeneratingperceptionpatternsimplementationofasimulatedairnavigationsystemusingadsbautomaticdependentsurveillancebroadcasttechnology AT alvarezdetoledosantiago reinforcementlearningmodelequippedwithsensorsforgeneratingperceptionpatternsimplementationofasimulatedairnavigationsystemusingadsbautomaticdependentsurveillancebroadcasttechnology AT angueraaurea reinforcementlearningmodelequippedwithsensorsforgeneratingperceptionpatternsimplementationofasimulatedairnavigationsystemusingadsbautomaticdependentsurveillancebroadcasttechnology AT barreirojosem reinforcementlearningmodelequippedwithsensorsforgeneratingperceptionpatternsimplementationofasimulatedairnavigationsystemusingadsbautomaticdependentsurveillancebroadcasttechnology AT larajuana reinforcementlearningmodelequippedwithsensorsforgeneratingperceptionpatternsimplementationofasimulatedairnavigationsystemusingadsbautomaticdependentsurveillancebroadcasttechnology AT lizcanodavid reinforcementlearningmodelequippedwithsensorsforgeneratingperceptionpatternsimplementationofasimulatedairnavigationsystemusingadsbautomaticdependentsurveillancebroadcasttechnology

A Reinforcement Learning Model Equipped with Sensors for Generating Perception Patterns: Implementation of a Simulated Air Navigation System Using ADS-B (Automatic Dependent Surveillance-Broadcast) Technology

Ejemplares similares