Cargando…
Learning from animals: How to Navigate Complex Terrains
We develop a method to learn a bio-inspired motion control policy using data collected from hawkmoths navigating in a virtual forest. A Markov Decision Process (MDP) framework is introduced to model the dynamics of moths and sparse logistic regression is used to learn control policy parameters from...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6952082/ https://www.ncbi.nlm.nih.gov/pubmed/31917816 http://dx.doi.org/10.1371/journal.pcbi.1007452 |
_version_ | 1783486382358921216 |
---|---|
author | Zhu, Henghui Liu, Hao Ataei, Armin Munk, Yonatan Daniel, Thomas Paschalidis, Ioannis Ch. |
author_facet | Zhu, Henghui Liu, Hao Ataei, Armin Munk, Yonatan Daniel, Thomas Paschalidis, Ioannis Ch. |
author_sort | Zhu, Henghui |
collection | PubMed |
description | We develop a method to learn a bio-inspired motion control policy using data collected from hawkmoths navigating in a virtual forest. A Markov Decision Process (MDP) framework is introduced to model the dynamics of moths and sparse logistic regression is used to learn control policy parameters from the data. The results show that moths do not favor detailed obstacle location information in navigation, but rely heavily on optical flow. Using the policy learned from the moth data as a starting point, we propose an actor-critic learning algorithm to refine policy parameters and obtain a policy that can be used by an autonomous aerial vehicle operating in a cluttered environment. Compared with the moths’ policy, the policy we obtain integrates both obstacle location and optical flow. We compare the performance of these two policies in terms of their ability to navigate in artificial forest areas. While the optimized policy can adjust its parameters to outperform the moth’s policy in each different terrain, the moth’s policy exhibits a high level of robustness across terrains. |
format | Online Article Text |
id | pubmed-6952082 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-69520822020-01-17 Learning from animals: How to Navigate Complex Terrains Zhu, Henghui Liu, Hao Ataei, Armin Munk, Yonatan Daniel, Thomas Paschalidis, Ioannis Ch. PLoS Comput Biol Research Article We develop a method to learn a bio-inspired motion control policy using data collected from hawkmoths navigating in a virtual forest. A Markov Decision Process (MDP) framework is introduced to model the dynamics of moths and sparse logistic regression is used to learn control policy parameters from the data. The results show that moths do not favor detailed obstacle location information in navigation, but rely heavily on optical flow. Using the policy learned from the moth data as a starting point, we propose an actor-critic learning algorithm to refine policy parameters and obtain a policy that can be used by an autonomous aerial vehicle operating in a cluttered environment. Compared with the moths’ policy, the policy we obtain integrates both obstacle location and optical flow. We compare the performance of these two policies in terms of their ability to navigate in artificial forest areas. While the optimized policy can adjust its parameters to outperform the moth’s policy in each different terrain, the moth’s policy exhibits a high level of robustness across terrains. Public Library of Science 2020-01-09 /pmc/articles/PMC6952082/ /pubmed/31917816 http://dx.doi.org/10.1371/journal.pcbi.1007452 Text en © 2020 Zhu et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Zhu, Henghui Liu, Hao Ataei, Armin Munk, Yonatan Daniel, Thomas Paschalidis, Ioannis Ch. Learning from animals: How to Navigate Complex Terrains |
title | Learning from animals: How to Navigate Complex Terrains |
title_full | Learning from animals: How to Navigate Complex Terrains |
title_fullStr | Learning from animals: How to Navigate Complex Terrains |
title_full_unstemmed | Learning from animals: How to Navigate Complex Terrains |
title_short | Learning from animals: How to Navigate Complex Terrains |
title_sort | learning from animals: how to navigate complex terrains |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6952082/ https://www.ncbi.nlm.nih.gov/pubmed/31917816 http://dx.doi.org/10.1371/journal.pcbi.1007452 |
work_keys_str_mv | AT zhuhenghui learningfromanimalshowtonavigatecomplexterrains AT liuhao learningfromanimalshowtonavigatecomplexterrains AT ataeiarmin learningfromanimalshowtonavigatecomplexterrains AT munkyonatan learningfromanimalshowtonavigatecomplexterrains AT danielthomas learningfromanimalshowtonavigatecomplexterrains AT paschalidisioannisch learningfromanimalshowtonavigatecomplexterrains |