Cargando…
Learning efficient navigation in vortical flow fields
Efficient point-to-point navigation in the presence of a background flow field is important for robotic applications such as ocean surveying. In such applications, robots may only have knowledge of their immediate surroundings or be faced with time-varying currents, which limits the use of optimal c...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8654940/ https://www.ncbi.nlm.nih.gov/pubmed/34880221 http://dx.doi.org/10.1038/s41467-021-27015-y |
_version_ | 1784611975044530176 |
---|---|
author | Gunnarson, Peter Mandralis, Ioannis Novati, Guido Koumoutsakos, Petros Dabiri, John O. |
author_facet | Gunnarson, Peter Mandralis, Ioannis Novati, Guido Koumoutsakos, Petros Dabiri, John O. |
author_sort | Gunnarson, Peter |
collection | PubMed |
description | Efficient point-to-point navigation in the presence of a background flow field is important for robotic applications such as ocean surveying. In such applications, robots may only have knowledge of their immediate surroundings or be faced with time-varying currents, which limits the use of optimal control techniques. Here, we apply a recently introduced Reinforcement Learning algorithm to discover time-efficient navigation policies to steer a fixed-speed swimmer through unsteady two-dimensional flow fields. The algorithm entails inputting environmental cues into a deep neural network that determines the swimmer’s actions, and deploying Remember and Forget Experience Replay. We find that the resulting swimmers successfully exploit the background flow to reach the target, but that this success depends on the sensed environmental cue. Surprisingly, a velocity sensing approach significantly outperformed a bio-mimetic vorticity sensing approach, and achieved a near 100% success rate in reaching the target locations while approaching the time-efficiency of optimal navigation trajectories. |
format | Online Article Text |
id | pubmed-8654940 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-86549402021-12-27 Learning efficient navigation in vortical flow fields Gunnarson, Peter Mandralis, Ioannis Novati, Guido Koumoutsakos, Petros Dabiri, John O. Nat Commun Article Efficient point-to-point navigation in the presence of a background flow field is important for robotic applications such as ocean surveying. In such applications, robots may only have knowledge of their immediate surroundings or be faced with time-varying currents, which limits the use of optimal control techniques. Here, we apply a recently introduced Reinforcement Learning algorithm to discover time-efficient navigation policies to steer a fixed-speed swimmer through unsteady two-dimensional flow fields. The algorithm entails inputting environmental cues into a deep neural network that determines the swimmer’s actions, and deploying Remember and Forget Experience Replay. We find that the resulting swimmers successfully exploit the background flow to reach the target, but that this success depends on the sensed environmental cue. Surprisingly, a velocity sensing approach significantly outperformed a bio-mimetic vorticity sensing approach, and achieved a near 100% success rate in reaching the target locations while approaching the time-efficiency of optimal navigation trajectories. Nature Publishing Group UK 2021-12-08 /pmc/articles/PMC8654940/ /pubmed/34880221 http://dx.doi.org/10.1038/s41467-021-27015-y Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Gunnarson, Peter Mandralis, Ioannis Novati, Guido Koumoutsakos, Petros Dabiri, John O. Learning efficient navigation in vortical flow fields |
title | Learning efficient navigation in vortical flow fields |
title_full | Learning efficient navigation in vortical flow fields |
title_fullStr | Learning efficient navigation in vortical flow fields |
title_full_unstemmed | Learning efficient navigation in vortical flow fields |
title_short | Learning efficient navigation in vortical flow fields |
title_sort | learning efficient navigation in vortical flow fields |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8654940/ https://www.ncbi.nlm.nih.gov/pubmed/34880221 http://dx.doi.org/10.1038/s41467-021-27015-y |
work_keys_str_mv | AT gunnarsonpeter learningefficientnavigationinvorticalflowfields AT mandralisioannis learningefficientnavigationinvorticalflowfields AT novatiguido learningefficientnavigationinvorticalflowfields AT koumoutsakospetros learningefficientnavigationinvorticalflowfields AT dabirijohno learningefficientnavigationinvorticalflowfields |