Cargando…

Simultaneous Control and Guidance of an AUV Based on Soft Actor–Critic

The marine environment is a hostile setting for robotics. It is strongly unstructured, uncertain, and includes many external disturbances that cannot be easily predicted or modeled. In this work, we attempt to control an autonomous underwater vehicle (AUV) to perform a waypoint tracking task, using...

Descripción completa

Detalles Bibliográficos
Autores principales:	Sola, Yoann, Le Chenadec, Gilles, Clement, Benoit
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9416335/ https://www.ncbi.nlm.nih.gov/pubmed/36015832 http://dx.doi.org/10.3390/s22166072

_version_	1784776454976831488
author	Sola, Yoann Le Chenadec, Gilles Clement, Benoit
author_facet	Sola, Yoann Le Chenadec, Gilles Clement, Benoit
author_sort	Sola, Yoann
collection	PubMed
description	The marine environment is a hostile setting for robotics. It is strongly unstructured, uncertain, and includes many external disturbances that cannot be easily predicted or modeled. In this work, we attempt to control an autonomous underwater vehicle (AUV) to perform a waypoint tracking task, using a machine learning-based controller. There has been great progress in machine learning (in many different domains) in recent years; in the subfield of deep reinforcement learning, several algorithms suitable for the continuous control of dynamical systems have been designed. We implemented the soft actor–critic (SAC) algorithm, an entropy-regularized deep reinforcement learning algorithm that allows fulfilling a learning task and encourages the exploration of the environment simultaneously. We compared a SAC-based controller with a proportional integral derivative (PID) controller on a waypoint tracking task using specific performance metrics. All tests were simulated via the UUV simulator. We applied these two controllers to the RexROV 2, a six degrees of freedom cube-shaped remotely operated underwater Vehicle (ROV) converted in an AUV. We propose several interesting contributions as a result of these tests, such as making the SAC control and guiding the AUV simultaneously, outperforming the PID controller in terms of energy saving, and reducing the amount of information needed by the SAC algorithm inputs. Moreover, our implementation of this controller allows facilitating the transfer towards real-world robots. The code corresponding to this work is available on GitHub.
format	Online Article Text
id	pubmed-9416335
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-94163352022-08-27 Simultaneous Control and Guidance of an AUV Based on Soft Actor–Critic Sola, Yoann Le Chenadec, Gilles Clement, Benoit Sensors (Basel) Article The marine environment is a hostile setting for robotics. It is strongly unstructured, uncertain, and includes many external disturbances that cannot be easily predicted or modeled. In this work, we attempt to control an autonomous underwater vehicle (AUV) to perform a waypoint tracking task, using a machine learning-based controller. There has been great progress in machine learning (in many different domains) in recent years; in the subfield of deep reinforcement learning, several algorithms suitable for the continuous control of dynamical systems have been designed. We implemented the soft actor–critic (SAC) algorithm, an entropy-regularized deep reinforcement learning algorithm that allows fulfilling a learning task and encourages the exploration of the environment simultaneously. We compared a SAC-based controller with a proportional integral derivative (PID) controller on a waypoint tracking task using specific performance metrics. All tests were simulated via the UUV simulator. We applied these two controllers to the RexROV 2, a six degrees of freedom cube-shaped remotely operated underwater Vehicle (ROV) converted in an AUV. We propose several interesting contributions as a result of these tests, such as making the SAC control and guiding the AUV simultaneously, outperforming the PID controller in terms of energy saving, and reducing the amount of information needed by the SAC algorithm inputs. Moreover, our implementation of this controller allows facilitating the transfer towards real-world robots. The code corresponding to this work is available on GitHub. MDPI 2022-08-14 /pmc/articles/PMC9416335/ /pubmed/36015832 http://dx.doi.org/10.3390/s22166072 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Sola, Yoann Le Chenadec, Gilles Clement, Benoit Simultaneous Control and Guidance of an AUV Based on Soft Actor–Critic
title	Simultaneous Control and Guidance of an AUV Based on Soft Actor–Critic
title_full	Simultaneous Control and Guidance of an AUV Based on Soft Actor–Critic
title_fullStr	Simultaneous Control and Guidance of an AUV Based on Soft Actor–Critic
title_full_unstemmed	Simultaneous Control and Guidance of an AUV Based on Soft Actor–Critic
title_short	Simultaneous Control and Guidance of an AUV Based on Soft Actor–Critic
title_sort	simultaneous control and guidance of an auv based on soft actor–critic
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9416335/ https://www.ncbi.nlm.nih.gov/pubmed/36015832 http://dx.doi.org/10.3390/s22166072
work_keys_str_mv	AT solayoann simultaneouscontrolandguidanceofanauvbasedonsoftactorcritic AT lechenadecgilles simultaneouscontrolandguidanceofanauvbasedonsoftactorcritic AT clementbenoit simultaneouscontrolandguidanceofanauvbasedonsoftactorcritic

Simultaneous Control and Guidance of an AUV Based on Soft Actor–Critic

Ejemplares similares