Cargando…

Coverage Path Planning Using Reinforcement Learning-Based TSP for hTetran—A Polyabolo-Inspired Self-Reconfigurable Tiling Robot

One of the critical challenges in deploying the cleaning robots is the completion of covering the entire area. Current tiling robots for area coverage have fixed forms and are limited to cleaning only certain areas. The reconfigurable system is the creative answer to such an optimal coverage problem...

Descripción completa

Detalles Bibliográficos
Autores principales:	Le, Anh Vu, Veerajagadheswar, Prabakaran, Thiha Kyaw, Phone, Elara, Mohan Rajesh, Nhan, Nguyen Huu Khanh
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8067765/ https://www.ncbi.nlm.nih.gov/pubmed/33916995 http://dx.doi.org/10.3390/s21082577

_version_	1783682879239225344
author	Le, Anh Vu Veerajagadheswar, Prabakaran Thiha Kyaw, Phone Elara, Mohan Rajesh Nhan, Nguyen Huu Khanh
author_facet	Le, Anh Vu Veerajagadheswar, Prabakaran Thiha Kyaw, Phone Elara, Mohan Rajesh Nhan, Nguyen Huu Khanh
author_sort	Le, Anh Vu
collection	PubMed
description	One of the critical challenges in deploying the cleaning robots is the completion of covering the entire area. Current tiling robots for area coverage have fixed forms and are limited to cleaning only certain areas. The reconfigurable system is the creative answer to such an optimal coverage problem. The tiling robot’s goal enables the complete coverage of the entire area by reconfiguring to different shapes according to the area’s needs. In the particular sequencing of navigation, it is essential to have a structure that allows the robot to extend the coverage range while saving energy usage during navigation. This implies that the robot is able to cover larger areas entirely with the least required actions. This paper presents a complete path planning (CPP) for hTetran, a polyabolo tiled robot, based on a TSP-based reinforcement learning optimization. This structure simultaneously produces robot shapes and sequential trajectories whilst maximizing the reward of the trained reinforcement learning (RL) model within the predefined polyabolo-based tileset. To this end, a reinforcement learning-based travel sales problem (TSP) with proximal policy optimization (PPO) algorithm was trained using the complementary learning computation of the TSP sequencing. The reconstructive results of the proposed RL-TSP-based CPP for hTetran were compared in terms of energy and time spent with the conventional tiled hypothetical models that incorporate TSP solved through an evolutionary based ant colony optimization (ACO) approach. The CPP demonstrates an ability to generate an ideal Pareto optima trajectory that enhances the robot’s navigation inside the real environment with the least energy and time spent in the company of conventional techniques.
format	Online Article Text
id	pubmed-8067765
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-80677652021-04-25 Coverage Path Planning Using Reinforcement Learning-Based TSP for hTetran—A Polyabolo-Inspired Self-Reconfigurable Tiling Robot Le, Anh Vu Veerajagadheswar, Prabakaran Thiha Kyaw, Phone Elara, Mohan Rajesh Nhan, Nguyen Huu Khanh Sensors (Basel) Article One of the critical challenges in deploying the cleaning robots is the completion of covering the entire area. Current tiling robots for area coverage have fixed forms and are limited to cleaning only certain areas. The reconfigurable system is the creative answer to such an optimal coverage problem. The tiling robot’s goal enables the complete coverage of the entire area by reconfiguring to different shapes according to the area’s needs. In the particular sequencing of navigation, it is essential to have a structure that allows the robot to extend the coverage range while saving energy usage during navigation. This implies that the robot is able to cover larger areas entirely with the least required actions. This paper presents a complete path planning (CPP) for hTetran, a polyabolo tiled robot, based on a TSP-based reinforcement learning optimization. This structure simultaneously produces robot shapes and sequential trajectories whilst maximizing the reward of the trained reinforcement learning (RL) model within the predefined polyabolo-based tileset. To this end, a reinforcement learning-based travel sales problem (TSP) with proximal policy optimization (PPO) algorithm was trained using the complementary learning computation of the TSP sequencing. The reconstructive results of the proposed RL-TSP-based CPP for hTetran were compared in terms of energy and time spent with the conventional tiled hypothetical models that incorporate TSP solved through an evolutionary based ant colony optimization (ACO) approach. The CPP demonstrates an ability to generate an ideal Pareto optima trajectory that enhances the robot’s navigation inside the real environment with the least energy and time spent in the company of conventional techniques. MDPI 2021-04-07 /pmc/articles/PMC8067765/ /pubmed/33916995 http://dx.doi.org/10.3390/s21082577 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Le, Anh Vu Veerajagadheswar, Prabakaran Thiha Kyaw, Phone Elara, Mohan Rajesh Nhan, Nguyen Huu Khanh Coverage Path Planning Using Reinforcement Learning-Based TSP for hTetran—A Polyabolo-Inspired Self-Reconfigurable Tiling Robot
title	Coverage Path Planning Using Reinforcement Learning-Based TSP for hTetran—A Polyabolo-Inspired Self-Reconfigurable Tiling Robot
title_full	Coverage Path Planning Using Reinforcement Learning-Based TSP for hTetran—A Polyabolo-Inspired Self-Reconfigurable Tiling Robot
title_fullStr	Coverage Path Planning Using Reinforcement Learning-Based TSP for hTetran—A Polyabolo-Inspired Self-Reconfigurable Tiling Robot
title_full_unstemmed	Coverage Path Planning Using Reinforcement Learning-Based TSP for hTetran—A Polyabolo-Inspired Self-Reconfigurable Tiling Robot
title_short	Coverage Path Planning Using Reinforcement Learning-Based TSP for hTetran—A Polyabolo-Inspired Self-Reconfigurable Tiling Robot
title_sort	coverage path planning using reinforcement learning-based tsp for htetran—a polyabolo-inspired self-reconfigurable tiling robot
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8067765/ https://www.ncbi.nlm.nih.gov/pubmed/33916995 http://dx.doi.org/10.3390/s21082577
work_keys_str_mv	AT leanhvu coveragepathplanningusingreinforcementlearningbasedtspforhtetranapolyaboloinspiredselfreconfigurabletilingrobot AT veerajagadheswarprabakaran coveragepathplanningusingreinforcementlearningbasedtspforhtetranapolyaboloinspiredselfreconfigurabletilingrobot AT thihakyawphone coveragepathplanningusingreinforcementlearningbasedtspforhtetranapolyaboloinspiredselfreconfigurabletilingrobot AT elaramohanrajesh coveragepathplanningusingreinforcementlearningbasedtspforhtetranapolyaboloinspiredselfreconfigurabletilingrobot AT nhannguyenhuukhanh coveragepathplanningusingreinforcementlearningbasedtspforhtetranapolyaboloinspiredselfreconfigurabletilingrobot

Coverage Path Planning Using Reinforcement Learning-Based TSP for hTetran—A Polyabolo-Inspired Self-Reconfigurable Tiling Robot

Ejemplares similares