Cargando…

Improved Q-Learning Algorithm Based on Approximate State Matching in Agricultural Plant Protection Environment

An Unmanned Aerial Vehicle (UAV) can greatly reduce manpower in the agricultural plant protection such as watering, sowing, and pesticide spraying. It is essential to develop a Decision-making Support System (DSS) for UAVs to help them choose the correct action in states according to the policy. In...

Descripción completa

Detalles Bibliográficos
Autores principales:	Sun, Fengjie, Wang, Xianchang, Zhang, Rui
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8230613/ https://www.ncbi.nlm.nih.gov/pubmed/34207944 http://dx.doi.org/10.3390/e23060737

_version_	1783713252064100352
author	Sun, Fengjie Wang, Xianchang Zhang, Rui
author_facet	Sun, Fengjie Wang, Xianchang Zhang, Rui
author_sort	Sun, Fengjie
collection	PubMed
description	An Unmanned Aerial Vehicle (UAV) can greatly reduce manpower in the agricultural plant protection such as watering, sowing, and pesticide spraying. It is essential to develop a Decision-making Support System (DSS) for UAVs to help them choose the correct action in states according to the policy. In an unknown environment, the method of formulating rules for UAVs to help them choose actions is not applicable, and it is a feasible solution to obtain the optimal policy through reinforcement learning. However, experiments show that the existing reinforcement learning algorithms cannot get the optimal policy for a UAV in the agricultural plant protection environment. In this work we propose an improved Q-learning algorithm based on similar state matching, and we prove theoretically that there has a greater probability for UAV choosing the optimal action according to the policy learned by the algorithm we proposed than the classic Q-learning algorithm in the agricultural plant protection environment. This proposed algorithm is implemented and tested on datasets that are evenly distributed based on real UAV parameters and real farm information. The performance evaluation of the algorithm is discussed in detail. Experimental results show that the algorithm we proposed can efficiently learn the optimal policy for UAVs in the agricultural plant protection environment.
format	Online Article Text
id	pubmed-8230613
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-82306132021-06-26 Improved Q-Learning Algorithm Based on Approximate State Matching in Agricultural Plant Protection Environment Sun, Fengjie Wang, Xianchang Zhang, Rui Entropy (Basel) Article An Unmanned Aerial Vehicle (UAV) can greatly reduce manpower in the agricultural plant protection such as watering, sowing, and pesticide spraying. It is essential to develop a Decision-making Support System (DSS) for UAVs to help them choose the correct action in states according to the policy. In an unknown environment, the method of formulating rules for UAVs to help them choose actions is not applicable, and it is a feasible solution to obtain the optimal policy through reinforcement learning. However, experiments show that the existing reinforcement learning algorithms cannot get the optimal policy for a UAV in the agricultural plant protection environment. In this work we propose an improved Q-learning algorithm based on similar state matching, and we prove theoretically that there has a greater probability for UAV choosing the optimal action according to the policy learned by the algorithm we proposed than the classic Q-learning algorithm in the agricultural plant protection environment. This proposed algorithm is implemented and tested on datasets that are evenly distributed based on real UAV parameters and real farm information. The performance evaluation of the algorithm is discussed in detail. Experimental results show that the algorithm we proposed can efficiently learn the optimal policy for UAVs in the agricultural plant protection environment. MDPI 2021-06-11 /pmc/articles/PMC8230613/ /pubmed/34207944 http://dx.doi.org/10.3390/e23060737 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Sun, Fengjie Wang, Xianchang Zhang, Rui Improved Q-Learning Algorithm Based on Approximate State Matching in Agricultural Plant Protection Environment
title	Improved Q-Learning Algorithm Based on Approximate State Matching in Agricultural Plant Protection Environment
title_full	Improved Q-Learning Algorithm Based on Approximate State Matching in Agricultural Plant Protection Environment
title_fullStr	Improved Q-Learning Algorithm Based on Approximate State Matching in Agricultural Plant Protection Environment
title_full_unstemmed	Improved Q-Learning Algorithm Based on Approximate State Matching in Agricultural Plant Protection Environment
title_short	Improved Q-Learning Algorithm Based on Approximate State Matching in Agricultural Plant Protection Environment
title_sort	improved q-learning algorithm based on approximate state matching in agricultural plant protection environment
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8230613/ https://www.ncbi.nlm.nih.gov/pubmed/34207944 http://dx.doi.org/10.3390/e23060737
work_keys_str_mv	AT sunfengjie improvedqlearningalgorithmbasedonapproximatestatematchinginagriculturalplantprotectionenvironment AT wangxianchang improvedqlearningalgorithmbasedonapproximatestatematchinginagriculturalplantprotectionenvironment AT zhangrui improvedqlearningalgorithmbasedonapproximatestatematchinginagriculturalplantprotectionenvironment

Improved Q-Learning Algorithm Based on Approximate State Matching in Agricultural Plant Protection Environment

Ejemplares similares