Cargando…

Deep Reinforcement Learning for Joint Trajectory Planning, Transmission Scheduling, and Access Control in UAV-Assisted Wireless Sensor Networks

Unmanned aerial vehicles (UAVs) can be used to relay sensing information and computational workloads from ground users (GUs) to a remote base station (RBS) for further processing. In this paper, we employ multiple UAVs to assist with the collection of sensing information in a terrestrial wireless se...

Descripción completa

Detalles Bibliográficos
Autores principales: Luo, Xiaoling, Chen, Che, Zeng, Chunnian, Li, Chengtao, Xu, Jing, Gong, Shimin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10221232/
https://www.ncbi.nlm.nih.gov/pubmed/37430608
http://dx.doi.org/10.3390/s23104691
_version_ 1785049406994644992
author Luo, Xiaoling
Chen, Che
Zeng, Chunnian
Li, Chengtao
Xu, Jing
Gong, Shimin
author_facet Luo, Xiaoling
Chen, Che
Zeng, Chunnian
Li, Chengtao
Xu, Jing
Gong, Shimin
author_sort Luo, Xiaoling
collection PubMed
description Unmanned aerial vehicles (UAVs) can be used to relay sensing information and computational workloads from ground users (GUs) to a remote base station (RBS) for further processing. In this paper, we employ multiple UAVs to assist with the collection of sensing information in a terrestrial wireless sensor network. All of the information collected by the UAVs can be forwarded to the RBS. We aim to improve the energy efficiency for sensing-data collection and transmission by optimizing UAV trajectory, scheduling, and access-control strategies. Considering a time-slotted frame structure, UAV flight, sensing, and information-forwarding sub-slots are confined to each time slot. This motivates the trade-off study between UAV access-control and trajectory planning. More sensing data in one time slot will take up more UAV buffer space and require a longer transmission time for information forwarding. We solve this problem by a multi-agent deep reinforcement learning approach that takes into consideration a dynamic network environment with uncertain information about the GU spatial distribution and traffic demands. We further devise a hierarchical learning framework with reduced action and state spaces to improve the learning efficiency by exploiting the distributed structure of the UAV-assisted wireless sensor network. Simulation results show that UAV trajectory planning with access control can significantly improve UAV energy efficiency. The hierarchical learning method is more stable in learning and can also achieve higher sensing performance.
format Online
Article
Text
id pubmed-10221232
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-102212322023-05-28 Deep Reinforcement Learning for Joint Trajectory Planning, Transmission Scheduling, and Access Control in UAV-Assisted Wireless Sensor Networks Luo, Xiaoling Chen, Che Zeng, Chunnian Li, Chengtao Xu, Jing Gong, Shimin Sensors (Basel) Article Unmanned aerial vehicles (UAVs) can be used to relay sensing information and computational workloads from ground users (GUs) to a remote base station (RBS) for further processing. In this paper, we employ multiple UAVs to assist with the collection of sensing information in a terrestrial wireless sensor network. All of the information collected by the UAVs can be forwarded to the RBS. We aim to improve the energy efficiency for sensing-data collection and transmission by optimizing UAV trajectory, scheduling, and access-control strategies. Considering a time-slotted frame structure, UAV flight, sensing, and information-forwarding sub-slots are confined to each time slot. This motivates the trade-off study between UAV access-control and trajectory planning. More sensing data in one time slot will take up more UAV buffer space and require a longer transmission time for information forwarding. We solve this problem by a multi-agent deep reinforcement learning approach that takes into consideration a dynamic network environment with uncertain information about the GU spatial distribution and traffic demands. We further devise a hierarchical learning framework with reduced action and state spaces to improve the learning efficiency by exploiting the distributed structure of the UAV-assisted wireless sensor network. Simulation results show that UAV trajectory planning with access control can significantly improve UAV energy efficiency. The hierarchical learning method is more stable in learning and can also achieve higher sensing performance. MDPI 2023-05-12 /pmc/articles/PMC10221232/ /pubmed/37430608 http://dx.doi.org/10.3390/s23104691 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Luo, Xiaoling
Chen, Che
Zeng, Chunnian
Li, Chengtao
Xu, Jing
Gong, Shimin
Deep Reinforcement Learning for Joint Trajectory Planning, Transmission Scheduling, and Access Control in UAV-Assisted Wireless Sensor Networks
title Deep Reinforcement Learning for Joint Trajectory Planning, Transmission Scheduling, and Access Control in UAV-Assisted Wireless Sensor Networks
title_full Deep Reinforcement Learning for Joint Trajectory Planning, Transmission Scheduling, and Access Control in UAV-Assisted Wireless Sensor Networks
title_fullStr Deep Reinforcement Learning for Joint Trajectory Planning, Transmission Scheduling, and Access Control in UAV-Assisted Wireless Sensor Networks
title_full_unstemmed Deep Reinforcement Learning for Joint Trajectory Planning, Transmission Scheduling, and Access Control in UAV-Assisted Wireless Sensor Networks
title_short Deep Reinforcement Learning for Joint Trajectory Planning, Transmission Scheduling, and Access Control in UAV-Assisted Wireless Sensor Networks
title_sort deep reinforcement learning for joint trajectory planning, transmission scheduling, and access control in uav-assisted wireless sensor networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10221232/
https://www.ncbi.nlm.nih.gov/pubmed/37430608
http://dx.doi.org/10.3390/s23104691
work_keys_str_mv AT luoxiaoling deepreinforcementlearningforjointtrajectoryplanningtransmissionschedulingandaccesscontrolinuavassistedwirelesssensornetworks
AT chenche deepreinforcementlearningforjointtrajectoryplanningtransmissionschedulingandaccesscontrolinuavassistedwirelesssensornetworks
AT zengchunnian deepreinforcementlearningforjointtrajectoryplanningtransmissionschedulingandaccesscontrolinuavassistedwirelesssensornetworks
AT lichengtao deepreinforcementlearningforjointtrajectoryplanningtransmissionschedulingandaccesscontrolinuavassistedwirelesssensornetworks
AT xujing deepreinforcementlearningforjointtrajectoryplanningtransmissionschedulingandaccesscontrolinuavassistedwirelesssensornetworks
AT gongshimin deepreinforcementlearningforjointtrajectoryplanningtransmissionschedulingandaccesscontrolinuavassistedwirelesssensornetworks