Cargando…
Deep Reinforcement Learning for Joint Trajectory Planning, Transmission Scheduling, and Access Control in UAV-Assisted Wireless Sensor Networks
Unmanned aerial vehicles (UAVs) can be used to relay sensing information and computational workloads from ground users (GUs) to a remote base station (RBS) for further processing. In this paper, we employ multiple UAVs to assist with the collection of sensing information in a terrestrial wireless se...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10221232/ https://www.ncbi.nlm.nih.gov/pubmed/37430608 http://dx.doi.org/10.3390/s23104691 |
_version_ | 1785049406994644992 |
---|---|
author | Luo, Xiaoling Chen, Che Zeng, Chunnian Li, Chengtao Xu, Jing Gong, Shimin |
author_facet | Luo, Xiaoling Chen, Che Zeng, Chunnian Li, Chengtao Xu, Jing Gong, Shimin |
author_sort | Luo, Xiaoling |
collection | PubMed |
description | Unmanned aerial vehicles (UAVs) can be used to relay sensing information and computational workloads from ground users (GUs) to a remote base station (RBS) for further processing. In this paper, we employ multiple UAVs to assist with the collection of sensing information in a terrestrial wireless sensor network. All of the information collected by the UAVs can be forwarded to the RBS. We aim to improve the energy efficiency for sensing-data collection and transmission by optimizing UAV trajectory, scheduling, and access-control strategies. Considering a time-slotted frame structure, UAV flight, sensing, and information-forwarding sub-slots are confined to each time slot. This motivates the trade-off study between UAV access-control and trajectory planning. More sensing data in one time slot will take up more UAV buffer space and require a longer transmission time for information forwarding. We solve this problem by a multi-agent deep reinforcement learning approach that takes into consideration a dynamic network environment with uncertain information about the GU spatial distribution and traffic demands. We further devise a hierarchical learning framework with reduced action and state spaces to improve the learning efficiency by exploiting the distributed structure of the UAV-assisted wireless sensor network. Simulation results show that UAV trajectory planning with access control can significantly improve UAV energy efficiency. The hierarchical learning method is more stable in learning and can also achieve higher sensing performance. |
format | Online Article Text |
id | pubmed-10221232 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-102212322023-05-28 Deep Reinforcement Learning for Joint Trajectory Planning, Transmission Scheduling, and Access Control in UAV-Assisted Wireless Sensor Networks Luo, Xiaoling Chen, Che Zeng, Chunnian Li, Chengtao Xu, Jing Gong, Shimin Sensors (Basel) Article Unmanned aerial vehicles (UAVs) can be used to relay sensing information and computational workloads from ground users (GUs) to a remote base station (RBS) for further processing. In this paper, we employ multiple UAVs to assist with the collection of sensing information in a terrestrial wireless sensor network. All of the information collected by the UAVs can be forwarded to the RBS. We aim to improve the energy efficiency for sensing-data collection and transmission by optimizing UAV trajectory, scheduling, and access-control strategies. Considering a time-slotted frame structure, UAV flight, sensing, and information-forwarding sub-slots are confined to each time slot. This motivates the trade-off study between UAV access-control and trajectory planning. More sensing data in one time slot will take up more UAV buffer space and require a longer transmission time for information forwarding. We solve this problem by a multi-agent deep reinforcement learning approach that takes into consideration a dynamic network environment with uncertain information about the GU spatial distribution and traffic demands. We further devise a hierarchical learning framework with reduced action and state spaces to improve the learning efficiency by exploiting the distributed structure of the UAV-assisted wireless sensor network. Simulation results show that UAV trajectory planning with access control can significantly improve UAV energy efficiency. The hierarchical learning method is more stable in learning and can also achieve higher sensing performance. MDPI 2023-05-12 /pmc/articles/PMC10221232/ /pubmed/37430608 http://dx.doi.org/10.3390/s23104691 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Luo, Xiaoling Chen, Che Zeng, Chunnian Li, Chengtao Xu, Jing Gong, Shimin Deep Reinforcement Learning for Joint Trajectory Planning, Transmission Scheduling, and Access Control in UAV-Assisted Wireless Sensor Networks |
title | Deep Reinforcement Learning for Joint Trajectory Planning, Transmission Scheduling, and Access Control in UAV-Assisted Wireless Sensor Networks |
title_full | Deep Reinforcement Learning for Joint Trajectory Planning, Transmission Scheduling, and Access Control in UAV-Assisted Wireless Sensor Networks |
title_fullStr | Deep Reinforcement Learning for Joint Trajectory Planning, Transmission Scheduling, and Access Control in UAV-Assisted Wireless Sensor Networks |
title_full_unstemmed | Deep Reinforcement Learning for Joint Trajectory Planning, Transmission Scheduling, and Access Control in UAV-Assisted Wireless Sensor Networks |
title_short | Deep Reinforcement Learning for Joint Trajectory Planning, Transmission Scheduling, and Access Control in UAV-Assisted Wireless Sensor Networks |
title_sort | deep reinforcement learning for joint trajectory planning, transmission scheduling, and access control in uav-assisted wireless sensor networks |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10221232/ https://www.ncbi.nlm.nih.gov/pubmed/37430608 http://dx.doi.org/10.3390/s23104691 |
work_keys_str_mv | AT luoxiaoling deepreinforcementlearningforjointtrajectoryplanningtransmissionschedulingandaccesscontrolinuavassistedwirelesssensornetworks AT chenche deepreinforcementlearningforjointtrajectoryplanningtransmissionschedulingandaccesscontrolinuavassistedwirelesssensornetworks AT zengchunnian deepreinforcementlearningforjointtrajectoryplanningtransmissionschedulingandaccesscontrolinuavassistedwirelesssensornetworks AT lichengtao deepreinforcementlearningforjointtrajectoryplanningtransmissionschedulingandaccesscontrolinuavassistedwirelesssensornetworks AT xujing deepreinforcementlearningforjointtrajectoryplanningtransmissionschedulingandaccesscontrolinuavassistedwirelesssensornetworks AT gongshimin deepreinforcementlearningforjointtrajectoryplanningtransmissionschedulingandaccesscontrolinuavassistedwirelesssensornetworks |