Cargando…

Multi-Agent Reinforcement Learning for Joint Cooperative Spectrum Sensing and Channel Access in Cognitive UAV Networks

This paper studies the problem of distributed spectrum/channel access for cognitive radio-enabled unmanned aerial vehicles (CUAVs) that overlay upon primary channels. Under the framework of cooperative spectrum sensing and opportunistic transmission, a one-shot optimization problem for channel alloc...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Weiheng, Yu, Wanxin, Wang, Wenbo, Huang, Tiancong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8879711/
https://www.ncbi.nlm.nih.gov/pubmed/35214553
http://dx.doi.org/10.3390/s22041651
_version_ 1784658960315318272
author Jiang, Weiheng
Yu, Wanxin
Wang, Wenbo
Huang, Tiancong
author_facet Jiang, Weiheng
Yu, Wanxin
Wang, Wenbo
Huang, Tiancong
author_sort Jiang, Weiheng
collection PubMed
description This paper studies the problem of distributed spectrum/channel access for cognitive radio-enabled unmanned aerial vehicles (CUAVs) that overlay upon primary channels. Under the framework of cooperative spectrum sensing and opportunistic transmission, a one-shot optimization problem for channel allocation, aiming to maximize the expected cumulative weighted reward of multiple CUAVs, is formulated. To handle the uncertainty due to the lack of prior knowledge about the primary user activities as well as the lack of the channel-access coordinator, the original problem is cast into a competition and cooperation hybrid multi-agent reinforcement learning (CCH-MARL) problem in the framework of Markov game (MG). Then, a value-iteration-based RL algorithm, which features upper confidence bound-Hoeffding (UCB-H) strategy searching, is proposed by treating each CUAV as an independent learner (IL). To address the curse of dimensionality, the UCB-H strategy is further extended with a double deep Q-network (DDQN). Numerical simulations show that the proposed algorithms are able to efficiently converge to stable strategies, and significantly improve the network performance when compared with the benchmark algorithms such as the vanilla Q-learning and DDQN algorithms.
format Online
Article
Text
id pubmed-8879711
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-88797112022-02-26 Multi-Agent Reinforcement Learning for Joint Cooperative Spectrum Sensing and Channel Access in Cognitive UAV Networks Jiang, Weiheng Yu, Wanxin Wang, Wenbo Huang, Tiancong Sensors (Basel) Article This paper studies the problem of distributed spectrum/channel access for cognitive radio-enabled unmanned aerial vehicles (CUAVs) that overlay upon primary channels. Under the framework of cooperative spectrum sensing and opportunistic transmission, a one-shot optimization problem for channel allocation, aiming to maximize the expected cumulative weighted reward of multiple CUAVs, is formulated. To handle the uncertainty due to the lack of prior knowledge about the primary user activities as well as the lack of the channel-access coordinator, the original problem is cast into a competition and cooperation hybrid multi-agent reinforcement learning (CCH-MARL) problem in the framework of Markov game (MG). Then, a value-iteration-based RL algorithm, which features upper confidence bound-Hoeffding (UCB-H) strategy searching, is proposed by treating each CUAV as an independent learner (IL). To address the curse of dimensionality, the UCB-H strategy is further extended with a double deep Q-network (DDQN). Numerical simulations show that the proposed algorithms are able to efficiently converge to stable strategies, and significantly improve the network performance when compared with the benchmark algorithms such as the vanilla Q-learning and DDQN algorithms. MDPI 2022-02-20 /pmc/articles/PMC8879711/ /pubmed/35214553 http://dx.doi.org/10.3390/s22041651 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Jiang, Weiheng
Yu, Wanxin
Wang, Wenbo
Huang, Tiancong
Multi-Agent Reinforcement Learning for Joint Cooperative Spectrum Sensing and Channel Access in Cognitive UAV Networks
title Multi-Agent Reinforcement Learning for Joint Cooperative Spectrum Sensing and Channel Access in Cognitive UAV Networks
title_full Multi-Agent Reinforcement Learning for Joint Cooperative Spectrum Sensing and Channel Access in Cognitive UAV Networks
title_fullStr Multi-Agent Reinforcement Learning for Joint Cooperative Spectrum Sensing and Channel Access in Cognitive UAV Networks
title_full_unstemmed Multi-Agent Reinforcement Learning for Joint Cooperative Spectrum Sensing and Channel Access in Cognitive UAV Networks
title_short Multi-Agent Reinforcement Learning for Joint Cooperative Spectrum Sensing and Channel Access in Cognitive UAV Networks
title_sort multi-agent reinforcement learning for joint cooperative spectrum sensing and channel access in cognitive uav networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8879711/
https://www.ncbi.nlm.nih.gov/pubmed/35214553
http://dx.doi.org/10.3390/s22041651
work_keys_str_mv AT jiangweiheng multiagentreinforcementlearningforjointcooperativespectrumsensingandchannelaccessincognitiveuavnetworks
AT yuwanxin multiagentreinforcementlearningforjointcooperativespectrumsensingandchannelaccessincognitiveuavnetworks
AT wangwenbo multiagentreinforcementlearningforjointcooperativespectrumsensingandchannelaccessincognitiveuavnetworks
AT huangtiancong multiagentreinforcementlearningforjointcooperativespectrumsensingandchannelaccessincognitiveuavnetworks