Cargando…

Joint Optimization of Bandwidth and Power Allocation in Uplink Systems with Deep Reinforcement Learning

Wireless resource utilizations are the focus of future communication, which are used constantly to alleviate the communication quality problem caused by the explosive interference with increasing users, especially the inter-cell interference in the multi-cell multi-user systems. To tackle this inter...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhang, Chongli, Lv, Tiejun, Huang, Pingmu, Lin, Zhipeng, Zeng, Jie, Ren, Yuan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10422299/ https://www.ncbi.nlm.nih.gov/pubmed/37571605 http://dx.doi.org/10.3390/s23156822

_version_	1785089174778413056
author	Zhang, Chongli Lv, Tiejun Huang, Pingmu Lin, Zhipeng Zeng, Jie Ren, Yuan
author_facet	Zhang, Chongli Lv, Tiejun Huang, Pingmu Lin, Zhipeng Zeng, Jie Ren, Yuan
author_sort	Zhang, Chongli
collection	PubMed
description	Wireless resource utilizations are the focus of future communication, which are used constantly to alleviate the communication quality problem caused by the explosive interference with increasing users, especially the inter-cell interference in the multi-cell multi-user systems. To tackle this interference and improve the resource utilization rate, we proposed a joint-priority-based reinforcement learning (JPRL) approach to jointly optimize the bandwidth and transmit power allocation. This method aims to maximize the average throughput of the system while suppressing the co-channel interference and guaranteeing the quality of service (QoS) constraint. Specifically, we de-coupled the joint problem into two sub-problems, i.e., the bandwidth assignment and power allocation sub-problems. The multi-agent double deep Q network (MADDQN) was developed to solve the bandwidth allocation sub-problem for each user and the prioritized multi-agent deep deterministic policy gradient (P-MADDPG) algorithm by deploying a prioritized replay buffer that is designed to handle the transmit power allocation sub-problem. Numerical results show that the proposed JPRL method could accelerate model training and outperform the alternative methods in terms of throughput. For example, the average throughput was approximately 10.4–15.5% better than the homogeneous-learning-based benchmarks, and about 17.3% higher than the genetic algorithm.
format	Online Article Text
id	pubmed-10422299
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-104222992023-08-13 Joint Optimization of Bandwidth and Power Allocation in Uplink Systems with Deep Reinforcement Learning Zhang, Chongli Lv, Tiejun Huang, Pingmu Lin, Zhipeng Zeng, Jie Ren, Yuan Sensors (Basel) Article Wireless resource utilizations are the focus of future communication, which are used constantly to alleviate the communication quality problem caused by the explosive interference with increasing users, especially the inter-cell interference in the multi-cell multi-user systems. To tackle this interference and improve the resource utilization rate, we proposed a joint-priority-based reinforcement learning (JPRL) approach to jointly optimize the bandwidth and transmit power allocation. This method aims to maximize the average throughput of the system while suppressing the co-channel interference and guaranteeing the quality of service (QoS) constraint. Specifically, we de-coupled the joint problem into two sub-problems, i.e., the bandwidth assignment and power allocation sub-problems. The multi-agent double deep Q network (MADDQN) was developed to solve the bandwidth allocation sub-problem for each user and the prioritized multi-agent deep deterministic policy gradient (P-MADDPG) algorithm by deploying a prioritized replay buffer that is designed to handle the transmit power allocation sub-problem. Numerical results show that the proposed JPRL method could accelerate model training and outperform the alternative methods in terms of throughput. For example, the average throughput was approximately 10.4–15.5% better than the homogeneous-learning-based benchmarks, and about 17.3% higher than the genetic algorithm. MDPI 2023-07-31 /pmc/articles/PMC10422299/ /pubmed/37571605 http://dx.doi.org/10.3390/s23156822 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Zhang, Chongli Lv, Tiejun Huang, Pingmu Lin, Zhipeng Zeng, Jie Ren, Yuan Joint Optimization of Bandwidth and Power Allocation in Uplink Systems with Deep Reinforcement Learning
title	Joint Optimization of Bandwidth and Power Allocation in Uplink Systems with Deep Reinforcement Learning
title_full	Joint Optimization of Bandwidth and Power Allocation in Uplink Systems with Deep Reinforcement Learning
title_fullStr	Joint Optimization of Bandwidth and Power Allocation in Uplink Systems with Deep Reinforcement Learning
title_full_unstemmed	Joint Optimization of Bandwidth and Power Allocation in Uplink Systems with Deep Reinforcement Learning
title_short	Joint Optimization of Bandwidth and Power Allocation in Uplink Systems with Deep Reinforcement Learning
title_sort	joint optimization of bandwidth and power allocation in uplink systems with deep reinforcement learning
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10422299/ https://www.ncbi.nlm.nih.gov/pubmed/37571605 http://dx.doi.org/10.3390/s23156822
work_keys_str_mv	AT zhangchongli jointoptimizationofbandwidthandpowerallocationinuplinksystemswithdeepreinforcementlearning AT lvtiejun jointoptimizationofbandwidthandpowerallocationinuplinksystemswithdeepreinforcementlearning AT huangpingmu jointoptimizationofbandwidthandpowerallocationinuplinksystemswithdeepreinforcementlearning AT linzhipeng jointoptimizationofbandwidthandpowerallocationinuplinksystemswithdeepreinforcementlearning AT zengjie jointoptimizationofbandwidthandpowerallocationinuplinksystemswithdeepreinforcementlearning AT renyuan jointoptimizationofbandwidthandpowerallocationinuplinksystemswithdeepreinforcementlearning

Joint Optimization of Bandwidth and Power Allocation in Uplink Systems with Deep Reinforcement Learning

Ejemplares similares