Cargando…

Constrained DRL for Energy Efficiency Optimization in RSMA-Based Integrated Satellite Terrestrial Network

To accommodate the requirements of extensive coverage and ubiquitous connectivity in 6G communications, satellite plays a more significant role in it. As users and devices explosively grow, new multiple access technologies are called for. Among the new candidates, rate splitting multiple access (RSM...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhang, Qingmiao, Zhu, Lidong, Chen, Yanyan, Jiang, Shan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10537614/ https://www.ncbi.nlm.nih.gov/pubmed/37765915 http://dx.doi.org/10.3390/s23187859

_version_	1785113141960507392
author	Zhang, Qingmiao Zhu, Lidong Chen, Yanyan Jiang, Shan
author_facet	Zhang, Qingmiao Zhu, Lidong Chen, Yanyan Jiang, Shan
author_sort	Zhang, Qingmiao
collection	PubMed
description	To accommodate the requirements of extensive coverage and ubiquitous connectivity in 6G communications, satellite plays a more significant role in it. As users and devices explosively grow, new multiple access technologies are called for. Among the new candidates, rate splitting multiple access (RSMA) shows great potential. Since satellites are power-limited, we investigate the energy-efficient resource allocation in the integrated satellite terrestrial network (ISTN)-adopting RSMA scheme in this paper. However, this non-convex problem is challenging to solve using conventional model-based methods. Because this optimization task has a quality of service (QoS) requirement and continuous action/state space, we propose to use constrained soft actor-critic (SAC) to tackle it. This policy-gradient algorithm incorporates the Lagrangian relaxation technique to convert the original constrained problem into a penalized unconstrained one. The reward is maximized while the requirements are satisfied. Moreover, the learning process is time-consuming and unnecessary when little changes in the network. So, an on–off mechanism is introduced to avoid this situation. By calculating the difference between the current state and the last one, the system will decide to learn a new action or take the last one. The simulation results show that the proposed algorithm can outperform other benchmark algorithms in terms of energy efficiency while satisfying the QoS constraint. In addition, the time consumption is lowered because of the on–off design.
format	Online Article Text
id	pubmed-10537614
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-105376142023-09-29 Constrained DRL for Energy Efficiency Optimization in RSMA-Based Integrated Satellite Terrestrial Network Zhang, Qingmiao Zhu, Lidong Chen, Yanyan Jiang, Shan Sensors (Basel) Article To accommodate the requirements of extensive coverage and ubiquitous connectivity in 6G communications, satellite plays a more significant role in it. As users and devices explosively grow, new multiple access technologies are called for. Among the new candidates, rate splitting multiple access (RSMA) shows great potential. Since satellites are power-limited, we investigate the energy-efficient resource allocation in the integrated satellite terrestrial network (ISTN)-adopting RSMA scheme in this paper. However, this non-convex problem is challenging to solve using conventional model-based methods. Because this optimization task has a quality of service (QoS) requirement and continuous action/state space, we propose to use constrained soft actor-critic (SAC) to tackle it. This policy-gradient algorithm incorporates the Lagrangian relaxation technique to convert the original constrained problem into a penalized unconstrained one. The reward is maximized while the requirements are satisfied. Moreover, the learning process is time-consuming and unnecessary when little changes in the network. So, an on–off mechanism is introduced to avoid this situation. By calculating the difference between the current state and the last one, the system will decide to learn a new action or take the last one. The simulation results show that the proposed algorithm can outperform other benchmark algorithms in terms of energy efficiency while satisfying the QoS constraint. In addition, the time consumption is lowered because of the on–off design. MDPI 2023-09-13 /pmc/articles/PMC10537614/ /pubmed/37765915 http://dx.doi.org/10.3390/s23187859 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Zhang, Qingmiao Zhu, Lidong Chen, Yanyan Jiang, Shan Constrained DRL for Energy Efficiency Optimization in RSMA-Based Integrated Satellite Terrestrial Network
title	Constrained DRL for Energy Efficiency Optimization in RSMA-Based Integrated Satellite Terrestrial Network
title_full	Constrained DRL for Energy Efficiency Optimization in RSMA-Based Integrated Satellite Terrestrial Network
title_fullStr	Constrained DRL for Energy Efficiency Optimization in RSMA-Based Integrated Satellite Terrestrial Network
title_full_unstemmed	Constrained DRL for Energy Efficiency Optimization in RSMA-Based Integrated Satellite Terrestrial Network
title_short	Constrained DRL for Energy Efficiency Optimization in RSMA-Based Integrated Satellite Terrestrial Network
title_sort	constrained drl for energy efficiency optimization in rsma-based integrated satellite terrestrial network
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10537614/ https://www.ncbi.nlm.nih.gov/pubmed/37765915 http://dx.doi.org/10.3390/s23187859
work_keys_str_mv	AT zhangqingmiao constraineddrlforenergyefficiencyoptimizationinrsmabasedintegratedsatelliteterrestrialnetwork AT zhulidong constraineddrlforenergyefficiencyoptimizationinrsmabasedintegratedsatelliteterrestrialnetwork AT chenyanyan constraineddrlforenergyefficiencyoptimizationinrsmabasedintegratedsatelliteterrestrialnetwork AT jiangshan constraineddrlforenergyefficiencyoptimizationinrsmabasedintegratedsatelliteterrestrialnetwork

Constrained DRL for Energy Efficiency Optimization in RSMA-Based Integrated Satellite Terrestrial Network

Ejemplares similares