Cargando…

Q-Learning Based Joint Energy-Spectral Efficiency Optimization in Multi-Hop Device-to-Device Communication

In scenarios, like critical public safety communication networks, On-Scene Available (OSA) user equipment (UE) may be only partially connected with the network infrastructure, e.g., due to physical damages or on-purpose deactivation by the authorities. In this work, we consider multi-hop Device-to-D...

Descripción completa

Detalles Bibliográficos
Autores principales:	Khan, Muhidul Islam, Reggiani, Luca, Alam, Muhammad Mahtab, Le Moullec, Yannick, Sharma, Navuday, Yaacoub, Elias, Magarini, Maurizio
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7700512/ https://www.ncbi.nlm.nih.gov/pubmed/33238453 http://dx.doi.org/10.3390/s20226692

_version_	1783616297609723904
author	Khan, Muhidul Islam Reggiani, Luca Alam, Muhammad Mahtab Le Moullec, Yannick Sharma, Navuday Yaacoub, Elias Magarini, Maurizio
author_facet	Khan, Muhidul Islam Reggiani, Luca Alam, Muhammad Mahtab Le Moullec, Yannick Sharma, Navuday Yaacoub, Elias Magarini, Maurizio
author_sort	Khan, Muhidul Islam
collection	PubMed
description	In scenarios, like critical public safety communication networks, On-Scene Available (OSA) user equipment (UE) may be only partially connected with the network infrastructure, e.g., due to physical damages or on-purpose deactivation by the authorities. In this work, we consider multi-hop Device-to-Device (D2D) communication in a hybrid infrastructure where OSA UEs connect to each other in a seamless manner in order to disseminate critical information to a deployed command center. The challenge that we address is to simultaneously keep the OSA UEs alive as long as possible and send the critical information to a final destination (e.g., a command center) as rapidly as possible, while considering the heterogeneous characteristics of the OSA UEs. We propose a dynamic adaptation approach based on machine learning to improve a joint energy-spectral efficiency (ESE). We apply a Q-learning scheme in a hybrid fashion (partially distributed and centralized) in learner agents (distributed OSA UEs) and scheduler agents (remote radio heads or RRHs), for which the next hop selection and RRH selection algorithms are proposed. Our simulation results show that the proposed dynamic adaptation approach outperforms the baseline system by approximately [Formula: see text] in terms of joint energy-spectral efficiency, wherein the energy efficiency of the OSA UEs benefit from a gain of approximately [Formula: see text]. Finally, the results show also that our proposed framework with C-RAN reduces latency by approximately [Formula: see text] w.r.t. the baseline.
format	Online Article Text
id	pubmed-7700512
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-77005122020-11-30 Q-Learning Based Joint Energy-Spectral Efficiency Optimization in Multi-Hop Device-to-Device Communication Khan, Muhidul Islam Reggiani, Luca Alam, Muhammad Mahtab Le Moullec, Yannick Sharma, Navuday Yaacoub, Elias Magarini, Maurizio Sensors (Basel) Article In scenarios, like critical public safety communication networks, On-Scene Available (OSA) user equipment (UE) may be only partially connected with the network infrastructure, e.g., due to physical damages or on-purpose deactivation by the authorities. In this work, we consider multi-hop Device-to-Device (D2D) communication in a hybrid infrastructure where OSA UEs connect to each other in a seamless manner in order to disseminate critical information to a deployed command center. The challenge that we address is to simultaneously keep the OSA UEs alive as long as possible and send the critical information to a final destination (e.g., a command center) as rapidly as possible, while considering the heterogeneous characteristics of the OSA UEs. We propose a dynamic adaptation approach based on machine learning to improve a joint energy-spectral efficiency (ESE). We apply a Q-learning scheme in a hybrid fashion (partially distributed and centralized) in learner agents (distributed OSA UEs) and scheduler agents (remote radio heads or RRHs), for which the next hop selection and RRH selection algorithms are proposed. Our simulation results show that the proposed dynamic adaptation approach outperforms the baseline system by approximately [Formula: see text] in terms of joint energy-spectral efficiency, wherein the energy efficiency of the OSA UEs benefit from a gain of approximately [Formula: see text]. Finally, the results show also that our proposed framework with C-RAN reduces latency by approximately [Formula: see text] w.r.t. the baseline. MDPI 2020-11-23 /pmc/articles/PMC7700512/ /pubmed/33238453 http://dx.doi.org/10.3390/s20226692 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Khan, Muhidul Islam Reggiani, Luca Alam, Muhammad Mahtab Le Moullec, Yannick Sharma, Navuday Yaacoub, Elias Magarini, Maurizio Q-Learning Based Joint Energy-Spectral Efficiency Optimization in Multi-Hop Device-to-Device Communication
title	Q-Learning Based Joint Energy-Spectral Efficiency Optimization in Multi-Hop Device-to-Device Communication
title_full	Q-Learning Based Joint Energy-Spectral Efficiency Optimization in Multi-Hop Device-to-Device Communication
title_fullStr	Q-Learning Based Joint Energy-Spectral Efficiency Optimization in Multi-Hop Device-to-Device Communication
title_full_unstemmed	Q-Learning Based Joint Energy-Spectral Efficiency Optimization in Multi-Hop Device-to-Device Communication
title_short	Q-Learning Based Joint Energy-Spectral Efficiency Optimization in Multi-Hop Device-to-Device Communication
title_sort	q-learning based joint energy-spectral efficiency optimization in multi-hop device-to-device communication
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7700512/ https://www.ncbi.nlm.nih.gov/pubmed/33238453 http://dx.doi.org/10.3390/s20226692
work_keys_str_mv	AT khanmuhidulislam qlearningbasedjointenergyspectralefficiencyoptimizationinmultihopdevicetodevicecommunication AT reggianiluca qlearningbasedjointenergyspectralefficiencyoptimizationinmultihopdevicetodevicecommunication AT alammuhammadmahtab qlearningbasedjointenergyspectralefficiencyoptimizationinmultihopdevicetodevicecommunication AT lemoullecyannick qlearningbasedjointenergyspectralefficiencyoptimizationinmultihopdevicetodevicecommunication AT sharmanavuday qlearningbasedjointenergyspectralefficiencyoptimizationinmultihopdevicetodevicecommunication AT yaacoubelias qlearningbasedjointenergyspectralefficiencyoptimizationinmultihopdevicetodevicecommunication AT magarinimaurizio qlearningbasedjointenergyspectralefficiencyoptimizationinmultihopdevicetodevicecommunication

Q-Learning Based Joint Energy-Spectral Efficiency Optimization in Multi-Hop Device-to-Device Communication

Ejemplares similares