Cargando…

Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems

The advent of the Internet of Things (IoT) has triggered an increased demand for sensing devices with multiple integrated wireless transceivers. These platforms often support the advantageous use of multiple radio technologies to exploit their differing characteristics. Intelligent radio selection t...

Descripción completa

Detalles Bibliográficos
Autores principales:	Delaney, James, Dowey, Steve, Cheng, Chi-Tsun
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10223095/ https://www.ncbi.nlm.nih.gov/pubmed/37430736 http://dx.doi.org/10.3390/s23104821

_version_	1785049858884763648
author	Delaney, James Dowey, Steve Cheng, Chi-Tsun
author_facet	Delaney, James Dowey, Steve Cheng, Chi-Tsun
author_sort	Delaney, James
collection	PubMed
description	The advent of the Internet of Things (IoT) has triggered an increased demand for sensing devices with multiple integrated wireless transceivers. These platforms often support the advantageous use of multiple radio technologies to exploit their differing characteristics. Intelligent radio selection techniques allow these systems to become highly adaptive, ensuring more robust and reliable communications under dynamic channel conditions. In this paper, we focus on the wireless links between devices equipped by deployed operating personnel and intermediary access-point infrastructure. We use multi-radio platforms and wireless devices with multiple and diverse transceiver technologies to produce robust and reliable links through the adaptive control of available transceivers. In this work, the term ‘robust’ refers to communications that can be maintained despite changes in the environmental and radio conditions, i.e., during periods of interference caused by non-cooperative actors or multi-path or fading conditions in the physical environment. In this paper, a multi-objective reinforcement learning (MORL) framework is applied to address a multi-radio selection and power control problem. We propose independent reward functions to manage the trade-off between the conflicting objectives of minimised power consumption and maximised bit rate. We also adopt an adaptive exploration strategy for learning a robust behaviour policy and compare its online performance to conventional methods. An extension to the multi-objective state–action–reward–state–action (SARSA) algorithm is proposed to implement this adaptive exploration strategy. When applying adaptive exploration to the extended multi-objective SARSA algorithm, we achieve a 20% increase in the F1 score in comparison to one with decayed exploration policies.
format	Online Article Text
id	pubmed-10223095
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-102230952023-05-28 Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems Delaney, James Dowey, Steve Cheng, Chi-Tsun Sensors (Basel) Article The advent of the Internet of Things (IoT) has triggered an increased demand for sensing devices with multiple integrated wireless transceivers. These platforms often support the advantageous use of multiple radio technologies to exploit their differing characteristics. Intelligent radio selection techniques allow these systems to become highly adaptive, ensuring more robust and reliable communications under dynamic channel conditions. In this paper, we focus on the wireless links between devices equipped by deployed operating personnel and intermediary access-point infrastructure. We use multi-radio platforms and wireless devices with multiple and diverse transceiver technologies to produce robust and reliable links through the adaptive control of available transceivers. In this work, the term ‘robust’ refers to communications that can be maintained despite changes in the environmental and radio conditions, i.e., during periods of interference caused by non-cooperative actors or multi-path or fading conditions in the physical environment. In this paper, a multi-objective reinforcement learning (MORL) framework is applied to address a multi-radio selection and power control problem. We propose independent reward functions to manage the trade-off between the conflicting objectives of minimised power consumption and maximised bit rate. We also adopt an adaptive exploration strategy for learning a robust behaviour policy and compare its online performance to conventional methods. An extension to the multi-objective state–action–reward–state–action (SARSA) algorithm is proposed to implement this adaptive exploration strategy. When applying adaptive exploration to the extended multi-objective SARSA algorithm, we achieve a 20% increase in the F1 score in comparison to one with decayed exploration policies. MDPI 2023-05-17 /pmc/articles/PMC10223095/ /pubmed/37430736 http://dx.doi.org/10.3390/s23104821 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Delaney, James Dowey, Steve Cheng, Chi-Tsun Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems
title	Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems
title_full	Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems
title_fullStr	Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems
title_full_unstemmed	Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems
title_short	Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems
title_sort	reinforcement-learning-based robust resource management for multi-radio systems
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10223095/ https://www.ncbi.nlm.nih.gov/pubmed/37430736 http://dx.doi.org/10.3390/s23104821
work_keys_str_mv	AT delaneyjames reinforcementlearningbasedrobustresourcemanagementformultiradiosystems AT doweysteve reinforcementlearningbasedrobustresourcemanagementformultiradiosystems AT chengchitsun reinforcementlearningbasedrobustresourcemanagementformultiradiosystems

Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems

Ejemplares similares