Cargando…

Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems

The advent of the Internet of Things (IoT) has triggered an increased demand for sensing devices with multiple integrated wireless transceivers. These platforms often support the advantageous use of multiple radio technologies to exploit their differing characteristics. Intelligent radio selection t...

Descripción completa

Detalles Bibliográficos
Autores principales: Delaney, James, Dowey, Steve, Cheng, Chi-Tsun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10223095/
https://www.ncbi.nlm.nih.gov/pubmed/37430736
http://dx.doi.org/10.3390/s23104821
_version_ 1785049858884763648
author Delaney, James
Dowey, Steve
Cheng, Chi-Tsun
author_facet Delaney, James
Dowey, Steve
Cheng, Chi-Tsun
author_sort Delaney, James
collection PubMed
description The advent of the Internet of Things (IoT) has triggered an increased demand for sensing devices with multiple integrated wireless transceivers. These platforms often support the advantageous use of multiple radio technologies to exploit their differing characteristics. Intelligent radio selection techniques allow these systems to become highly adaptive, ensuring more robust and reliable communications under dynamic channel conditions. In this paper, we focus on the wireless links between devices equipped by deployed operating personnel and intermediary access-point infrastructure. We use multi-radio platforms and wireless devices with multiple and diverse transceiver technologies to produce robust and reliable links through the adaptive control of available transceivers. In this work, the term ‘robust’ refers to communications that can be maintained despite changes in the environmental and radio conditions, i.e., during periods of interference caused by non-cooperative actors or multi-path or fading conditions in the physical environment. In this paper, a multi-objective reinforcement learning (MORL) framework is applied to address a multi-radio selection and power control problem. We propose independent reward functions to manage the trade-off between the conflicting objectives of minimised power consumption and maximised bit rate. We also adopt an adaptive exploration strategy for learning a robust behaviour policy and compare its online performance to conventional methods. An extension to the multi-objective state–action–reward–state–action (SARSA) algorithm is proposed to implement this adaptive exploration strategy. When applying adaptive exploration to the extended multi-objective SARSA algorithm, we achieve a 20% increase in the F1 score in comparison to one with decayed exploration policies.
format Online
Article
Text
id pubmed-10223095
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-102230952023-05-28 Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems Delaney, James Dowey, Steve Cheng, Chi-Tsun Sensors (Basel) Article The advent of the Internet of Things (IoT) has triggered an increased demand for sensing devices with multiple integrated wireless transceivers. These platforms often support the advantageous use of multiple radio technologies to exploit their differing characteristics. Intelligent radio selection techniques allow these systems to become highly adaptive, ensuring more robust and reliable communications under dynamic channel conditions. In this paper, we focus on the wireless links between devices equipped by deployed operating personnel and intermediary access-point infrastructure. We use multi-radio platforms and wireless devices with multiple and diverse transceiver technologies to produce robust and reliable links through the adaptive control of available transceivers. In this work, the term ‘robust’ refers to communications that can be maintained despite changes in the environmental and radio conditions, i.e., during periods of interference caused by non-cooperative actors or multi-path or fading conditions in the physical environment. In this paper, a multi-objective reinforcement learning (MORL) framework is applied to address a multi-radio selection and power control problem. We propose independent reward functions to manage the trade-off between the conflicting objectives of minimised power consumption and maximised bit rate. We also adopt an adaptive exploration strategy for learning a robust behaviour policy and compare its online performance to conventional methods. An extension to the multi-objective state–action–reward–state–action (SARSA) algorithm is proposed to implement this adaptive exploration strategy. When applying adaptive exploration to the extended multi-objective SARSA algorithm, we achieve a 20% increase in the F1 score in comparison to one with decayed exploration policies. MDPI 2023-05-17 /pmc/articles/PMC10223095/ /pubmed/37430736 http://dx.doi.org/10.3390/s23104821 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Delaney, James
Dowey, Steve
Cheng, Chi-Tsun
Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems
title Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems
title_full Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems
title_fullStr Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems
title_full_unstemmed Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems
title_short Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems
title_sort reinforcement-learning-based robust resource management for multi-radio systems
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10223095/
https://www.ncbi.nlm.nih.gov/pubmed/37430736
http://dx.doi.org/10.3390/s23104821
work_keys_str_mv AT delaneyjames reinforcementlearningbasedrobustresourcemanagementformultiradiosystems
AT doweysteve reinforcementlearningbasedrobustresourcemanagementformultiradiosystems
AT chengchitsun reinforcementlearningbasedrobustresourcemanagementformultiradiosystems