Cargando…
Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems
The advent of the Internet of Things (IoT) has triggered an increased demand for sensing devices with multiple integrated wireless transceivers. These platforms often support the advantageous use of multiple radio technologies to exploit their differing characteristics. Intelligent radio selection t...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10223095/ https://www.ncbi.nlm.nih.gov/pubmed/37430736 http://dx.doi.org/10.3390/s23104821 |
_version_ | 1785049858884763648 |
---|---|
author | Delaney, James Dowey, Steve Cheng, Chi-Tsun |
author_facet | Delaney, James Dowey, Steve Cheng, Chi-Tsun |
author_sort | Delaney, James |
collection | PubMed |
description | The advent of the Internet of Things (IoT) has triggered an increased demand for sensing devices with multiple integrated wireless transceivers. These platforms often support the advantageous use of multiple radio technologies to exploit their differing characteristics. Intelligent radio selection techniques allow these systems to become highly adaptive, ensuring more robust and reliable communications under dynamic channel conditions. In this paper, we focus on the wireless links between devices equipped by deployed operating personnel and intermediary access-point infrastructure. We use multi-radio platforms and wireless devices with multiple and diverse transceiver technologies to produce robust and reliable links through the adaptive control of available transceivers. In this work, the term ‘robust’ refers to communications that can be maintained despite changes in the environmental and radio conditions, i.e., during periods of interference caused by non-cooperative actors or multi-path or fading conditions in the physical environment. In this paper, a multi-objective reinforcement learning (MORL) framework is applied to address a multi-radio selection and power control problem. We propose independent reward functions to manage the trade-off between the conflicting objectives of minimised power consumption and maximised bit rate. We also adopt an adaptive exploration strategy for learning a robust behaviour policy and compare its online performance to conventional methods. An extension to the multi-objective state–action–reward–state–action (SARSA) algorithm is proposed to implement this adaptive exploration strategy. When applying adaptive exploration to the extended multi-objective SARSA algorithm, we achieve a 20% increase in the F1 score in comparison to one with decayed exploration policies. |
format | Online Article Text |
id | pubmed-10223095 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-102230952023-05-28 Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems Delaney, James Dowey, Steve Cheng, Chi-Tsun Sensors (Basel) Article The advent of the Internet of Things (IoT) has triggered an increased demand for sensing devices with multiple integrated wireless transceivers. These platforms often support the advantageous use of multiple radio technologies to exploit their differing characteristics. Intelligent radio selection techniques allow these systems to become highly adaptive, ensuring more robust and reliable communications under dynamic channel conditions. In this paper, we focus on the wireless links between devices equipped by deployed operating personnel and intermediary access-point infrastructure. We use multi-radio platforms and wireless devices with multiple and diverse transceiver technologies to produce robust and reliable links through the adaptive control of available transceivers. In this work, the term ‘robust’ refers to communications that can be maintained despite changes in the environmental and radio conditions, i.e., during periods of interference caused by non-cooperative actors or multi-path or fading conditions in the physical environment. In this paper, a multi-objective reinforcement learning (MORL) framework is applied to address a multi-radio selection and power control problem. We propose independent reward functions to manage the trade-off between the conflicting objectives of minimised power consumption and maximised bit rate. We also adopt an adaptive exploration strategy for learning a robust behaviour policy and compare its online performance to conventional methods. An extension to the multi-objective state–action–reward–state–action (SARSA) algorithm is proposed to implement this adaptive exploration strategy. When applying adaptive exploration to the extended multi-objective SARSA algorithm, we achieve a 20% increase in the F1 score in comparison to one with decayed exploration policies. MDPI 2023-05-17 /pmc/articles/PMC10223095/ /pubmed/37430736 http://dx.doi.org/10.3390/s23104821 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Delaney, James Dowey, Steve Cheng, Chi-Tsun Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems |
title | Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems |
title_full | Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems |
title_fullStr | Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems |
title_full_unstemmed | Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems |
title_short | Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems |
title_sort | reinforcement-learning-based robust resource management for multi-radio systems |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10223095/ https://www.ncbi.nlm.nih.gov/pubmed/37430736 http://dx.doi.org/10.3390/s23104821 |
work_keys_str_mv | AT delaneyjames reinforcementlearningbasedrobustresourcemanagementformultiradiosystems AT doweysteve reinforcementlearningbasedrobustresourcemanagementformultiradiosystems AT chengchitsun reinforcementlearningbasedrobustresourcemanagementformultiradiosystems |