Cargando…

Toward an Adaptive Threshold on Cooperative Bandwidth Management Based on Hierarchical Reinforcement Learning

With the increase in Internet of Things (IoT) devices and network communications, but with less bandwidth growth, the resulting constraints must be overcome. Due to the network complexity and uncertainty of emergency distribution parameters in smart environments, using predetermined rules seems illo...

Descripción completa

Detalles Bibliográficos
Autores principales: Mobasheri, Motahareh, Kim, Yangwoo, Kim, Woongsup
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8587839/
https://www.ncbi.nlm.nih.gov/pubmed/34770360
http://dx.doi.org/10.3390/s21217053
_version_ 1784598268702883840
author Mobasheri, Motahareh
Kim, Yangwoo
Kim, Woongsup
author_facet Mobasheri, Motahareh
Kim, Yangwoo
Kim, Woongsup
author_sort Mobasheri, Motahareh
collection PubMed
description With the increase in Internet of Things (IoT) devices and network communications, but with less bandwidth growth, the resulting constraints must be overcome. Due to the network complexity and uncertainty of emergency distribution parameters in smart environments, using predetermined rules seems illogical. Reinforcement learning (RL), as a powerful machine learning approach, can handle such smart environments without a trainer or supervisor. Recently, we worked on bandwidth management in a smart environment with several fog fragments using limited shared bandwidth, where IoT devices may experience uncertain emergencies in terms of the time and sequence needed for more bandwidth for further higher-level communication. We introduced fog fragment cooperation using an RL approach under a predefined fixed threshold constraint. In this study, we promote this approach by removing the fixed level of restriction of the threshold through hierarchical reinforcement learning (HRL) and completing the cooperation qualification. At the first learning hierarchy level of the proposed approach, the best threshold level is learned over time, and the final results are used by the second learning hierarchy level, where the fog node learns the best device for helping an emergency device by temporarily lending the bandwidth. Although equipping the method to the adaptive threshold and restricting fog fragment cooperation make the learning procedure more difficult, the HRL approach increases the method’s efficiency in terms of time and performance.
format Online
Article
Text
id pubmed-8587839
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-85878392021-11-13 Toward an Adaptive Threshold on Cooperative Bandwidth Management Based on Hierarchical Reinforcement Learning Mobasheri, Motahareh Kim, Yangwoo Kim, Woongsup Sensors (Basel) Article With the increase in Internet of Things (IoT) devices and network communications, but with less bandwidth growth, the resulting constraints must be overcome. Due to the network complexity and uncertainty of emergency distribution parameters in smart environments, using predetermined rules seems illogical. Reinforcement learning (RL), as a powerful machine learning approach, can handle such smart environments without a trainer or supervisor. Recently, we worked on bandwidth management in a smart environment with several fog fragments using limited shared bandwidth, where IoT devices may experience uncertain emergencies in terms of the time and sequence needed for more bandwidth for further higher-level communication. We introduced fog fragment cooperation using an RL approach under a predefined fixed threshold constraint. In this study, we promote this approach by removing the fixed level of restriction of the threshold through hierarchical reinforcement learning (HRL) and completing the cooperation qualification. At the first learning hierarchy level of the proposed approach, the best threshold level is learned over time, and the final results are used by the second learning hierarchy level, where the fog node learns the best device for helping an emergency device by temporarily lending the bandwidth. Although equipping the method to the adaptive threshold and restricting fog fragment cooperation make the learning procedure more difficult, the HRL approach increases the method’s efficiency in terms of time and performance. MDPI 2021-10-25 /pmc/articles/PMC8587839/ /pubmed/34770360 http://dx.doi.org/10.3390/s21217053 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Mobasheri, Motahareh
Kim, Yangwoo
Kim, Woongsup
Toward an Adaptive Threshold on Cooperative Bandwidth Management Based on Hierarchical Reinforcement Learning
title Toward an Adaptive Threshold on Cooperative Bandwidth Management Based on Hierarchical Reinforcement Learning
title_full Toward an Adaptive Threshold on Cooperative Bandwidth Management Based on Hierarchical Reinforcement Learning
title_fullStr Toward an Adaptive Threshold on Cooperative Bandwidth Management Based on Hierarchical Reinforcement Learning
title_full_unstemmed Toward an Adaptive Threshold on Cooperative Bandwidth Management Based on Hierarchical Reinforcement Learning
title_short Toward an Adaptive Threshold on Cooperative Bandwidth Management Based on Hierarchical Reinforcement Learning
title_sort toward an adaptive threshold on cooperative bandwidth management based on hierarchical reinforcement learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8587839/
https://www.ncbi.nlm.nih.gov/pubmed/34770360
http://dx.doi.org/10.3390/s21217053
work_keys_str_mv AT mobasherimotahareh towardanadaptivethresholdoncooperativebandwidthmanagementbasedonhierarchicalreinforcementlearning
AT kimyangwoo towardanadaptivethresholdoncooperativebandwidthmanagementbasedonhierarchicalreinforcementlearning
AT kimwoongsup towardanadaptivethresholdoncooperativebandwidthmanagementbasedonhierarchicalreinforcementlearning