Dynamic Service Function Chain Deployment and Readjustment Method Based on Deep Reinforcement Learning
With the advent of Software Defined Network (SDN) and Network Functions Virtualization (NFV), network operators can offer Service Function Chain (SFC) flexibly to accommodate the diverse network function (NF) requirements of their users. However, deploying SFCs efficiently on the underlying network...
Main Authors: | Ran, Jing; Wang, Wenkai; Hu, Hefei |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2023 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10059095/ https://www.ncbi.nlm.nih.gov/pubmed/36991766 http://dx.doi.org/10.3390/s23063054 |
_version_ | 1785016792755732480 |
---|---|
author | Ran, Jing Wang, Wenkai Hu, Hefei |
author_sort | Ran, Jing |
collection | PubMed |
description | With the advent of Software-Defined Networking (SDN) and Network Functions Virtualization (NFV), network operators can flexibly offer Service Function Chains (SFCs) to accommodate the diverse network function (NF) requirements of their users. However, deploying SFCs efficiently on the underlying network in response to dynamic SFC requests poses significant challenges. This paper proposes MQDR, a dynamic SFC deployment and readjustment method based on a deep Q network (DQN) and the M Shortest Path Algorithm (MSPA), to address this problem. We develop a model of the dynamic SFC deployment and readjustment problem on an NFV/SFC network, with the objective of maximizing the request acceptance rate. We formulate the problem as a Markov Decision Process (MDP) and apply Reinforcement Learning (RL) to solve it. In MQDR, two agents collaboratively deploy and readjust SFCs dynamically to raise the service request acceptance rate. We reduce the action space for dynamic deployment by applying MSPA and reduce the action space for readjustment from two dimensions to one. The smaller action space lowers the training difficulty and improves the training results of the proposed algorithm. Simulation experiments show that MQDR improves the request acceptance rate by approximately 25% compared with the original DQN algorithm and by 9.3% compared with the Load Balancing Shortest Path (LBSP) algorithm. |
format | Online Article Text |
id | pubmed-10059095 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-10059095 2023-03-30 Dynamic Service Function Chain Deployment and Readjustment Method Based on Deep Reinforcement Learning Ran, Jing Wang, Wenkai Hu, Hefei Sensors (Basel) Article MDPI 2023-03-12 /pmc/articles/PMC10059095/ /pubmed/36991766 http://dx.doi.org/10.3390/s23063054 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Ran, Jing Wang, Wenkai Hu, Hefei Dynamic Service Function Chain Deployment and Readjustment Method Based on Deep Reinforcement Learning |
title | Dynamic Service Function Chain Deployment and Readjustment Method Based on Deep Reinforcement Learning |
title_sort | dynamic service function chain deployment and readjustment method based on deep reinforcement learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10059095/ https://www.ncbi.nlm.nih.gov/pubmed/36991766 http://dx.doi.org/10.3390/s23063054 |
work_keys_str_mv | AT ranjing dynamicservicefunctionchaindeploymentandreadjustmentmethodbasedondeepreinforcementlearning AT wangwenkai dynamicservicefunctionchaindeploymentandreadjustmentmethodbasedondeepreinforcementlearning AT huhefei dynamicservicefunctionchaindeploymentandreadjustmentmethodbasedondeepreinforcementlearning |
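
The description above says the deployment action space is reduced by considering only M shortest paths. A minimal Python sketch of that idea follows. It is an illustration under stated assumptions, not the authors' MSPA or MQDR implementation: the function name `m_shortest_paths`, the toy `topology`, and the use of a best-first enumeration of loop-free paths are all hypothetical choices made here. The point is only that an agent would choose among at most M candidate paths (an index from 0 to M-1) instead of among all possible routes.

```python
import heapq
from typing import Dict, List, Tuple

# Adjacency map: node -> {neighbor: non-negative link weight}.
Graph = Dict[str, Dict[str, float]]


def m_shortest_paths(
    graph: Graph, src: str, dst: str, m: int
) -> List[Tuple[float, List[str]]]:
    """Return up to m loop-free paths from src to dst, cheapest first.

    Best-first search over partial paths: the cheapest partial path is
    always expanded next, so complete paths are found in cost order
    (assuming non-negative weights). Worst-case cost is exponential,
    which is acceptable only for a small illustrative graph.
    """
    heap: List[Tuple[float, List[str]]] = [(0.0, [src])]
    found: List[Tuple[float, List[str]]] = []
    while heap and len(found) < m:
        cost, path = heapq.heappop(heap)
        node = path[-1]
        if node == dst:
            found.append((cost, path))
            continue
        for nxt, weight in graph.get(node, {}).items():
            if nxt not in path:  # keep paths loop-free
                heapq.heappush(heap, (cost + weight, path + [nxt]))
    return found


# Hypothetical 4-node topology; weights stand in for link delay.
topology: Graph = {
    "a": {"b": 1.0, "c": 2.0},
    "b": {"a": 1.0, "c": 1.0, "d": 3.0},
    "c": {"a": 2.0, "b": 1.0, "d": 1.0},
    "d": {"b": 3.0, "c": 1.0},
}

# With M = 3, the candidate action set has at most three entries.
actions = m_shortest_paths(topology, "a", "d", 3)
for index, (cost, path) in enumerate(actions):
    print(index, cost, " -> ".join(path))
```

In a DQN setting, the network would then output one Q-value per candidate index, so the output layer has size M regardless of how many routes the topology contains.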