Dynamic Service Function Chain Deployment and Readjustment Method Based on Deep Reinforcement Learning

With the advent of Software Defined Network (SDN) and Network Functions Virtualization (NFV), network operators can offer Service Function Chain (SFC) flexibly to accommodate the diverse network function (NF) requirements of their users. However, deploying SFCs efficiently on the underlying network...

Bibliographic Details
Main authors: Ran, Jing; Wang, Wenkai; Hu, Hefei
Format: Online Article Text
Language: English
Published: MDPI 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10059095/
https://www.ncbi.nlm.nih.gov/pubmed/36991766
http://dx.doi.org/10.3390/s23063054
_version_ 1785016792755732480
author Ran, Jing
Wang, Wenkai
Hu, Hefei
author_facet Ran, Jing
Wang, Wenkai
Hu, Hefei
author_sort Ran, Jing
collection PubMed
description With the advent of Software Defined Network (SDN) and Network Functions Virtualization (NFV), network operators can offer Service Function Chain (SFC) flexibly to accommodate the diverse network function (NF) requirements of their users. However, deploying SFCs efficiently on the underlying network in response to dynamic SFC requests poses significant challenges and complexities. This paper proposes MQDR, a dynamic SFC deployment and readjustment method based on a deep Q network (DQN) and the M Shortest Path Algorithm (MSPA), to address this problem. We develop a model of the dynamic SFC deployment and readjustment problem on the basis of the NFV/SFC network to maximize the request acceptance rate. We transform the problem into a Markov Decision Process (MDP) and apply Reinforcement Learning (RL) to achieve this goal. In MQDR, we employ two agents that collaboratively deploy and readjust SFCs dynamically to enhance the service request acceptance rate. We reduce the action space for dynamic deployment by applying MSPA and decrease the action space for readjustment from two dimensions to one. By reducing the action space, we decrease the training difficulty and improve the actual training effect of our proposed algorithm. The simulation experiments show that MQDR improves the request acceptance rate by approximately 25% compared with the original DQN algorithm and 9.3% compared with the Load Balancing Shortest Path (LBSP) algorithm.
format Online
Article
Text
id pubmed-10059095
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-10059095 2023-03-30 Dynamic Service Function Chain Deployment and Readjustment Method Based on Deep Reinforcement Learning Ran, Jing Wang, Wenkai Hu, Hefei Sensors (Basel) Article With the advent of Software Defined Network (SDN) and Network Functions Virtualization (NFV), network operators can offer Service Function Chain (SFC) flexibly to accommodate the diverse network function (NF) requirements of their users. However, deploying SFCs efficiently on the underlying network in response to dynamic SFC requests poses significant challenges and complexities. This paper proposes MQDR, a dynamic SFC deployment and readjustment method based on a deep Q network (DQN) and the M Shortest Path Algorithm (MSPA), to address this problem. We develop a model of the dynamic SFC deployment and readjustment problem on the basis of the NFV/SFC network to maximize the request acceptance rate. We transform the problem into a Markov Decision Process (MDP) and apply Reinforcement Learning (RL) to achieve this goal. In MQDR, we employ two agents that collaboratively deploy and readjust SFCs dynamically to enhance the service request acceptance rate. We reduce the action space for dynamic deployment by applying MSPA and decrease the action space for readjustment from two dimensions to one. By reducing the action space, we decrease the training difficulty and improve the actual training effect of our proposed algorithm. The simulation experiments show that MQDR improves the request acceptance rate by approximately 25% compared with the original DQN algorithm and 9.3% compared with the Load Balancing Shortest Path (LBSP) algorithm. MDPI 2023-03-12 /pmc/articles/PMC10059095/ /pubmed/36991766 http://dx.doi.org/10.3390/s23063054 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland.
This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Ran, Jing
Wang, Wenkai
Hu, Hefei
Dynamic Service Function Chain Deployment and Readjustment Method Based on Deep Reinforcement Learning
title Dynamic Service Function Chain Deployment and Readjustment Method Based on Deep Reinforcement Learning
title_full Dynamic Service Function Chain Deployment and Readjustment Method Based on Deep Reinforcement Learning
title_fullStr Dynamic Service Function Chain Deployment and Readjustment Method Based on Deep Reinforcement Learning
title_full_unstemmed Dynamic Service Function Chain Deployment and Readjustment Method Based on Deep Reinforcement Learning
title_short Dynamic Service Function Chain Deployment and Readjustment Method Based on Deep Reinforcement Learning
title_sort dynamic service function chain deployment and readjustment method based on deep reinforcement learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10059095/
https://www.ncbi.nlm.nih.gov/pubmed/36991766
http://dx.doi.org/10.3390/s23063054
work_keys_str_mv AT ranjing dynamicservicefunctionchaindeploymentandreadjustmentmethodbasedondeepreinforcementlearning
AT wangwenkai dynamicservicefunctionchaindeploymentandreadjustmentmethodbasedondeepreinforcementlearning
AT huhefei dynamicservicefunctionchaindeploymentandreadjustmentmethodbasedondeepreinforcementlearning
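
The abstract in this record says the deployment agent's action space is reduced by restricting it to M shortest paths. The sketch below illustrates that idea only; it is not the authors' implementation. The function names, the toy topology, and the unit link costs are all hypothetical, and the path search is a simple best-first enumeration of loop-free paths chosen for brevity, which may differ from the MSPA used in the paper.

```python
import heapq
from typing import Dict, List, Tuple

# Hypothetical representation: node -> {neighbor: link cost}
Graph = Dict[str, Dict[str, float]]


def m_shortest_paths(graph: Graph, src: str, dst: str, m: int) -> List[Tuple[float, List[str]]]:
    """Return up to m loop-free paths from src to dst, cheapest first.

    Best-first enumeration of simple paths. With non-negative costs,
    complete paths are popped in non-decreasing cost order. Suitable for
    small topologies only; the worst case is exponential.
    """
    found: List[Tuple[float, List[str]]] = []
    frontier: List[Tuple[float, List[str]]] = [(0.0, [src])]
    while frontier and len(found) < m:
        cost, path = heapq.heappop(frontier)
        node = path[-1]
        if node == dst:
            found.append((cost, path))
            continue
        for nxt, weight in graph.get(node, {}).items():
            if nxt not in path:  # keep paths loop-free
                heapq.heappush(frontier, (cost + weight, path + [nxt]))
    return found


def action_to_path(candidates: List[Tuple[float, List[str]]], action: int) -> List[str]:
    """Map a discrete agent action (an index below M) to a candidate path."""
    return candidates[action][1]


# Toy four-node topology with unit link costs (assumed for illustration).
TOPOLOGY: Graph = {
    "A": {"B": 1, "C": 1},
    "B": {"A": 1, "C": 1, "D": 1},
    "C": {"A": 1, "B": 1, "D": 1},
    "D": {"B": 1, "C": 1},
}

candidates = m_shortest_paths(TOPOLOGY, "A", "D", 3)
chosen = action_to_path(candidates, 1)
```

With this framing an agent selects an index among at most M candidates instead of choosing among all possible paths, which is the kind of action-space reduction the abstract describes.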