Cargando…
A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks
The Internet of Vehicles (IoV) enables vehicular data services and applications through vehicle-to-everything (V2X) communications. One of the key services provided by IoV is popular content distribution (PCD), which aims to quickly deliver popular content that most vehicles request. However, it is...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10216958/ https://www.ncbi.nlm.nih.gov/pubmed/37238547 http://dx.doi.org/10.3390/e25050792 |
_version_ | 1785048420828839936 |
---|---|
author | Chen, Wenwei Huang, Xiujie Guan, Quanlong Zhao, Shancheng |
author_facet | Chen, Wenwei Huang, Xiujie Guan, Quanlong Zhao, Shancheng |
author_sort | Chen, Wenwei |
collection | PubMed |
description | The Internet of Vehicles (IoV) enables vehicular data services and applications through vehicle-to-everything (V2X) communications. One of the key services provided by IoV is popular content distribution (PCD), which aims to quickly deliver popular content that most vehicles request. However, it is challenging for vehicles to receive the complete popular content from roadside units (RSUs) due to their mobility and the RSUs’ constrained coverage. The collaboration of vehicles via vehicle-to-vehicle (V2V) communications is an effective solution to assist more vehicles to obtain the entire popular content at a lower time cost. To this end, we propose a multi-agent deep reinforcement learning (MADRL)-based popular content distribution scheme in vehicular networks, where each vehicle deploys an MADRL agent that learns to choose the appropriate data transmission policy. To reduce the complexity of the MADRL-based algorithm, a vehicle clustering algorithm based on spectral clustering is provided to divide all vehicles in the V2V phase into groups, so that only vehicles within the same group exchange data. Then the multi-agent proximal policy optimization (MAPPO) algorithm is used to train the agent. We introduce the self-attention mechanism when constructing the neural network for the MADRL to help the agent accurately represent the environment and make decisions. Furthermore, the invalid action masking technique is utilized to prevent the agent from taking invalid actions, accelerating the training process of the agent. Finally, experimental results are shown and a comprehensive comparison is provided, which demonstrates that our MADRL-PCD scheme outperforms both the coalition game-based scheme and the greedy strategy-based scheme, achieving a higher PCD efficiency and a lower transmission delay. |
format | Online Article Text |
id | pubmed-10216958 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-102169582023-05-27 A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks Chen, Wenwei Huang, Xiujie Guan, Quanlong Zhao, Shancheng Entropy (Basel) Article The Internet of Vehicles (IoV) enables vehicular data services and applications through vehicle-to-everything (V2X) communications. One of the key services provided by IoV is popular content distribution (PCD), which aims to quickly deliver popular content that most vehicles request. However, it is challenging for vehicles to receive the complete popular content from roadside units (RSUs) due to their mobility and the RSUs’ constrained coverage. The collaboration of vehicles via vehicle-to-vehicle (V2V) communications is an effective solution to assist more vehicles to obtain the entire popular content at a lower time cost. To this end, we propose a multi-agent deep reinforcement learning (MADRL)-based popular content distribution scheme in vehicular networks, where each vehicle deploys an MADRL agent that learns to choose the appropriate data transmission policy. To reduce the complexity of the MADRL-based algorithm, a vehicle clustering algorithm based on spectral clustering is provided to divide all vehicles in the V2V phase into groups, so that only vehicles within the same group exchange data. Then the multi-agent proximal policy optimization (MAPPO) algorithm is used to train the agent. We introduce the self-attention mechanism when constructing the neural network for the MADRL to help the agent accurately represent the environment and make decisions. Furthermore, the invalid action masking technique is utilized to prevent the agent from taking invalid actions, accelerating the training process of the agent. Finally, experimental results are shown and a comprehensive comparison is provided, which demonstrates that our MADRL-PCD scheme outperforms both the coalition game-based scheme and the greedy strategy-based scheme, achieving a higher PCD efficiency and a lower transmission delay. MDPI 2023-05-12 /pmc/articles/PMC10216958/ /pubmed/37238547 http://dx.doi.org/10.3390/e25050792 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Chen, Wenwei Huang, Xiujie Guan, Quanlong Zhao, Shancheng A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks |
title | A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks |
title_full | A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks |
title_fullStr | A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks |
title_full_unstemmed | A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks |
title_short | A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks |
title_sort | multi-agent deep reinforcement learning-based popular content distribution scheme in vehicular networks |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10216958/ https://www.ncbi.nlm.nih.gov/pubmed/37238547 http://dx.doi.org/10.3390/e25050792 |
work_keys_str_mv | AT chenwenwei amultiagentdeepreinforcementlearningbasedpopularcontentdistributionschemeinvehicularnetworks AT huangxiujie amultiagentdeepreinforcementlearningbasedpopularcontentdistributionschemeinvehicularnetworks AT guanquanlong amultiagentdeepreinforcementlearningbasedpopularcontentdistributionschemeinvehicularnetworks AT zhaoshancheng amultiagentdeepreinforcementlearningbasedpopularcontentdistributionschemeinvehicularnetworks AT chenwenwei multiagentdeepreinforcementlearningbasedpopularcontentdistributionschemeinvehicularnetworks AT huangxiujie multiagentdeepreinforcementlearningbasedpopularcontentdistributionschemeinvehicularnetworks AT guanquanlong multiagentdeepreinforcementlearningbasedpopularcontentdistributionschemeinvehicularnetworks AT zhaoshancheng multiagentdeepreinforcementlearningbasedpopularcontentdistributionschemeinvehicularnetworks |