Cargando…

A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks

The Internet of Vehicles (IoV) enables vehicular data services and applications through vehicle-to-everything (V2X) communications. One of the key services provided by IoV is popular content distribution (PCD), which aims to quickly deliver popular content that most vehicles request. However, it is...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Wenwei, Huang, Xiujie, Guan, Quanlong, Zhao, Shancheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10216958/
https://www.ncbi.nlm.nih.gov/pubmed/37238547
http://dx.doi.org/10.3390/e25050792
_version_ 1785048420828839936
author Chen, Wenwei
Huang, Xiujie
Guan, Quanlong
Zhao, Shancheng
author_facet Chen, Wenwei
Huang, Xiujie
Guan, Quanlong
Zhao, Shancheng
author_sort Chen, Wenwei
collection PubMed
description The Internet of Vehicles (IoV) enables vehicular data services and applications through vehicle-to-everything (V2X) communications. One of the key services provided by IoV is popular content distribution (PCD), which aims to quickly deliver popular content that most vehicles request. However, it is challenging for vehicles to receive the complete popular content from roadside units (RSUs) due to their mobility and the RSUs’ constrained coverage. The collaboration of vehicles via vehicle-to-vehicle (V2V) communications is an effective solution to assist more vehicles to obtain the entire popular content at a lower time cost. To this end, we propose a multi-agent deep reinforcement learning (MADRL)-based popular content distribution scheme in vehicular networks, where each vehicle deploys an MADRL agent that learns to choose the appropriate data transmission policy. To reduce the complexity of the MADRL-based algorithm, a vehicle clustering algorithm based on spectral clustering is provided to divide all vehicles in the V2V phase into groups, so that only vehicles within the same group exchange data. Then the multi-agent proximal policy optimization (MAPPO) algorithm is used to train the agent. We introduce the self-attention mechanism when constructing the neural network for the MADRL to help the agent accurately represent the environment and make decisions. Furthermore, the invalid action masking technique is utilized to prevent the agent from taking invalid actions, accelerating the training process of the agent. Finally, experimental results are shown and a comprehensive comparison is provided, which demonstrates that our MADRL-PCD scheme outperforms both the coalition game-based scheme and the greedy strategy-based scheme, achieving a higher PCD efficiency and a lower transmission delay.
format Online
Article
Text
id pubmed-10216958
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-102169582023-05-27 A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks Chen, Wenwei Huang, Xiujie Guan, Quanlong Zhao, Shancheng Entropy (Basel) Article The Internet of Vehicles (IoV) enables vehicular data services and applications through vehicle-to-everything (V2X) communications. One of the key services provided by IoV is popular content distribution (PCD), which aims to quickly deliver popular content that most vehicles request. However, it is challenging for vehicles to receive the complete popular content from roadside units (RSUs) due to their mobility and the RSUs’ constrained coverage. The collaboration of vehicles via vehicle-to-vehicle (V2V) communications is an effective solution to assist more vehicles to obtain the entire popular content at a lower time cost. To this end, we propose a multi-agent deep reinforcement learning (MADRL)-based popular content distribution scheme in vehicular networks, where each vehicle deploys an MADRL agent that learns to choose the appropriate data transmission policy. To reduce the complexity of the MADRL-based algorithm, a vehicle clustering algorithm based on spectral clustering is provided to divide all vehicles in the V2V phase into groups, so that only vehicles within the same group exchange data. Then the multi-agent proximal policy optimization (MAPPO) algorithm is used to train the agent. We introduce the self-attention mechanism when constructing the neural network for the MADRL to help the agent accurately represent the environment and make decisions. Furthermore, the invalid action masking technique is utilized to prevent the agent from taking invalid actions, accelerating the training process of the agent. Finally, experimental results are shown and a comprehensive comparison is provided, which demonstrates that our MADRL-PCD scheme outperforms both the coalition game-based scheme and the greedy strategy-based scheme, achieving a higher PCD efficiency and a lower transmission delay. MDPI 2023-05-12 /pmc/articles/PMC10216958/ /pubmed/37238547 http://dx.doi.org/10.3390/e25050792 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Chen, Wenwei
Huang, Xiujie
Guan, Quanlong
Zhao, Shancheng
A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks
title A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks
title_full A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks
title_fullStr A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks
title_full_unstemmed A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks
title_short A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks
title_sort multi-agent deep reinforcement learning-based popular content distribution scheme in vehicular networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10216958/
https://www.ncbi.nlm.nih.gov/pubmed/37238547
http://dx.doi.org/10.3390/e25050792
work_keys_str_mv AT chenwenwei amultiagentdeepreinforcementlearningbasedpopularcontentdistributionschemeinvehicularnetworks
AT huangxiujie amultiagentdeepreinforcementlearningbasedpopularcontentdistributionschemeinvehicularnetworks
AT guanquanlong amultiagentdeepreinforcementlearningbasedpopularcontentdistributionschemeinvehicularnetworks
AT zhaoshancheng amultiagentdeepreinforcementlearningbasedpopularcontentdistributionschemeinvehicularnetworks
AT chenwenwei multiagentdeepreinforcementlearningbasedpopularcontentdistributionschemeinvehicularnetworks
AT huangxiujie multiagentdeepreinforcementlearningbasedpopularcontentdistributionschemeinvehicularnetworks
AT guanquanlong multiagentdeepreinforcementlearningbasedpopularcontentdistributionschemeinvehicularnetworks
AT zhaoshancheng multiagentdeepreinforcementlearningbasedpopularcontentdistributionschemeinvehicularnetworks