
A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks

The Internet of Vehicles (IoV) enables vehicular data services and applications through vehicle-to-everything (V2X) communications. One of the key services provided by IoV is popular content distribution (PCD), which aims to quickly deliver popular content that most vehicles request. However, it is...

Bibliographic Details
Main Authors:	Chen, Wenwei; Huang, Xiujie; Guan, Quanlong; Zhao, Shancheng
Format:	Online Article Text
Language:	English
Published:	MDPI 2023
Subjects:	Article
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10216958/ https://www.ncbi.nlm.nih.gov/pubmed/37238547 http://dx.doi.org/10.3390/e25050792

_version_	1785048420828839936
author	Chen, Wenwei Huang, Xiujie Guan, Quanlong Zhao, Shancheng
author_facet	Chen, Wenwei Huang, Xiujie Guan, Quanlong Zhao, Shancheng
author_sort	Chen, Wenwei
collection	PubMed
description	The Internet of Vehicles (IoV) enables vehicular data services and applications through vehicle-to-everything (V2X) communications. One of the key services provided by IoV is popular content distribution (PCD), which aims to quickly deliver popular content that most vehicles request. However, it is challenging for vehicles to receive the complete popular content from roadside units (RSUs) due to their mobility and the RSUs’ constrained coverage. The collaboration of vehicles via vehicle-to-vehicle (V2V) communications is an effective solution to assist more vehicles to obtain the entire popular content at a lower time cost. To this end, we propose a multi-agent deep reinforcement learning (MADRL)-based popular content distribution scheme in vehicular networks, where each vehicle deploys an MADRL agent that learns to choose the appropriate data transmission policy. To reduce the complexity of the MADRL-based algorithm, a vehicle clustering algorithm based on spectral clustering is provided to divide all vehicles in the V2V phase into groups, so that only vehicles within the same group exchange data. Then the multi-agent proximal policy optimization (MAPPO) algorithm is used to train the agent. We introduce the self-attention mechanism when constructing the neural network for the MADRL to help the agent accurately represent the environment and make decisions. Furthermore, the invalid action masking technique is utilized to prevent the agent from taking invalid actions, accelerating the training process of the agent. Finally, experimental results are shown and a comprehensive comparison is provided, which demonstrates that our MADRL-PCD scheme outperforms both the coalition game-based scheme and the greedy strategy-based scheme, achieving a higher PCD efficiency and a lower transmission delay.
format	Online Article Text
id	pubmed-10216958
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-102169582023-05-27 A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks Chen, Wenwei Huang, Xiujie Guan, Quanlong Zhao, Shancheng Entropy (Basel) Article The Internet of Vehicles (IoV) enables vehicular data services and applications through vehicle-to-everything (V2X) communications. One of the key services provided by IoV is popular content distribution (PCD), which aims to quickly deliver popular content that most vehicles request. However, it is challenging for vehicles to receive the complete popular content from roadside units (RSUs) due to their mobility and the RSUs’ constrained coverage. The collaboration of vehicles via vehicle-to-vehicle (V2V) communications is an effective solution to assist more vehicles to obtain the entire popular content at a lower time cost. To this end, we propose a multi-agent deep reinforcement learning (MADRL)-based popular content distribution scheme in vehicular networks, where each vehicle deploys an MADRL agent that learns to choose the appropriate data transmission policy. To reduce the complexity of the MADRL-based algorithm, a vehicle clustering algorithm based on spectral clustering is provided to divide all vehicles in the V2V phase into groups, so that only vehicles within the same group exchange data. Then the multi-agent proximal policy optimization (MAPPO) algorithm is used to train the agent. We introduce the self-attention mechanism when constructing the neural network for the MADRL to help the agent accurately represent the environment and make decisions. Furthermore, the invalid action masking technique is utilized to prevent the agent from taking invalid actions, accelerating the training process of the agent. 
Finally, experimental results are shown and a comprehensive comparison is provided, which demonstrates that our MADRL-PCD scheme outperforms both the coalition game-based scheme and the greedy strategy-based scheme, achieving a higher PCD efficiency and a lower transmission delay. MDPI 2023-05-12 /pmc/articles/PMC10216958/ /pubmed/37238547 http://dx.doi.org/10.3390/e25050792 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Chen, Wenwei Huang, Xiujie Guan, Quanlong Zhao, Shancheng A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks
title	A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks
title_full	A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks
title_fullStr	A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks
title_full_unstemmed	A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks
title_short	A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks
title_sort	multi-agent deep reinforcement learning-based popular content distribution scheme in vehicular networks
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10216958/ https://www.ncbi.nlm.nih.gov/pubmed/37238547 http://dx.doi.org/10.3390/e25050792
