Cargando…

A Multi-Clustering Algorithm to Solve Driving Cycle Prediction Problems Based on Unbalanced Data Sets: A Chinese Case Study

Vehicle evaluation parameters, which are increasingly of concern for governments and consumers, quantify performance indicators, such as vehicle performance, emissions, and driving experience to help guide consumers in purchasing cars. While past approaches for driving cycle prediction have been pro...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Yuewei, Zhang, Wutong, Zhang, Long, Qiao, Yuanyuan, Yang, Jie, Cheng, Cheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7248886/
https://www.ncbi.nlm.nih.gov/pubmed/32344855
http://dx.doi.org/10.3390/s20092448
_version_ 1783538474552393728
author Wu, Yuewei
Zhang, Wutong
Zhang, Long
Qiao, Yuanyuan
Yang, Jie
Cheng, Cheng
author_facet Wu, Yuewei
Zhang, Wutong
Zhang, Long
Qiao, Yuanyuan
Yang, Jie
Cheng, Cheng
author_sort Wu, Yuewei
collection PubMed
description Vehicle evaluation parameters, which are increasingly of concern for governments and consumers, quantify performance indicators, such as vehicle performance, emissions, and driving experience to help guide consumers in purchasing cars. While past approaches for driving cycle prediction have been proven effective and used in many countries, these algorithms are difficult to use in China with its complex traffic environment and increasingly high frequency of traffic jams. Meanwhile, we found that the vehicle dataset used by the driving cycle prediction problem is usually unbalanced in real cases, which means that there are more medium and high speed samples and very few samples at low and ultra-high speeds. If the ordinary clustering algorithm is directly applied to the unbalanced data, it will have a huge impact on the performance to build driving cycle maps, and the parameters of the map will deviate considerable from actual ones. In order to address these issues, this paper propose a novel driving cycle map algorithm framework based on an ensemble learning method named multi-clustering algorithm, to improve the performance of traditional clustering algorithms on unbalanced data sets. It is noteworthy that our model framework can be easily extended to other complicated structure areas due to its flexible modular design and parameter configuration. Finally, we tested our method based on actual traffic data generated in Fujian Province in China. The results prove the multi-clustering algorithm has excellent performance on our dataset.
format Online
Article
Text
id pubmed-7248886
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-72488862020-06-10 A Multi-Clustering Algorithm to Solve Driving Cycle Prediction Problems Based on Unbalanced Data Sets: A Chinese Case Study Wu, Yuewei Zhang, Wutong Zhang, Long Qiao, Yuanyuan Yang, Jie Cheng, Cheng Sensors (Basel) Article Vehicle evaluation parameters, which are increasingly of concern for governments and consumers, quantify performance indicators, such as vehicle performance, emissions, and driving experience to help guide consumers in purchasing cars. While past approaches for driving cycle prediction have been proven effective and used in many countries, these algorithms are difficult to use in China with its complex traffic environment and increasingly high frequency of traffic jams. Meanwhile, we found that the vehicle dataset used by the driving cycle prediction problem is usually unbalanced in real cases, which means that there are more medium and high speed samples and very few samples at low and ultra-high speeds. If the ordinary clustering algorithm is directly applied to the unbalanced data, it will have a huge impact on the performance to build driving cycle maps, and the parameters of the map will deviate considerable from actual ones. In order to address these issues, this paper propose a novel driving cycle map algorithm framework based on an ensemble learning method named multi-clustering algorithm, to improve the performance of traditional clustering algorithms on unbalanced data sets. It is noteworthy that our model framework can be easily extended to other complicated structure areas due to its flexible modular design and parameter configuration. Finally, we tested our method based on actual traffic data generated in Fujian Province in China. The results prove the multi-clustering algorithm has excellent performance on our dataset. MDPI 2020-04-25 /pmc/articles/PMC7248886/ /pubmed/32344855 http://dx.doi.org/10.3390/s20092448 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wu, Yuewei
Zhang, Wutong
Zhang, Long
Qiao, Yuanyuan
Yang, Jie
Cheng, Cheng
A Multi-Clustering Algorithm to Solve Driving Cycle Prediction Problems Based on Unbalanced Data Sets: A Chinese Case Study
title A Multi-Clustering Algorithm to Solve Driving Cycle Prediction Problems Based on Unbalanced Data Sets: A Chinese Case Study
title_full A Multi-Clustering Algorithm to Solve Driving Cycle Prediction Problems Based on Unbalanced Data Sets: A Chinese Case Study
title_fullStr A Multi-Clustering Algorithm to Solve Driving Cycle Prediction Problems Based on Unbalanced Data Sets: A Chinese Case Study
title_full_unstemmed A Multi-Clustering Algorithm to Solve Driving Cycle Prediction Problems Based on Unbalanced Data Sets: A Chinese Case Study
title_short A Multi-Clustering Algorithm to Solve Driving Cycle Prediction Problems Based on Unbalanced Data Sets: A Chinese Case Study
title_sort multi-clustering algorithm to solve driving cycle prediction problems based on unbalanced data sets: a chinese case study
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7248886/
https://www.ncbi.nlm.nih.gov/pubmed/32344855
http://dx.doi.org/10.3390/s20092448
work_keys_str_mv AT wuyuewei amulticlusteringalgorithmtosolvedrivingcyclepredictionproblemsbasedonunbalanceddatasetsachinesecasestudy
AT zhangwutong amulticlusteringalgorithmtosolvedrivingcyclepredictionproblemsbasedonunbalanceddatasetsachinesecasestudy
AT zhanglong amulticlusteringalgorithmtosolvedrivingcyclepredictionproblemsbasedonunbalanceddatasetsachinesecasestudy
AT qiaoyuanyuan amulticlusteringalgorithmtosolvedrivingcyclepredictionproblemsbasedonunbalanceddatasetsachinesecasestudy
AT yangjie amulticlusteringalgorithmtosolvedrivingcyclepredictionproblemsbasedonunbalanceddatasetsachinesecasestudy
AT chengcheng amulticlusteringalgorithmtosolvedrivingcyclepredictionproblemsbasedonunbalanceddatasetsachinesecasestudy
AT wuyuewei multiclusteringalgorithmtosolvedrivingcyclepredictionproblemsbasedonunbalanceddatasetsachinesecasestudy
AT zhangwutong multiclusteringalgorithmtosolvedrivingcyclepredictionproblemsbasedonunbalanceddatasetsachinesecasestudy
AT zhanglong multiclusteringalgorithmtosolvedrivingcyclepredictionproblemsbasedonunbalanceddatasetsachinesecasestudy
AT qiaoyuanyuan multiclusteringalgorithmtosolvedrivingcyclepredictionproblemsbasedonunbalanceddatasetsachinesecasestudy
AT yangjie multiclusteringalgorithmtosolvedrivingcyclepredictionproblemsbasedonunbalanceddatasetsachinesecasestudy
AT chengcheng multiclusteringalgorithmtosolvedrivingcyclepredictionproblemsbasedonunbalanceddatasetsachinesecasestudy