Cargando…

Improved PSO_AdaBoost Ensemble Algorithm for Imbalanced Data

The Adaptive Boosting (AdaBoost) algorithm is a widely used ensemble learning framework, and it can get good classification results on general datasets. However, it is challenging to apply the AdaBoost algorithm directly to imbalanced data since it is designed mainly for processing misclassified sam...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Kewen, Zhou, Guangyue, Zhai, Jiannan, Li, Fulai, Shao, Mingwen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6471212/
https://www.ncbi.nlm.nih.gov/pubmed/30917599
http://dx.doi.org/10.3390/s19061476
_version_ 1783411977029156864
author Li, Kewen
Zhou, Guangyue
Zhai, Jiannan
Li, Fulai
Shao, Mingwen
author_facet Li, Kewen
Zhou, Guangyue
Zhai, Jiannan
Li, Fulai
Shao, Mingwen
author_sort Li, Kewen
collection PubMed
description The Adaptive Boosting (AdaBoost) algorithm is a widely used ensemble learning framework, and it can get good classification results on general datasets. However, it is challenging to apply the AdaBoost algorithm directly to imbalanced data since it is designed mainly for processing misclassified samples rather than samples of minority classes. To better process imbalanced data, this paper introduces the indicator Area Under Curve (AUC) which can reflect the comprehensive performance of the model, and proposes an improved AdaBoost algorithm based on AUC (AdaBoost-A) which improves the error calculation performance of the AdaBoost algorithm by comprehensively considering the effects of misclassification probability and AUC. To prevent redundant or useless weak classifiers the traditional AdaBoost algorithm generated from consuming too much system resources, this paper proposes an ensemble algorithm, PSOPD-AdaBoost-A, which can re-initialize parameters to avoid falling into local optimum, and optimize the coefficients of AdaBoost weak classifiers. Experiment results show that the proposed algorithm is effective for processing imbalanced data, especially the data with relatively high imbalances.
format Online
Article
Text
id pubmed-6471212
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-64712122019-04-26 Improved PSO_AdaBoost Ensemble Algorithm for Imbalanced Data Li, Kewen Zhou, Guangyue Zhai, Jiannan Li, Fulai Shao, Mingwen Sensors (Basel) Article The Adaptive Boosting (AdaBoost) algorithm is a widely used ensemble learning framework, and it can get good classification results on general datasets. However, it is challenging to apply the AdaBoost algorithm directly to imbalanced data since it is designed mainly for processing misclassified samples rather than samples of minority classes. To better process imbalanced data, this paper introduces the indicator Area Under Curve (AUC) which can reflect the comprehensive performance of the model, and proposes an improved AdaBoost algorithm based on AUC (AdaBoost-A) which improves the error calculation performance of the AdaBoost algorithm by comprehensively considering the effects of misclassification probability and AUC. To prevent redundant or useless weak classifiers the traditional AdaBoost algorithm generated from consuming too much system resources, this paper proposes an ensemble algorithm, PSOPD-AdaBoost-A, which can re-initialize parameters to avoid falling into local optimum, and optimize the coefficients of AdaBoost weak classifiers. Experiment results show that the proposed algorithm is effective for processing imbalanced data, especially the data with relatively high imbalances. MDPI 2019-03-26 /pmc/articles/PMC6471212/ /pubmed/30917599 http://dx.doi.org/10.3390/s19061476 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Li, Kewen
Zhou, Guangyue
Zhai, Jiannan
Li, Fulai
Shao, Mingwen
Improved PSO_AdaBoost Ensemble Algorithm for Imbalanced Data
title Improved PSO_AdaBoost Ensemble Algorithm for Imbalanced Data
title_full Improved PSO_AdaBoost Ensemble Algorithm for Imbalanced Data
title_fullStr Improved PSO_AdaBoost Ensemble Algorithm for Imbalanced Data
title_full_unstemmed Improved PSO_AdaBoost Ensemble Algorithm for Imbalanced Data
title_short Improved PSO_AdaBoost Ensemble Algorithm for Imbalanced Data
title_sort improved pso_adaboost ensemble algorithm for imbalanced data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6471212/
https://www.ncbi.nlm.nih.gov/pubmed/30917599
http://dx.doi.org/10.3390/s19061476
work_keys_str_mv AT likewen improvedpsoadaboostensemblealgorithmforimbalanceddata
AT zhouguangyue improvedpsoadaboostensemblealgorithmforimbalanceddata
AT zhaijiannan improvedpsoadaboostensemblealgorithmforimbalanceddata
AT lifulai improvedpsoadaboostensemblealgorithmforimbalanceddata
AT shaomingwen improvedpsoadaboostensemblealgorithmforimbalanceddata