Cargando…
Improved PSO_AdaBoost Ensemble Algorithm for Imbalanced Data
The Adaptive Boosting (AdaBoost) algorithm is a widely used ensemble learning framework, and it can get good classification results on general datasets. However, it is challenging to apply the AdaBoost algorithm directly to imbalanced data since it is designed mainly for processing misclassified sam...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6471212/ https://www.ncbi.nlm.nih.gov/pubmed/30917599 http://dx.doi.org/10.3390/s19061476 |
_version_ | 1783411977029156864 |
---|---|
author | Li, Kewen Zhou, Guangyue Zhai, Jiannan Li, Fulai Shao, Mingwen |
author_facet | Li, Kewen Zhou, Guangyue Zhai, Jiannan Li, Fulai Shao, Mingwen |
author_sort | Li, Kewen |
collection | PubMed |
description | The Adaptive Boosting (AdaBoost) algorithm is a widely used ensemble learning framework, and it can get good classification results on general datasets. However, it is challenging to apply the AdaBoost algorithm directly to imbalanced data since it is designed mainly for processing misclassified samples rather than samples of minority classes. To better process imbalanced data, this paper introduces the indicator Area Under Curve (AUC) which can reflect the comprehensive performance of the model, and proposes an improved AdaBoost algorithm based on AUC (AdaBoost-A) which improves the error calculation performance of the AdaBoost algorithm by comprehensively considering the effects of misclassification probability and AUC. To prevent redundant or useless weak classifiers the traditional AdaBoost algorithm generated from consuming too much system resources, this paper proposes an ensemble algorithm, PSOPD-AdaBoost-A, which can re-initialize parameters to avoid falling into local optimum, and optimize the coefficients of AdaBoost weak classifiers. Experiment results show that the proposed algorithm is effective for processing imbalanced data, especially the data with relatively high imbalances. |
format | Online Article Text |
id | pubmed-6471212 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-64712122019-04-26 Improved PSO_AdaBoost Ensemble Algorithm for Imbalanced Data Li, Kewen Zhou, Guangyue Zhai, Jiannan Li, Fulai Shao, Mingwen Sensors (Basel) Article The Adaptive Boosting (AdaBoost) algorithm is a widely used ensemble learning framework, and it can get good classification results on general datasets. However, it is challenging to apply the AdaBoost algorithm directly to imbalanced data since it is designed mainly for processing misclassified samples rather than samples of minority classes. To better process imbalanced data, this paper introduces the indicator Area Under Curve (AUC) which can reflect the comprehensive performance of the model, and proposes an improved AdaBoost algorithm based on AUC (AdaBoost-A) which improves the error calculation performance of the AdaBoost algorithm by comprehensively considering the effects of misclassification probability and AUC. To prevent redundant or useless weak classifiers the traditional AdaBoost algorithm generated from consuming too much system resources, this paper proposes an ensemble algorithm, PSOPD-AdaBoost-A, which can re-initialize parameters to avoid falling into local optimum, and optimize the coefficients of AdaBoost weak classifiers. Experiment results show that the proposed algorithm is effective for processing imbalanced data, especially the data with relatively high imbalances. MDPI 2019-03-26 /pmc/articles/PMC6471212/ /pubmed/30917599 http://dx.doi.org/10.3390/s19061476 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Li, Kewen Zhou, Guangyue Zhai, Jiannan Li, Fulai Shao, Mingwen Improved PSO_AdaBoost Ensemble Algorithm for Imbalanced Data |
title | Improved PSO_AdaBoost Ensemble Algorithm for Imbalanced Data |
title_full | Improved PSO_AdaBoost Ensemble Algorithm for Imbalanced Data |
title_fullStr | Improved PSO_AdaBoost Ensemble Algorithm for Imbalanced Data |
title_full_unstemmed | Improved PSO_AdaBoost Ensemble Algorithm for Imbalanced Data |
title_short | Improved PSO_AdaBoost Ensemble Algorithm for Imbalanced Data |
title_sort | improved pso_adaboost ensemble algorithm for imbalanced data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6471212/ https://www.ncbi.nlm.nih.gov/pubmed/30917599 http://dx.doi.org/10.3390/s19061476 |
work_keys_str_mv | AT likewen improvedpsoadaboostensemblealgorithmforimbalanceddata AT zhouguangyue improvedpsoadaboostensemblealgorithmforimbalanceddata AT zhaijiannan improvedpsoadaboostensemblealgorithmforimbalanceddata AT lifulai improvedpsoadaboostensemblealgorithmforimbalanceddata AT shaomingwen improvedpsoadaboostensemblealgorithmforimbalanceddata |