Cargando…
Cost-Sensitive Classification for Evolving Data Streams with Concept Drift and Class Imbalance
Class imbalance and concept drift are two primary principles that exist concurrently in data stream classification. Although the two issues have drawn enough attention separately, the joint treatment largely remains unexplored. Moreover, the class imbalance issue is further complicated if data strea...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8352686/ https://www.ncbi.nlm.nih.gov/pubmed/34381499 http://dx.doi.org/10.1155/2021/8813806 |
_version_ | 1783736237205487616 |
---|---|
author | Sun, Yange Li, Meng Li, Lei Shao, Han Sun, Yi |
author_facet | Sun, Yange Li, Meng Li, Lei Shao, Han Sun, Yi |
author_sort | Sun, Yange |
collection | PubMed |
description | Class imbalance and concept drift are two primary principles that exist concurrently in data stream classification. Although the two issues have drawn enough attention separately, the joint treatment largely remains unexplored. Moreover, the class imbalance issue is further complicated if data streams with concept drift. A novel Cost-Sensitive based Data Stream (CSDS) classification is introduced to overcome the two issues simultaneously. The CSDS considers cost information during the procedures of data preprocessing and classification. During the data preprocessing, a cost-sensitive learning strategy is introduced into the ReliefF algorithm for alleviating the class imbalance at the data level. In the classification process, a cost-sensitive weighting schema is devised to enhance the overall performance of the ensemble. Besides, a change detection mechanism is embedded in our algorithm, which guarantees that an ensemble can capture and react to drift promptly. Experimental results validate that our method can obtain better classification results under different imbalanced concept drifting data stream scenarios. |
format | Online Article Text |
id | pubmed-8352686 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-83526862021-08-10 Cost-Sensitive Classification for Evolving Data Streams with Concept Drift and Class Imbalance Sun, Yange Li, Meng Li, Lei Shao, Han Sun, Yi Comput Intell Neurosci Research Article Class imbalance and concept drift are two primary principles that exist concurrently in data stream classification. Although the two issues have drawn enough attention separately, the joint treatment largely remains unexplored. Moreover, the class imbalance issue is further complicated if data streams with concept drift. A novel Cost-Sensitive based Data Stream (CSDS) classification is introduced to overcome the two issues simultaneously. The CSDS considers cost information during the procedures of data preprocessing and classification. During the data preprocessing, a cost-sensitive learning strategy is introduced into the ReliefF algorithm for alleviating the class imbalance at the data level. In the classification process, a cost-sensitive weighting schema is devised to enhance the overall performance of the ensemble. Besides, a change detection mechanism is embedded in our algorithm, which guarantees that an ensemble can capture and react to drift promptly. Experimental results validate that our method can obtain better classification results under different imbalanced concept drifting data stream scenarios. Hindawi 2021-08-02 /pmc/articles/PMC8352686/ /pubmed/34381499 http://dx.doi.org/10.1155/2021/8813806 Text en Copyright © 2021 Yange Sun et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Sun, Yange Li, Meng Li, Lei Shao, Han Sun, Yi Cost-Sensitive Classification for Evolving Data Streams with Concept Drift and Class Imbalance |
title | Cost-Sensitive Classification for Evolving Data Streams with Concept Drift and Class Imbalance |
title_full | Cost-Sensitive Classification for Evolving Data Streams with Concept Drift and Class Imbalance |
title_fullStr | Cost-Sensitive Classification for Evolving Data Streams with Concept Drift and Class Imbalance |
title_full_unstemmed | Cost-Sensitive Classification for Evolving Data Streams with Concept Drift and Class Imbalance |
title_short | Cost-Sensitive Classification for Evolving Data Streams with Concept Drift and Class Imbalance |
title_sort | cost-sensitive classification for evolving data streams with concept drift and class imbalance |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8352686/ https://www.ncbi.nlm.nih.gov/pubmed/34381499 http://dx.doi.org/10.1155/2021/8813806 |
work_keys_str_mv | AT sunyange costsensitiveclassificationforevolvingdatastreamswithconceptdriftandclassimbalance AT limeng costsensitiveclassificationforevolvingdatastreamswithconceptdriftandclassimbalance AT lilei costsensitiveclassificationforevolvingdatastreamswithconceptdriftandclassimbalance AT shaohan costsensitiveclassificationforevolvingdatastreamswithconceptdriftandclassimbalance AT sunyi costsensitiveclassificationforevolvingdatastreamswithconceptdriftandclassimbalance |