Cargando…

Cost-Sensitive Classification for Evolving Data Streams with Concept Drift and Class Imbalance

Class imbalance and concept drift are two primary principles that exist concurrently in data stream classification. Although the two issues have drawn enough attention separately, the joint treatment largely remains unexplored. Moreover, the class imbalance issue is further complicated if data strea...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Yange, Li, Meng, Li, Lei, Shao, Han, Sun, Yi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8352686/
https://www.ncbi.nlm.nih.gov/pubmed/34381499
http://dx.doi.org/10.1155/2021/8813806
_version_ 1783736237205487616
author Sun, Yange
Li, Meng
Li, Lei
Shao, Han
Sun, Yi
author_facet Sun, Yange
Li, Meng
Li, Lei
Shao, Han
Sun, Yi
author_sort Sun, Yange
collection PubMed
description Class imbalance and concept drift are two primary principles that exist concurrently in data stream classification. Although the two issues have drawn enough attention separately, the joint treatment largely remains unexplored. Moreover, the class imbalance issue is further complicated if data streams with concept drift. A novel Cost-Sensitive based Data Stream (CSDS) classification is introduced to overcome the two issues simultaneously. The CSDS considers cost information during the procedures of data preprocessing and classification. During the data preprocessing, a cost-sensitive learning strategy is introduced into the ReliefF algorithm for alleviating the class imbalance at the data level. In the classification process, a cost-sensitive weighting schema is devised to enhance the overall performance of the ensemble. Besides, a change detection mechanism is embedded in our algorithm, which guarantees that an ensemble can capture and react to drift promptly. Experimental results validate that our method can obtain better classification results under different imbalanced concept drifting data stream scenarios.
format Online
Article
Text
id pubmed-8352686
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-83526862021-08-10 Cost-Sensitive Classification for Evolving Data Streams with Concept Drift and Class Imbalance Sun, Yange Li, Meng Li, Lei Shao, Han Sun, Yi Comput Intell Neurosci Research Article Class imbalance and concept drift are two primary principles that exist concurrently in data stream classification. Although the two issues have drawn enough attention separately, the joint treatment largely remains unexplored. Moreover, the class imbalance issue is further complicated if data streams with concept drift. A novel Cost-Sensitive based Data Stream (CSDS) classification is introduced to overcome the two issues simultaneously. The CSDS considers cost information during the procedures of data preprocessing and classification. During the data preprocessing, a cost-sensitive learning strategy is introduced into the ReliefF algorithm for alleviating the class imbalance at the data level. In the classification process, a cost-sensitive weighting schema is devised to enhance the overall performance of the ensemble. Besides, a change detection mechanism is embedded in our algorithm, which guarantees that an ensemble can capture and react to drift promptly. Experimental results validate that our method can obtain better classification results under different imbalanced concept drifting data stream scenarios. Hindawi 2021-08-02 /pmc/articles/PMC8352686/ /pubmed/34381499 http://dx.doi.org/10.1155/2021/8813806 Text en Copyright © 2021 Yange Sun et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Sun, Yange
Li, Meng
Li, Lei
Shao, Han
Sun, Yi
Cost-Sensitive Classification for Evolving Data Streams with Concept Drift and Class Imbalance
title Cost-Sensitive Classification for Evolving Data Streams with Concept Drift and Class Imbalance
title_full Cost-Sensitive Classification for Evolving Data Streams with Concept Drift and Class Imbalance
title_fullStr Cost-Sensitive Classification for Evolving Data Streams with Concept Drift and Class Imbalance
title_full_unstemmed Cost-Sensitive Classification for Evolving Data Streams with Concept Drift and Class Imbalance
title_short Cost-Sensitive Classification for Evolving Data Streams with Concept Drift and Class Imbalance
title_sort cost-sensitive classification for evolving data streams with concept drift and class imbalance
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8352686/
https://www.ncbi.nlm.nih.gov/pubmed/34381499
http://dx.doi.org/10.1155/2021/8813806
work_keys_str_mv AT sunyange costsensitiveclassificationforevolvingdatastreamswithconceptdriftandclassimbalance
AT limeng costsensitiveclassificationforevolvingdatastreamswithconceptdriftandclassimbalance
AT lilei costsensitiveclassificationforevolvingdatastreamswithconceptdriftandclassimbalance
AT shaohan costsensitiveclassificationforevolvingdatastreamswithconceptdriftandclassimbalance
AT sunyi costsensitiveclassificationforevolvingdatastreamswithconceptdriftandclassimbalance