Cargando…

Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble

Ensemble learning combines multiple learners to perform combinatorial learning, which has advantages of good flexibility and higher generalization performance. To achieve higher quality cancer classification, in this study, the fast correlation-based feature selection (FCBF) method was used to prepr...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiong, Yueling, Ye, Mingquan, Wu, Changrong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8093037/
https://www.ncbi.nlm.nih.gov/pubmed/33986823
http://dx.doi.org/10.1155/2021/5556992
_version_ 1783687729914052608
author Xiong, Yueling
Ye, Mingquan
Wu, Changrong
author_facet Xiong, Yueling
Ye, Mingquan
Wu, Changrong
author_sort Xiong, Yueling
collection PubMed
description Ensemble learning combines multiple learners to perform combinatorial learning, which has advantages of good flexibility and higher generalization performance. To achieve higher quality cancer classification, in this study, the fast correlation-based feature selection (FCBF) method was used to preprocess the data to eliminate irrelevant and redundant features. Then, the classification was carried out in the stacking ensemble learner. A library for support vector machine (LIBSVM), K-nearest neighbor (KNN), decision tree C4.5 (C4.5), and random forest (RF) were used as the primary learners of the stacking ensemble. Given the imbalanced characteristics of cancer gene expression data, the embedding cost-sensitive naive Bayes was used as the metalearner of the stacking ensemble, which was represented as CSNB stacking. The proposed CSNB stacking method was applied to nine cancer datasets to further verify the classification performance of the model. Compared with other classification methods, such as single classifier algorithms and ensemble algorithms, the experimental results showed the effectiveness and robustness of the proposed method in processing different types of cancer data. This method may therefore help guide cancer diagnosis and research.
format Online
Article
Text
id pubmed-8093037
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-80930372021-05-12 Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble Xiong, Yueling Ye, Mingquan Wu, Changrong Comput Math Methods Med Research Article Ensemble learning combines multiple learners to perform combinatorial learning, which has advantages of good flexibility and higher generalization performance. To achieve higher quality cancer classification, in this study, the fast correlation-based feature selection (FCBF) method was used to preprocess the data to eliminate irrelevant and redundant features. Then, the classification was carried out in the stacking ensemble learner. A library for support vector machine (LIBSVM), K-nearest neighbor (KNN), decision tree C4.5 (C4.5), and random forest (RF) were used as the primary learners of the stacking ensemble. Given the imbalanced characteristics of cancer gene expression data, the embedding cost-sensitive naive Bayes was used as the metalearner of the stacking ensemble, which was represented as CSNB stacking. The proposed CSNB stacking method was applied to nine cancer datasets to further verify the classification performance of the model. Compared with other classification methods, such as single classifier algorithms and ensemble algorithms, the experimental results showed the effectiveness and robustness of the proposed method in processing different types of cancer data. This method may therefore help guide cancer diagnosis and research. Hindawi 2021-04-26 /pmc/articles/PMC8093037/ /pubmed/33986823 http://dx.doi.org/10.1155/2021/5556992 Text en Copyright © 2021 Yueling Xiong et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Xiong, Yueling
Ye, Mingquan
Wu, Changrong
Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble
title Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble
title_full Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble
title_fullStr Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble
title_full_unstemmed Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble
title_short Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble
title_sort cancer classification with a cost-sensitive naive bayes stacking ensemble
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8093037/
https://www.ncbi.nlm.nih.gov/pubmed/33986823
http://dx.doi.org/10.1155/2021/5556992
work_keys_str_mv AT xiongyueling cancerclassificationwithacostsensitivenaivebayesstackingensemble
AT yemingquan cancerclassificationwithacostsensitivenaivebayesstackingensemble
AT wuchangrong cancerclassificationwithacostsensitivenaivebayesstackingensemble