Cargando…
Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble
Ensemble learning combines multiple learners to perform combinatorial learning, which has advantages of good flexibility and higher generalization performance. To achieve higher quality cancer classification, in this study, the fast correlation-based feature selection (FCBF) method was used to prepr...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8093037/ https://www.ncbi.nlm.nih.gov/pubmed/33986823 http://dx.doi.org/10.1155/2021/5556992 |
_version_ | 1783687729914052608 |
---|---|
author | Xiong, Yueling Ye, Mingquan Wu, Changrong |
author_facet | Xiong, Yueling Ye, Mingquan Wu, Changrong |
author_sort | Xiong, Yueling |
collection | PubMed |
description | Ensemble learning combines multiple learners to perform combinatorial learning, which has advantages of good flexibility and higher generalization performance. To achieve higher quality cancer classification, in this study, the fast correlation-based feature selection (FCBF) method was used to preprocess the data to eliminate irrelevant and redundant features. Then, the classification was carried out in the stacking ensemble learner. A library for support vector machine (LIBSVM), K-nearest neighbor (KNN), decision tree C4.5 (C4.5), and random forest (RF) were used as the primary learners of the stacking ensemble. Given the imbalanced characteristics of cancer gene expression data, the embedding cost-sensitive naive Bayes was used as the metalearner of the stacking ensemble, which was represented as CSNB stacking. The proposed CSNB stacking method was applied to nine cancer datasets to further verify the classification performance of the model. Compared with other classification methods, such as single classifier algorithms and ensemble algorithms, the experimental results showed the effectiveness and robustness of the proposed method in processing different types of cancer data. This method may therefore help guide cancer diagnosis and research. |
format | Online Article Text |
id | pubmed-8093037 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-80930372021-05-12 Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble Xiong, Yueling Ye, Mingquan Wu, Changrong Comput Math Methods Med Research Article Ensemble learning combines multiple learners to perform combinatorial learning, which has advantages of good flexibility and higher generalization performance. To achieve higher quality cancer classification, in this study, the fast correlation-based feature selection (FCBF) method was used to preprocess the data to eliminate irrelevant and redundant features. Then, the classification was carried out in the stacking ensemble learner. A library for support vector machine (LIBSVM), K-nearest neighbor (KNN), decision tree C4.5 (C4.5), and random forest (RF) were used as the primary learners of the stacking ensemble. Given the imbalanced characteristics of cancer gene expression data, the embedding cost-sensitive naive Bayes was used as the metalearner of the stacking ensemble, which was represented as CSNB stacking. The proposed CSNB stacking method was applied to nine cancer datasets to further verify the classification performance of the model. Compared with other classification methods, such as single classifier algorithms and ensemble algorithms, the experimental results showed the effectiveness and robustness of the proposed method in processing different types of cancer data. This method may therefore help guide cancer diagnosis and research. Hindawi 2021-04-26 /pmc/articles/PMC8093037/ /pubmed/33986823 http://dx.doi.org/10.1155/2021/5556992 Text en Copyright © 2021 Yueling Xiong et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Xiong, Yueling Ye, Mingquan Wu, Changrong Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble |
title | Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble |
title_full | Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble |
title_fullStr | Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble |
title_full_unstemmed | Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble |
title_short | Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble |
title_sort | cancer classification with a cost-sensitive naive bayes stacking ensemble |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8093037/ https://www.ncbi.nlm.nih.gov/pubmed/33986823 http://dx.doi.org/10.1155/2021/5556992 |
work_keys_str_mv | AT xiongyueling cancerclassificationwithacostsensitivenaivebayesstackingensemble AT yemingquan cancerclassificationwithacostsensitivenaivebayesstackingensemble AT wuchangrong cancerclassificationwithacostsensitivenaivebayesstackingensemble |