Cargando…
Brain Cancer Prediction Based on Novel Interpretable Ensemble Gene Selection Algorithm and Classifier
The growth of abnormal cells in the brain causes human brain tumors. Identifying the type of tumor is crucial for the prognosis and treatment of the patient. Data from cancer microarrays typically include fewer samples with many gene expression levels as features, reflecting the curse of dimensional...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8535043/ https://www.ncbi.nlm.nih.gov/pubmed/34679634 http://dx.doi.org/10.3390/diagnostics11101936 |
_version_ | 1784587682270150656 |
---|---|
author | Almars, Abdulqader M. Alwateer, Majed Qaraad, Mohammed Amjad, Souad Fathi, Hanaa Kelany, Ayda K. Hussein, Nazar K. Elhosseini, Mostafa |
author_facet | Almars, Abdulqader M. Alwateer, Majed Qaraad, Mohammed Amjad, Souad Fathi, Hanaa Kelany, Ayda K. Hussein, Nazar K. Elhosseini, Mostafa |
author_sort | Almars, Abdulqader M. |
collection | PubMed |
description | The growth of abnormal cells in the brain causes human brain tumors. Identifying the type of tumor is crucial for the prognosis and treatment of the patient. Data from cancer microarrays typically include fewer samples with many gene expression levels as features, reflecting the curse of dimensionality and making classifying data from microarrays challenging. In most of the examined studies, cancer classification (Malignant and benign) accuracy was examined without disclosing biological information related to the classification process. A new approach was proposed to bridge the gap between cancer classification and the interpretation of the biological studies of the genes implicated in cancer. This study aims to develop a new hybrid model for cancer classification (by using feature selection mRMRe as a key step to improve the performance of classification methods and a distributed hyperparameter optimization for gradient boosting ensemble methods). To evaluate the proposed method, NB, RF, and SVM classifiers have been chosen. In terms of the AUC, sensitivity, and specificity, the optimized CatBoost classifier performed better than the optimized XGBoost in cross-validation 5, 6, 8, and 10. With an accuracy of [Formula: see text] , the optimized CatBoost classifier is more accurate than the CatBoost classifier without optimization, which is [Formula: see text]. By using hybrid algorithms, SVM, RF, and NB automatically become more accurate. Furthermore, in terms of accuracy, SVM and RF ([Formula: see text]) achieve equivalent and higher classification accuracy than NB ([Formula: see text]). The findings of relevant biomedical studies confirm the findings of the selected genes. |
format | Online Article Text |
id | pubmed-8535043 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-85350432021-10-23 Brain Cancer Prediction Based on Novel Interpretable Ensemble Gene Selection Algorithm and Classifier Almars, Abdulqader M. Alwateer, Majed Qaraad, Mohammed Amjad, Souad Fathi, Hanaa Kelany, Ayda K. Hussein, Nazar K. Elhosseini, Mostafa Diagnostics (Basel) Article The growth of abnormal cells in the brain causes human brain tumors. Identifying the type of tumor is crucial for the prognosis and treatment of the patient. Data from cancer microarrays typically include fewer samples with many gene expression levels as features, reflecting the curse of dimensionality and making classifying data from microarrays challenging. In most of the examined studies, cancer classification (Malignant and benign) accuracy was examined without disclosing biological information related to the classification process. A new approach was proposed to bridge the gap between cancer classification and the interpretation of the biological studies of the genes implicated in cancer. This study aims to develop a new hybrid model for cancer classification (by using feature selection mRMRe as a key step to improve the performance of classification methods and a distributed hyperparameter optimization for gradient boosting ensemble methods). To evaluate the proposed method, NB, RF, and SVM classifiers have been chosen. In terms of the AUC, sensitivity, and specificity, the optimized CatBoost classifier performed better than the optimized XGBoost in cross-validation 5, 6, 8, and 10. With an accuracy of [Formula: see text] , the optimized CatBoost classifier is more accurate than the CatBoost classifier without optimization, which is [Formula: see text]. By using hybrid algorithms, SVM, RF, and NB automatically become more accurate. Furthermore, in terms of accuracy, SVM and RF ([Formula: see text]) achieve equivalent and higher classification accuracy than NB ([Formula: see text]). The findings of relevant biomedical studies confirm the findings of the selected genes. MDPI 2021-10-19 /pmc/articles/PMC8535043/ /pubmed/34679634 http://dx.doi.org/10.3390/diagnostics11101936 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Almars, Abdulqader M. Alwateer, Majed Qaraad, Mohammed Amjad, Souad Fathi, Hanaa Kelany, Ayda K. Hussein, Nazar K. Elhosseini, Mostafa Brain Cancer Prediction Based on Novel Interpretable Ensemble Gene Selection Algorithm and Classifier |
title | Brain Cancer Prediction Based on Novel Interpretable Ensemble Gene Selection Algorithm and Classifier |
title_full | Brain Cancer Prediction Based on Novel Interpretable Ensemble Gene Selection Algorithm and Classifier |
title_fullStr | Brain Cancer Prediction Based on Novel Interpretable Ensemble Gene Selection Algorithm and Classifier |
title_full_unstemmed | Brain Cancer Prediction Based on Novel Interpretable Ensemble Gene Selection Algorithm and Classifier |
title_short | Brain Cancer Prediction Based on Novel Interpretable Ensemble Gene Selection Algorithm and Classifier |
title_sort | brain cancer prediction based on novel interpretable ensemble gene selection algorithm and classifier |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8535043/ https://www.ncbi.nlm.nih.gov/pubmed/34679634 http://dx.doi.org/10.3390/diagnostics11101936 |
work_keys_str_mv | AT almarsabdulqaderm braincancerpredictionbasedonnovelinterpretableensemblegeneselectionalgorithmandclassifier AT alwateermajed braincancerpredictionbasedonnovelinterpretableensemblegeneselectionalgorithmandclassifier AT qaraadmohammed braincancerpredictionbasedonnovelinterpretableensemblegeneselectionalgorithmandclassifier AT amjadsouad braincancerpredictionbasedonnovelinterpretableensemblegeneselectionalgorithmandclassifier AT fathihanaa braincancerpredictionbasedonnovelinterpretableensemblegeneselectionalgorithmandclassifier AT kelanyaydak braincancerpredictionbasedonnovelinterpretableensemblegeneselectionalgorithmandclassifier AT husseinnazark braincancerpredictionbasedonnovelinterpretableensemblegeneselectionalgorithmandclassifier AT elhosseinimostafa braincancerpredictionbasedonnovelinterpretableensemblegeneselectionalgorithmandclassifier |