Cargando…

Brain Cancer Prediction Based on Novel Interpretable Ensemble Gene Selection Algorithm and Classifier

The growth of abnormal cells in the brain causes human brain tumors. Identifying the type of tumor is crucial for the prognosis and treatment of the patient. Data from cancer microarrays typically include fewer samples with many gene expression levels as features, reflecting the curse of dimensional...

Descripción completa

Detalles Bibliográficos
Autores principales: Almars, Abdulqader M., Alwateer, Majed, Qaraad, Mohammed, Amjad, Souad, Fathi, Hanaa, Kelany, Ayda K., Hussein, Nazar K., Elhosseini, Mostafa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8535043/
https://www.ncbi.nlm.nih.gov/pubmed/34679634
http://dx.doi.org/10.3390/diagnostics11101936
_version_ 1784587682270150656
author Almars, Abdulqader M.
Alwateer, Majed
Qaraad, Mohammed
Amjad, Souad
Fathi, Hanaa
Kelany, Ayda K.
Hussein, Nazar K.
Elhosseini, Mostafa
author_facet Almars, Abdulqader M.
Alwateer, Majed
Qaraad, Mohammed
Amjad, Souad
Fathi, Hanaa
Kelany, Ayda K.
Hussein, Nazar K.
Elhosseini, Mostafa
author_sort Almars, Abdulqader M.
collection PubMed
description The growth of abnormal cells in the brain causes human brain tumors. Identifying the type of tumor is crucial for the prognosis and treatment of the patient. Data from cancer microarrays typically include fewer samples with many gene expression levels as features, reflecting the curse of dimensionality and making classifying data from microarrays challenging. In most of the examined studies, cancer classification (Malignant and benign) accuracy was examined without disclosing biological information related to the classification process. A new approach was proposed to bridge the gap between cancer classification and the interpretation of the biological studies of the genes implicated in cancer. This study aims to develop a new hybrid model for cancer classification (by using feature selection mRMRe as a key step to improve the performance of classification methods and a distributed hyperparameter optimization for gradient boosting ensemble methods). To evaluate the proposed method, NB, RF, and SVM classifiers have been chosen. In terms of the AUC, sensitivity, and specificity, the optimized CatBoost classifier performed better than the optimized XGBoost in cross-validation 5, 6, 8, and 10. With an accuracy of [Formula: see text] , the optimized CatBoost classifier is more accurate than the CatBoost classifier without optimization, which is [Formula: see text]. By using hybrid algorithms, SVM, RF, and NB automatically become more accurate. Furthermore, in terms of accuracy, SVM and RF ([Formula: see text]) achieve equivalent and higher classification accuracy than NB ([Formula: see text]). The findings of relevant biomedical studies confirm the findings of the selected genes.
format Online
Article
Text
id pubmed-8535043
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-85350432021-10-23 Brain Cancer Prediction Based on Novel Interpretable Ensemble Gene Selection Algorithm and Classifier Almars, Abdulqader M. Alwateer, Majed Qaraad, Mohammed Amjad, Souad Fathi, Hanaa Kelany, Ayda K. Hussein, Nazar K. Elhosseini, Mostafa Diagnostics (Basel) Article The growth of abnormal cells in the brain causes human brain tumors. Identifying the type of tumor is crucial for the prognosis and treatment of the patient. Data from cancer microarrays typically include fewer samples with many gene expression levels as features, reflecting the curse of dimensionality and making classifying data from microarrays challenging. In most of the examined studies, cancer classification (Malignant and benign) accuracy was examined without disclosing biological information related to the classification process. A new approach was proposed to bridge the gap between cancer classification and the interpretation of the biological studies of the genes implicated in cancer. This study aims to develop a new hybrid model for cancer classification (by using feature selection mRMRe as a key step to improve the performance of classification methods and a distributed hyperparameter optimization for gradient boosting ensemble methods). To evaluate the proposed method, NB, RF, and SVM classifiers have been chosen. In terms of the AUC, sensitivity, and specificity, the optimized CatBoost classifier performed better than the optimized XGBoost in cross-validation 5, 6, 8, and 10. With an accuracy of [Formula: see text] , the optimized CatBoost classifier is more accurate than the CatBoost classifier without optimization, which is [Formula: see text]. By using hybrid algorithms, SVM, RF, and NB automatically become more accurate. Furthermore, in terms of accuracy, SVM and RF ([Formula: see text]) achieve equivalent and higher classification accuracy than NB ([Formula: see text]). The findings of relevant biomedical studies confirm the findings of the selected genes. MDPI 2021-10-19 /pmc/articles/PMC8535043/ /pubmed/34679634 http://dx.doi.org/10.3390/diagnostics11101936 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Almars, Abdulqader M.
Alwateer, Majed
Qaraad, Mohammed
Amjad, Souad
Fathi, Hanaa
Kelany, Ayda K.
Hussein, Nazar K.
Elhosseini, Mostafa
Brain Cancer Prediction Based on Novel Interpretable Ensemble Gene Selection Algorithm and Classifier
title Brain Cancer Prediction Based on Novel Interpretable Ensemble Gene Selection Algorithm and Classifier
title_full Brain Cancer Prediction Based on Novel Interpretable Ensemble Gene Selection Algorithm and Classifier
title_fullStr Brain Cancer Prediction Based on Novel Interpretable Ensemble Gene Selection Algorithm and Classifier
title_full_unstemmed Brain Cancer Prediction Based on Novel Interpretable Ensemble Gene Selection Algorithm and Classifier
title_short Brain Cancer Prediction Based on Novel Interpretable Ensemble Gene Selection Algorithm and Classifier
title_sort brain cancer prediction based on novel interpretable ensemble gene selection algorithm and classifier
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8535043/
https://www.ncbi.nlm.nih.gov/pubmed/34679634
http://dx.doi.org/10.3390/diagnostics11101936
work_keys_str_mv AT almarsabdulqaderm braincancerpredictionbasedonnovelinterpretableensemblegeneselectionalgorithmandclassifier
AT alwateermajed braincancerpredictionbasedonnovelinterpretableensemblegeneselectionalgorithmandclassifier
AT qaraadmohammed braincancerpredictionbasedonnovelinterpretableensemblegeneselectionalgorithmandclassifier
AT amjadsouad braincancerpredictionbasedonnovelinterpretableensemblegeneselectionalgorithmandclassifier
AT fathihanaa braincancerpredictionbasedonnovelinterpretableensemblegeneselectionalgorithmandclassifier
AT kelanyaydak braincancerpredictionbasedonnovelinterpretableensemblegeneselectionalgorithmandclassifier
AT husseinnazark braincancerpredictionbasedonnovelinterpretableensemblegeneselectionalgorithmandclassifier
AT elhosseinimostafa braincancerpredictionbasedonnovelinterpretableensemblegeneselectionalgorithmandclassifier