Cargando…

Diagnosing Coronary Artery Disease on the Basis of Hard Ensemble Voting Optimization

Background and Objectives: Recently, many studies have focused on the early diagnosis of coronary artery disease (CAD), which is one of the leading causes of cardiac-associated death worldwide. The effectiveness of the most important features influencing disease diagnosis determines the performance...

Descripción completa

Detalles Bibliográficos
Autores principales: Mohammedqasim, Hayder, Mohammedqasem, Roa’a, Ata, Oguz, Alyasin, Eman Ibrahim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9783937/
https://www.ncbi.nlm.nih.gov/pubmed/36556946
http://dx.doi.org/10.3390/medicina58121745
_version_ 1784857692933718016
author Mohammedqasim, Hayder
Mohammedqasem, Roa’a
Ata, Oguz
Alyasin, Eman Ibrahim
author_facet Mohammedqasim, Hayder
Mohammedqasem, Roa’a
Ata, Oguz
Alyasin, Eman Ibrahim
author_sort Mohammedqasim, Hayder
collection PubMed
description Background and Objectives: Recently, many studies have focused on the early diagnosis of coronary artery disease (CAD), which is one of the leading causes of cardiac-associated death worldwide. The effectiveness of the most important features influencing disease diagnosis determines the performance of machine learning systems that can allow for timely and accurate treatment. We performed a Hybrid ML framework based on hard ensemble voting optimization (HEVO) to classify patients with CAD using the Z-Alizadeh Sani dataset. All categorical features were converted to numerical forms, the synthetic minority oversampling technique (SMOTE) was employed to overcome imbalanced distribution between two classes in the dataset, and then, recursive feature elimination (RFE) with random forest (RF) was used to obtain the best subset of features. Materials and Methods: After solving the biased distribution in the CAD data set using the SMOTE method and finding the high correlation features that affected the classification of CAD patients. The performance of the proposed model was evaluated using grid search optimization, and the best hyperparameters were identified for developing four applications, namely, RF, AdaBoost, gradient-boosting, and extra trees based on an HEV classifier. Results: Five fold cross-validation experiments with the HEV classifier showed excellent prediction performance results with the 10 best balanced features obtained using SMOTE and feature selection. All evaluation metrics results reached > 98% with the HEV classifier, and the gradient-boosting model was the second best classification model with accuracy = 97% and F1-score = 98%. Conclusions: When compared to modern methods, the proposed method perform well in diagnosing coronary artery disease, and therefore, the proposed method can be used by medical personnel for supplementary therapy for timely, accurate, and efficient identification of CAD cases in suspected patients.
format Online
Article
Text
id pubmed-9783937
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-97839372022-12-24 Diagnosing Coronary Artery Disease on the Basis of Hard Ensemble Voting Optimization Mohammedqasim, Hayder Mohammedqasem, Roa’a Ata, Oguz Alyasin, Eman Ibrahim Medicina (Kaunas) Article Background and Objectives: Recently, many studies have focused on the early diagnosis of coronary artery disease (CAD), which is one of the leading causes of cardiac-associated death worldwide. The effectiveness of the most important features influencing disease diagnosis determines the performance of machine learning systems that can allow for timely and accurate treatment. We performed a Hybrid ML framework based on hard ensemble voting optimization (HEVO) to classify patients with CAD using the Z-Alizadeh Sani dataset. All categorical features were converted to numerical forms, the synthetic minority oversampling technique (SMOTE) was employed to overcome imbalanced distribution between two classes in the dataset, and then, recursive feature elimination (RFE) with random forest (RF) was used to obtain the best subset of features. Materials and Methods: After solving the biased distribution in the CAD data set using the SMOTE method and finding the high correlation features that affected the classification of CAD patients. The performance of the proposed model was evaluated using grid search optimization, and the best hyperparameters were identified for developing four applications, namely, RF, AdaBoost, gradient-boosting, and extra trees based on an HEV classifier. Results: Five fold cross-validation experiments with the HEV classifier showed excellent prediction performance results with the 10 best balanced features obtained using SMOTE and feature selection. All evaluation metrics results reached > 98% with the HEV classifier, and the gradient-boosting model was the second best classification model with accuracy = 97% and F1-score = 98%. Conclusions: When compared to modern methods, the proposed method perform well in diagnosing coronary artery disease, and therefore, the proposed method can be used by medical personnel for supplementary therapy for timely, accurate, and efficient identification of CAD cases in suspected patients. MDPI 2022-11-28 /pmc/articles/PMC9783937/ /pubmed/36556946 http://dx.doi.org/10.3390/medicina58121745 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Mohammedqasim, Hayder
Mohammedqasem, Roa’a
Ata, Oguz
Alyasin, Eman Ibrahim
Diagnosing Coronary Artery Disease on the Basis of Hard Ensemble Voting Optimization
title Diagnosing Coronary Artery Disease on the Basis of Hard Ensemble Voting Optimization
title_full Diagnosing Coronary Artery Disease on the Basis of Hard Ensemble Voting Optimization
title_fullStr Diagnosing Coronary Artery Disease on the Basis of Hard Ensemble Voting Optimization
title_full_unstemmed Diagnosing Coronary Artery Disease on the Basis of Hard Ensemble Voting Optimization
title_short Diagnosing Coronary Artery Disease on the Basis of Hard Ensemble Voting Optimization
title_sort diagnosing coronary artery disease on the basis of hard ensemble voting optimization
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9783937/
https://www.ncbi.nlm.nih.gov/pubmed/36556946
http://dx.doi.org/10.3390/medicina58121745
work_keys_str_mv AT mohammedqasimhayder diagnosingcoronaryarterydiseaseonthebasisofhardensemblevotingoptimization
AT mohammedqasemroaa diagnosingcoronaryarterydiseaseonthebasisofhardensemblevotingoptimization
AT ataoguz diagnosingcoronaryarterydiseaseonthebasisofhardensemblevotingoptimization
AT alyasinemanibrahim diagnosingcoronaryarterydiseaseonthebasisofhardensemblevotingoptimization