Cargando…
Diagnosing Coronary Artery Disease on the Basis of Hard Ensemble Voting Optimization
Background and Objectives: Recently, many studies have focused on the early diagnosis of coronary artery disease (CAD), which is one of the leading causes of cardiac-associated death worldwide. The effectiveness of the most important features influencing disease diagnosis determines the performance...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9783937/ https://www.ncbi.nlm.nih.gov/pubmed/36556946 http://dx.doi.org/10.3390/medicina58121745 |
_version_ | 1784857692933718016 |
---|---|
author | Mohammedqasim, Hayder Mohammedqasem, Roa’a Ata, Oguz Alyasin, Eman Ibrahim |
author_facet | Mohammedqasim, Hayder Mohammedqasem, Roa’a Ata, Oguz Alyasin, Eman Ibrahim |
author_sort | Mohammedqasim, Hayder |
collection | PubMed |
description | Background and Objectives: Recently, many studies have focused on the early diagnosis of coronary artery disease (CAD), which is one of the leading causes of cardiac-associated death worldwide. The effectiveness of the most important features influencing disease diagnosis determines the performance of machine learning systems that can allow for timely and accurate treatment. We performed a Hybrid ML framework based on hard ensemble voting optimization (HEVO) to classify patients with CAD using the Z-Alizadeh Sani dataset. All categorical features were converted to numerical forms, the synthetic minority oversampling technique (SMOTE) was employed to overcome imbalanced distribution between two classes in the dataset, and then, recursive feature elimination (RFE) with random forest (RF) was used to obtain the best subset of features. Materials and Methods: After solving the biased distribution in the CAD data set using the SMOTE method and finding the high correlation features that affected the classification of CAD patients. The performance of the proposed model was evaluated using grid search optimization, and the best hyperparameters were identified for developing four applications, namely, RF, AdaBoost, gradient-boosting, and extra trees based on an HEV classifier. Results: Five fold cross-validation experiments with the HEV classifier showed excellent prediction performance results with the 10 best balanced features obtained using SMOTE and feature selection. All evaluation metrics results reached > 98% with the HEV classifier, and the gradient-boosting model was the second best classification model with accuracy = 97% and F1-score = 98%. Conclusions: When compared to modern methods, the proposed method perform well in diagnosing coronary artery disease, and therefore, the proposed method can be used by medical personnel for supplementary therapy for timely, accurate, and efficient identification of CAD cases in suspected patients. |
format | Online Article Text |
id | pubmed-9783937 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-97839372022-12-24 Diagnosing Coronary Artery Disease on the Basis of Hard Ensemble Voting Optimization Mohammedqasim, Hayder Mohammedqasem, Roa’a Ata, Oguz Alyasin, Eman Ibrahim Medicina (Kaunas) Article Background and Objectives: Recently, many studies have focused on the early diagnosis of coronary artery disease (CAD), which is one of the leading causes of cardiac-associated death worldwide. The effectiveness of the most important features influencing disease diagnosis determines the performance of machine learning systems that can allow for timely and accurate treatment. We performed a Hybrid ML framework based on hard ensemble voting optimization (HEVO) to classify patients with CAD using the Z-Alizadeh Sani dataset. All categorical features were converted to numerical forms, the synthetic minority oversampling technique (SMOTE) was employed to overcome imbalanced distribution between two classes in the dataset, and then, recursive feature elimination (RFE) with random forest (RF) was used to obtain the best subset of features. Materials and Methods: After solving the biased distribution in the CAD data set using the SMOTE method and finding the high correlation features that affected the classification of CAD patients. The performance of the proposed model was evaluated using grid search optimization, and the best hyperparameters were identified for developing four applications, namely, RF, AdaBoost, gradient-boosting, and extra trees based on an HEV classifier. Results: Five fold cross-validation experiments with the HEV classifier showed excellent prediction performance results with the 10 best balanced features obtained using SMOTE and feature selection. All evaluation metrics results reached > 98% with the HEV classifier, and the gradient-boosting model was the second best classification model with accuracy = 97% and F1-score = 98%. Conclusions: When compared to modern methods, the proposed method perform well in diagnosing coronary artery disease, and therefore, the proposed method can be used by medical personnel for supplementary therapy for timely, accurate, and efficient identification of CAD cases in suspected patients. MDPI 2022-11-28 /pmc/articles/PMC9783937/ /pubmed/36556946 http://dx.doi.org/10.3390/medicina58121745 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Mohammedqasim, Hayder Mohammedqasem, Roa’a Ata, Oguz Alyasin, Eman Ibrahim Diagnosing Coronary Artery Disease on the Basis of Hard Ensemble Voting Optimization |
title | Diagnosing Coronary Artery Disease on the Basis of Hard Ensemble Voting Optimization |
title_full | Diagnosing Coronary Artery Disease on the Basis of Hard Ensemble Voting Optimization |
title_fullStr | Diagnosing Coronary Artery Disease on the Basis of Hard Ensemble Voting Optimization |
title_full_unstemmed | Diagnosing Coronary Artery Disease on the Basis of Hard Ensemble Voting Optimization |
title_short | Diagnosing Coronary Artery Disease on the Basis of Hard Ensemble Voting Optimization |
title_sort | diagnosing coronary artery disease on the basis of hard ensemble voting optimization |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9783937/ https://www.ncbi.nlm.nih.gov/pubmed/36556946 http://dx.doi.org/10.3390/medicina58121745 |
work_keys_str_mv | AT mohammedqasimhayder diagnosingcoronaryarterydiseaseonthebasisofhardensemblevotingoptimization AT mohammedqasemroaa diagnosingcoronaryarterydiseaseonthebasisofhardensemblevotingoptimization AT ataoguz diagnosingcoronaryarterydiseaseonthebasisofhardensemblevotingoptimization AT alyasinemanibrahim diagnosingcoronaryarterydiseaseonthebasisofhardensemblevotingoptimization |