Cargando…
Diagnosis of heart disease using oversampling methods and decision tree classifier in cardiology
PURPOSE: Heart disease is one of the most prevalent and critical diseases that endangers the lives of human beings. In addition to clinical diagnosis, machine learning and deep learning-based approaches are vital in the diagnosis of heart disease. METHOD: This paper proposes a balanced and optimized...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9792315/ http://dx.doi.org/10.1007/s42600-022-00253-9 |
_version_ | 1784859607826432000 |
---|---|
author | Albert, Anitha Juliette Murugan, R. Sripriya, T. |
author_facet | Albert, Anitha Juliette Murugan, R. Sripriya, T. |
author_sort | Albert, Anitha Juliette |
collection | PubMed |
description | PURPOSE: Heart disease is one of the most prevalent and critical diseases that endangers the lives of human beings. In addition to clinical diagnosis, machine learning and deep learning-based approaches are vital in the diagnosis of heart disease. METHOD: This paper proposes a balanced and optimized machine-learning algorithm for heart disease detection. This technique combines oversampling techniques, attribute pruning, CART decision tree classifier, and rule pruning through hyper-parameter tuning to identify the presence of heart disease. It further identifies the key attributes that contribute to the occurrence of heart malfunctioning. RESULTS: Experimental results show that SMOTE sampled dataset exhibits effective performance when implemented using a balanced and optimized machine learning algorithm, with an improvement of 11%, 75%, 62%, and 71% in accuracy, precision, recall, and f1 scores when compared with the dataset that was not subjected to sampling. The algorithm works effectively when the imbalance ratio is high for a dataset. CONCLUSION: The algorithm can be used to predict the presence of heart disease even in highly imbalanced datasets and identify critical features in the malfunctioning of the heart. |
format | Online Article Text |
id | pubmed-9792315 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-97923152022-12-27 Diagnosis of heart disease using oversampling methods and decision tree classifier in cardiology Albert, Anitha Juliette Murugan, R. Sripriya, T. Res. Biomed. Eng. Original Article PURPOSE: Heart disease is one of the most prevalent and critical diseases that endangers the lives of human beings. In addition to clinical diagnosis, machine learning and deep learning-based approaches are vital in the diagnosis of heart disease. METHOD: This paper proposes a balanced and optimized machine-learning algorithm for heart disease detection. This technique combines oversampling techniques, attribute pruning, CART decision tree classifier, and rule pruning through hyper-parameter tuning to identify the presence of heart disease. It further identifies the key attributes that contribute to the occurrence of heart malfunctioning. RESULTS: Experimental results show that SMOTE sampled dataset exhibits effective performance when implemented using a balanced and optimized machine learning algorithm, with an improvement of 11%, 75%, 62%, and 71% in accuracy, precision, recall, and f1 scores when compared with the dataset that was not subjected to sampling. The algorithm works effectively when the imbalance ratio is high for a dataset. CONCLUSION: The algorithm can be used to predict the presence of heart disease even in highly imbalanced datasets and identify critical features in the malfunctioning of the heart. Springer International Publishing 2022-12-27 2023 /pmc/articles/PMC9792315/ http://dx.doi.org/10.1007/s42600-022-00253-9 Text en © The Author(s), under exclusive licence to The Brazilian Society of Biomedical Engineering 2022, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Original Article Albert, Anitha Juliette Murugan, R. Sripriya, T. Diagnosis of heart disease using oversampling methods and decision tree classifier in cardiology |
title | Diagnosis of heart disease using oversampling methods and decision tree classifier in cardiology |
title_full | Diagnosis of heart disease using oversampling methods and decision tree classifier in cardiology |
title_fullStr | Diagnosis of heart disease using oversampling methods and decision tree classifier in cardiology |
title_full_unstemmed | Diagnosis of heart disease using oversampling methods and decision tree classifier in cardiology |
title_short | Diagnosis of heart disease using oversampling methods and decision tree classifier in cardiology |
title_sort | diagnosis of heart disease using oversampling methods and decision tree classifier in cardiology |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9792315/ http://dx.doi.org/10.1007/s42600-022-00253-9 |
work_keys_str_mv | AT albertanithajuliette diagnosisofheartdiseaseusingoversamplingmethodsanddecisiontreeclassifierincardiology AT muruganr diagnosisofheartdiseaseusingoversamplingmethodsanddecisiontreeclassifierincardiology AT sripriyat diagnosisofheartdiseaseusingoversamplingmethodsanddecisiontreeclassifierincardiology |