Cargando…

Diagnosis of heart disease using oversampling methods and decision tree classifier in cardiology

PURPOSE: Heart disease is one of the most prevalent and critical diseases that endangers the lives of human beings. In addition to clinical diagnosis, machine learning and deep learning-based approaches are vital in the diagnosis of heart disease. METHOD: This paper proposes a balanced and optimized...

Descripción completa

Detalles Bibliográficos
Autores principales: Albert, Anitha Juliette, Murugan, R., Sripriya, T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9792315/
http://dx.doi.org/10.1007/s42600-022-00253-9
_version_ 1784859607826432000
author Albert, Anitha Juliette
Murugan, R.
Sripriya, T.
author_facet Albert, Anitha Juliette
Murugan, R.
Sripriya, T.
author_sort Albert, Anitha Juliette
collection PubMed
description PURPOSE: Heart disease is one of the most prevalent and critical diseases that endangers the lives of human beings. In addition to clinical diagnosis, machine learning and deep learning-based approaches are vital in the diagnosis of heart disease. METHOD: This paper proposes a balanced and optimized machine-learning algorithm for heart disease detection. This technique combines oversampling techniques, attribute pruning, CART decision tree classifier, and rule pruning through hyper-parameter tuning to identify the presence of heart disease. It further identifies the key attributes that contribute to the occurrence of heart malfunctioning. RESULTS: Experimental results show that SMOTE sampled dataset exhibits effective performance when implemented using a balanced and optimized machine learning algorithm, with an improvement of 11%, 75%, 62%, and 71% in accuracy, precision, recall, and f1 scores when compared with the dataset that was not subjected to sampling. The algorithm works effectively when the imbalance ratio is high for a dataset. CONCLUSION: The algorithm can be used to predict the presence of heart disease even in highly imbalanced datasets and identify critical features in the malfunctioning of the heart.
format Online
Article
Text
id pubmed-9792315
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-97923152022-12-27 Diagnosis of heart disease using oversampling methods and decision tree classifier in cardiology Albert, Anitha Juliette Murugan, R. Sripriya, T. Res. Biomed. Eng. Original Article PURPOSE: Heart disease is one of the most prevalent and critical diseases that endangers the lives of human beings. In addition to clinical diagnosis, machine learning and deep learning-based approaches are vital in the diagnosis of heart disease. METHOD: This paper proposes a balanced and optimized machine-learning algorithm for heart disease detection. This technique combines oversampling techniques, attribute pruning, CART decision tree classifier, and rule pruning through hyper-parameter tuning to identify the presence of heart disease. It further identifies the key attributes that contribute to the occurrence of heart malfunctioning. RESULTS: Experimental results show that SMOTE sampled dataset exhibits effective performance when implemented using a balanced and optimized machine learning algorithm, with an improvement of 11%, 75%, 62%, and 71% in accuracy, precision, recall, and f1 scores when compared with the dataset that was not subjected to sampling. The algorithm works effectively when the imbalance ratio is high for a dataset. CONCLUSION: The algorithm can be used to predict the presence of heart disease even in highly imbalanced datasets and identify critical features in the malfunctioning of the heart. Springer International Publishing 2022-12-27 2023 /pmc/articles/PMC9792315/ http://dx.doi.org/10.1007/s42600-022-00253-9 Text en © The Author(s), under exclusive licence to The Brazilian Society of Biomedical Engineering 2022, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Original Article
Albert, Anitha Juliette
Murugan, R.
Sripriya, T.
Diagnosis of heart disease using oversampling methods and decision tree classifier in cardiology
title Diagnosis of heart disease using oversampling methods and decision tree classifier in cardiology
title_full Diagnosis of heart disease using oversampling methods and decision tree classifier in cardiology
title_fullStr Diagnosis of heart disease using oversampling methods and decision tree classifier in cardiology
title_full_unstemmed Diagnosis of heart disease using oversampling methods and decision tree classifier in cardiology
title_short Diagnosis of heart disease using oversampling methods and decision tree classifier in cardiology
title_sort diagnosis of heart disease using oversampling methods and decision tree classifier in cardiology
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9792315/
http://dx.doi.org/10.1007/s42600-022-00253-9
work_keys_str_mv AT albertanithajuliette diagnosisofheartdiseaseusingoversamplingmethodsanddecisiontreeclassifierincardiology
AT muruganr diagnosisofheartdiseaseusingoversamplingmethodsanddecisiontreeclassifierincardiology
AT sripriyat diagnosisofheartdiseaseusingoversamplingmethodsanddecisiontreeclassifierincardiology