Cargando…

Hyperparameter optimization for cardiovascular disease data-driven prognostic system

Prediction and diagnosis of cardiovascular diseases (CVDs) based, among other things, on medical examinations and patient symptoms are the biggest challenges in medicine. About 17.9 million people die from CVDs annually, accounting for 31% of all deaths worldwide. With a timely prognosis and thoroug...

Descripción completa

Detalles Bibliográficos
Autores principales: Saputra, Jayson, Lawrencya, Cindy, Saini, Jecky Mitra, Suharjito, Suharjito
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Nature Singapore 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10390457/
https://www.ncbi.nlm.nih.gov/pubmed/37524951
http://dx.doi.org/10.1186/s42492-023-00143-6
_version_ 1785082477754187776
author Saputra, Jayson
Lawrencya, Cindy
Saini, Jecky Mitra
Suharjito, Suharjito
author_facet Saputra, Jayson
Lawrencya, Cindy
Saini, Jecky Mitra
Suharjito, Suharjito
author_sort Saputra, Jayson
collection PubMed
description Prediction and diagnosis of cardiovascular diseases (CVDs) based, among other things, on medical examinations and patient symptoms are the biggest challenges in medicine. About 17.9 million people die from CVDs annually, accounting for 31% of all deaths worldwide. With a timely prognosis and thorough consideration of the patient’s medical history and lifestyle, it is possible to predict CVDs and take preventive measures to eliminate or control this life-threatening disease. In this study, we used various patient datasets from a major hospital in the United States as prognostic factors for CVD. The data was obtained by monitoring a total of 918 patients whose criteria for adults were 28-77 years old. In this study, we present a data mining modeling approach to analyze the performance, classification accuracy and number of clusters on Cardiovascular Disease Prognostic datasets in unsupervised machine learning (ML) using the Orange data mining software. Various techniques are then used to classify the model parameters, such as k-nearest neighbors, support vector machine, random forest, artificial neural network (ANN), naïve bayes, logistic regression, stochastic gradient descent (SGD), and AdaBoost. To determine the number of clusters, various unsupervised ML clustering methods were used, such as k-means, hierarchical, and density-based spatial clustering of applications with noise clustering. The results showed that the best model performance analysis and classification accuracy were SGD and ANN, both of which had a high score of 0.900 on Cardiovascular Disease Prognostic datasets. Based on the results of most clustering methods, such as k-means and hierarchical clustering, Cardiovascular Disease Prognostic datasets can be divided into two clusters. The prognostic accuracy of CVD depends on the accuracy of the proposed model in determining the diagnostic model. The more accurate the model, the better it can predict which patients are at risk for CVD.
format Online
Article
Text
id pubmed-10390457
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer Nature Singapore
record_format MEDLINE/PubMed
spelling pubmed-103904572023-08-02 Hyperparameter optimization for cardiovascular disease data-driven prognostic system Saputra, Jayson Lawrencya, Cindy Saini, Jecky Mitra Suharjito, Suharjito Vis Comput Ind Biomed Art Original Article Prediction and diagnosis of cardiovascular diseases (CVDs) based, among other things, on medical examinations and patient symptoms are the biggest challenges in medicine. About 17.9 million people die from CVDs annually, accounting for 31% of all deaths worldwide. With a timely prognosis and thorough consideration of the patient’s medical history and lifestyle, it is possible to predict CVDs and take preventive measures to eliminate or control this life-threatening disease. In this study, we used various patient datasets from a major hospital in the United States as prognostic factors for CVD. The data was obtained by monitoring a total of 918 patients whose criteria for adults were 28-77 years old. In this study, we present a data mining modeling approach to analyze the performance, classification accuracy and number of clusters on Cardiovascular Disease Prognostic datasets in unsupervised machine learning (ML) using the Orange data mining software. Various techniques are then used to classify the model parameters, such as k-nearest neighbors, support vector machine, random forest, artificial neural network (ANN), naïve bayes, logistic regression, stochastic gradient descent (SGD), and AdaBoost. To determine the number of clusters, various unsupervised ML clustering methods were used, such as k-means, hierarchical, and density-based spatial clustering of applications with noise clustering. The results showed that the best model performance analysis and classification accuracy were SGD and ANN, both of which had a high score of 0.900 on Cardiovascular Disease Prognostic datasets. Based on the results of most clustering methods, such as k-means and hierarchical clustering, Cardiovascular Disease Prognostic datasets can be divided into two clusters. The prognostic accuracy of CVD depends on the accuracy of the proposed model in determining the diagnostic model. The more accurate the model, the better it can predict which patients are at risk for CVD. Springer Nature Singapore 2023-08-01 /pmc/articles/PMC10390457/ /pubmed/37524951 http://dx.doi.org/10.1186/s42492-023-00143-6 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Original Article
Saputra, Jayson
Lawrencya, Cindy
Saini, Jecky Mitra
Suharjito, Suharjito
Hyperparameter optimization for cardiovascular disease data-driven prognostic system
title Hyperparameter optimization for cardiovascular disease data-driven prognostic system
title_full Hyperparameter optimization for cardiovascular disease data-driven prognostic system
title_fullStr Hyperparameter optimization for cardiovascular disease data-driven prognostic system
title_full_unstemmed Hyperparameter optimization for cardiovascular disease data-driven prognostic system
title_short Hyperparameter optimization for cardiovascular disease data-driven prognostic system
title_sort hyperparameter optimization for cardiovascular disease data-driven prognostic system
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10390457/
https://www.ncbi.nlm.nih.gov/pubmed/37524951
http://dx.doi.org/10.1186/s42492-023-00143-6
work_keys_str_mv AT saputrajayson hyperparameteroptimizationforcardiovasculardiseasedatadrivenprognosticsystem
AT lawrencyacindy hyperparameteroptimizationforcardiovasculardiseasedatadrivenprognosticsystem
AT sainijeckymitra hyperparameteroptimizationforcardiovasculardiseasedatadrivenprognosticsystem
AT suharjitosuharjito hyperparameteroptimizationforcardiovasculardiseasedatadrivenprognosticsystem