Cargando…

Efficient Data-Driven Machine Learning Models for Cardiovascular Diseases Risk Prediction

Cardiovascular diseases (CVDs) are now the leading cause of death, as the quality of life and human habits have changed significantly. CVDs are accompanied by various complications, including all pathological changes involving the heart and/or blood vessels. The list of pathological changes includes...

Descripción completa

Detalles Bibliográficos
Autores principales:	Dritsas, Elias, Trigka, Maria
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9921621/ https://www.ncbi.nlm.nih.gov/pubmed/36772201 http://dx.doi.org/10.3390/s23031161

_version_	1784887355685994496
author	Dritsas, Elias Trigka, Maria
author_facet	Dritsas, Elias Trigka, Maria
author_sort	Dritsas, Elias
collection	PubMed
description	Cardiovascular diseases (CVDs) are now the leading cause of death, as the quality of life and human habits have changed significantly. CVDs are accompanied by various complications, including all pathological changes involving the heart and/or blood vessels. The list of pathological changes includes hypertension, coronary heart disease, heart failure, angina, myocardial infarction and stroke. Hence, prevention and early diagnosis could limit the onset or progression of the disease. Nowadays, machine learning (ML) techniques have gained a significant role in disease prediction and are an essential tool in medicine. In this study, a supervised ML-based methodology is presented through which we aim to design efficient prediction models for CVD manifestation, highlighting the SMOTE technique’s superiority. Detailed analysis and understanding of risk factors are shown to explore their importance and contribution to CVD prediction. These factors are fed as input features to a plethora of ML models, which are trained and tested to identify the most appropriate for our objective under a binary classification problem with a uniform class probability distribution. Various ML models were evaluated after the use or non-use of Synthetic Minority Oversampling Technique (SMOTE), and comparing them in terms of Accuracy, Recall, Precision and an Area Under the Curve (AUC). The experiment results showed that the Stacking ensemble model after SMOTE with 10-fold cross-validation prevailed over the other ones achieving an Accuracy of 87.8%, Recall of 88.3%, Precision of 88% and an AUC equal to 98.2%.
format	Online Article Text
id	pubmed-9921621
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-99216212023-02-12 Efficient Data-Driven Machine Learning Models for Cardiovascular Diseases Risk Prediction Dritsas, Elias Trigka, Maria Sensors (Basel) Article Cardiovascular diseases (CVDs) are now the leading cause of death, as the quality of life and human habits have changed significantly. CVDs are accompanied by various complications, including all pathological changes involving the heart and/or blood vessels. The list of pathological changes includes hypertension, coronary heart disease, heart failure, angina, myocardial infarction and stroke. Hence, prevention and early diagnosis could limit the onset or progression of the disease. Nowadays, machine learning (ML) techniques have gained a significant role in disease prediction and are an essential tool in medicine. In this study, a supervised ML-based methodology is presented through which we aim to design efficient prediction models for CVD manifestation, highlighting the SMOTE technique’s superiority. Detailed analysis and understanding of risk factors are shown to explore their importance and contribution to CVD prediction. These factors are fed as input features to a plethora of ML models, which are trained and tested to identify the most appropriate for our objective under a binary classification problem with a uniform class probability distribution. Various ML models were evaluated after the use or non-use of Synthetic Minority Oversampling Technique (SMOTE), and comparing them in terms of Accuracy, Recall, Precision and an Area Under the Curve (AUC). The experiment results showed that the Stacking ensemble model after SMOTE with 10-fold cross-validation prevailed over the other ones achieving an Accuracy of 87.8%, Recall of 88.3%, Precision of 88% and an AUC equal to 98.2%. MDPI 2023-01-19 /pmc/articles/PMC9921621/ /pubmed/36772201 http://dx.doi.org/10.3390/s23031161 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Dritsas, Elias Trigka, Maria Efficient Data-Driven Machine Learning Models for Cardiovascular Diseases Risk Prediction
title	Efficient Data-Driven Machine Learning Models for Cardiovascular Diseases Risk Prediction
title_full	Efficient Data-Driven Machine Learning Models for Cardiovascular Diseases Risk Prediction
title_fullStr	Efficient Data-Driven Machine Learning Models for Cardiovascular Diseases Risk Prediction
title_full_unstemmed	Efficient Data-Driven Machine Learning Models for Cardiovascular Diseases Risk Prediction
title_short	Efficient Data-Driven Machine Learning Models for Cardiovascular Diseases Risk Prediction
title_sort	efficient data-driven machine learning models for cardiovascular diseases risk prediction
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9921621/ https://www.ncbi.nlm.nih.gov/pubmed/36772201 http://dx.doi.org/10.3390/s23031161
work_keys_str_mv	AT dritsaselias efficientdatadrivenmachinelearningmodelsforcardiovasculardiseasesriskprediction AT trigkamaria efficientdatadrivenmachinelearningmodelsforcardiovasculardiseasesriskprediction

Efficient Data-Driven Machine Learning Models for Cardiovascular Diseases Risk Prediction

Ejemplares similares