
Machine learning outperforms traditional logistic regression and offers new possibilities for cardiovascular risk prediction: A study involving 143,043 Chinese patients with hypertension

INTRODUCTION: Identifying people at risk of cardiovascular diseases (CVD) is a cornerstone of preventive cardiology. We developed machine learning (ML) algorithms and investigated their performance in predicting patients’ current CVD risk (coronary heart disease and stroke in this study). MATERIALS...


Bibliographic Details
Main Authors: Xi, Yang, Wang, Hongyi, Sun, Ningling
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2022
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9701715/
https://www.ncbi.nlm.nih.gov/pubmed/36451926
http://dx.doi.org/10.3389/fcvm.2022.1025705
author Xi, Yang
Wang, Hongyi
Sun, Ningling
collection PubMed
description INTRODUCTION: Identifying people at risk of cardiovascular diseases (CVD) is a cornerstone of preventive cardiology. We developed machine learning (ML) algorithms and investigated their performance in predicting patients’ current CVD risk (coronary heart disease and stroke in this study). MATERIALS AND METHODS: We compared traditional logistic regression (LR) with five ML algorithms (LR with Elastic-Net, Random Forest (RF), XGBoost (XGB), Support Vector Machine, and Deep Learning) and with an Ensemble model averaging predictions from RF, XGB, and Deep Learning. CVD risk was predicted using pre-existing patient-level data from a multi-center, cross-sectional study (the Microalbuminuria Screening in Hypertensive Patients Project initiated by the China International Exchange and Promotive Association for Medical and Healthcare) that enrolled 143,043 patients with hypertension from 600 tertiary, secondary, or community hospitals. Each of the five ML algorithms incorporated 18 variables, such as demographics, examinations, comorbidities, and treatment regimens, and was trained and evaluated using 5-fold cross-validation. Predictive accuracy was assessed by the area under the receiver operating characteristic curve (AUROC). RESULTS: Patients’ mean age was 62 ± 12 years and 57% were men. Advanced ML algorithms outperformed the traditional LR model. In particular, the Ensemble model had superior discrimination, with an AUROC of 0.760, compared with LR (AUROC = 0.737) and the other tested models. CONCLUSION: We established an Ensemble model that performs better than the traditional LR model in predicting patients’ current CVD risk from routine information. ML can help physicians design follow-up plans with more accurate results, offering new possibilities for short-term risk prediction and early detection. Further, ML models can be trained with longitudinal data and used to predict long-term CVD risks, thereby informing CVD prevention.
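The evaluation described in the abstract (an unweighted average of model predictions, 5-fold cross-validation, and AUROC) can be illustrated with a minimal sketch. This is not the authors' code: the function names `k_fold_indices`, `average_ensemble`, and `auroc` are hypothetical, the probabilities are made-up toy values rather than study data, and the record does not say how the folds were drawn or whether the average was weighted, so interleaved folds and a plain mean are assumptions.

```python
from typing import List, Sequence, Tuple


def k_fold_indices(n: int, k: int = 5) -> List[Tuple[List[int], List[int]]]:
    """Split indices 0..n-1 into k interleaved folds; return (train, test) pairs."""
    if not 2 <= k <= n:
        raise ValueError("k must satisfy 2 <= k <= n")
    folds = [list(range(i, n, k)) for i in range(k)]
    splits = []
    for i, test in enumerate(folds):
        # Training indices are every fold except the held-out one.
        train = sorted(idx for j, fold in enumerate(folds) if j != i for idx in fold)
        splits.append((train, test))
    return splits


def average_ensemble(*model_probs: Sequence[float]) -> List[float]:
    """Unweighted mean of per-patient predicted probabilities (assumed scheme)."""
    if not model_probs:
        raise ValueError("need at least one model")
    n = len(model_probs[0])
    if any(len(p) != n for p in model_probs):
        raise ValueError("all models must score the same patients")
    return [sum(col) / len(model_probs) for col in zip(*model_probs)]


def auroc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUROC as the fraction of (positive, negative) pairs ranked correctly.

    Tied scores count as half a correct pair. This is O(P * N), which is
    fine for a toy example but too slow for a cohort of 143,043 patients.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for q in neg:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Toy labels and hypothetical held-out probabilities from three models.
    y = [1, 0, 1, 0, 1, 0, 0, 1]
    rf = [0.7, 0.4, 0.5, 0.6, 0.8, 0.2, 0.3, 0.4]
    xgb = [0.6, 0.3, 0.7, 0.4, 0.5, 0.5, 0.2, 0.6]
    dl = [0.8, 0.5, 0.4, 0.3, 0.7, 0.3, 0.6, 0.7]
    ens = average_ensemble(rf, xgb, dl)
    for name, probs in [("RF", rf), ("XGB", xgb), ("DL", dl), ("Ensemble", ens)]:
        print(name, round(auroc(y, probs), 3))
    print(len(k_fold_indices(len(y), k=4)), "folds")
```

In a full pipeline, each model would be fit on the training indices of a split and scored on its test indices, and the per-fold AUROC values would then be summarized across the five folds.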
format Online
Article
Text
id pubmed-9701715
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-9701715 2022-11-29 Xi, Yang Wang, Hongyi Sun, Ningling Front Cardiovasc Med Cardiovascular Medicine Frontiers Media S.A. 2022-11-14 /pmc/articles/PMC9701715/ /pubmed/36451926 http://dx.doi.org/10.3389/fcvm.2022.1025705 Text en Copyright © 2022 Xi, Wang and Sun. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
title Machine learning outperforms traditional logistic regression and offers new possibilities for cardiovascular risk prediction: A study involving 143,043 Chinese patients with hypertension
topic Cardiovascular Medicine