Cargando…
Machine learning outperforms traditional logistic regression and offers new possibilities for cardiovascular risk prediction: A study involving 143,043 Chinese patients with hypertension
INTRODUCTION: Identifying people at risk of cardiovascular diseases (CVD) is a cornerstone of preventive cardiology. We developed machine learning (ML) algorithms and investigated their performance in predicting patients’ current CVD risk (coronary heart disease and stroke in this study). MATERIALS...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9701715/ https://www.ncbi.nlm.nih.gov/pubmed/36451926 http://dx.doi.org/10.3389/fcvm.2022.1025705 |
_version_ | 1784839596429803520 |
---|---|
author | Xi, Yang Wang, Hongyi Sun, Ningling |
author_facet | Xi, Yang Wang, Hongyi Sun, Ningling |
author_sort | Xi, Yang |
collection | PubMed |
description | INTRODUCTION: Identifying people at risk of cardiovascular diseases (CVD) is a cornerstone of preventive cardiology. We developed machine learning (ML) algorithms and investigated their performance in predicting patients’ current CVD risk (coronary heart disease and stroke in this study). MATERIALS AND METHODS: We compared traditional logistic regression (LR) with five ML algorithms LR with Elastic-Net, Random Forest (RF), XGBoost (XGB), Support Vector Machine, Deep Learning, and an Ensemble model averaging predictions from RF, XGB, and Deep Learning for CVD risk prediction using pre-existing patient-level data from a multi-center, cross-sectional study (the Microalbuminuria Screening in Hypertensive Patients Project initiated by the China International Exchange and Promotive Association for Medical and Healthcare) that enrolled 143,043 patients with hypertension from 600 tertiary, secondary, or community hospitals. Each of the five ML algorithms incorporated 18 variables, such as demographics, examinations, comorbidities, and treatment regimens, and were trained and evaluated using 5-fold cross-validation. Predictive accuracy was assessed by the area under the receiver operating curve (AUROC). RESULTS: Patients’ mean age was 62 ± 12 years and 57% were men. Advanced ML algorithms outperformed the traditional LR model. Particularly, the Ensemble model had superior discrimination with an AUROC of 0.760 than LR (AUC = 0.737) and other tested models. CONCLUSION: We establishes an Ensemble model that shows better performance in predicting patients’ current CVD risk using routine information compared to the traditional LR model. ML can help physicians design follow-up plans with more accurate results, offering new possibilities for short-term risk prediction and early detection. Further, ML models can be trained with longitudinal data and used to predict long-term CVD risks, thereby informing CVD prevention. |
format | Online Article Text |
id | pubmed-9701715 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-97017152022-11-29 Machine learning outperforms traditional logistic regression and offers new possibilities for cardiovascular risk prediction: A study involving 143,043 Chinese patients with hypertension Xi, Yang Wang, Hongyi Sun, Ningling Front Cardiovasc Med Cardiovascular Medicine INTRODUCTION: Identifying people at risk of cardiovascular diseases (CVD) is a cornerstone of preventive cardiology. We developed machine learning (ML) algorithms and investigated their performance in predicting patients’ current CVD risk (coronary heart disease and stroke in this study). MATERIALS AND METHODS: We compared traditional logistic regression (LR) with five ML algorithms LR with Elastic-Net, Random Forest (RF), XGBoost (XGB), Support Vector Machine, Deep Learning, and an Ensemble model averaging predictions from RF, XGB, and Deep Learning for CVD risk prediction using pre-existing patient-level data from a multi-center, cross-sectional study (the Microalbuminuria Screening in Hypertensive Patients Project initiated by the China International Exchange and Promotive Association for Medical and Healthcare) that enrolled 143,043 patients with hypertension from 600 tertiary, secondary, or community hospitals. Each of the five ML algorithms incorporated 18 variables, such as demographics, examinations, comorbidities, and treatment regimens, and were trained and evaluated using 5-fold cross-validation. Predictive accuracy was assessed by the area under the receiver operating curve (AUROC). RESULTS: Patients’ mean age was 62 ± 12 years and 57% were men. Advanced ML algorithms outperformed the traditional LR model. Particularly, the Ensemble model had superior discrimination with an AUROC of 0.760 than LR (AUC = 0.737) and other tested models. CONCLUSION: We establishes an Ensemble model that shows better performance in predicting patients’ current CVD risk using routine information compared to the traditional LR model. ML can help physicians design follow-up plans with more accurate results, offering new possibilities for short-term risk prediction and early detection. Further, ML models can be trained with longitudinal data and used to predict long-term CVD risks, thereby informing CVD prevention. Frontiers Media S.A. 2022-11-14 /pmc/articles/PMC9701715/ /pubmed/36451926 http://dx.doi.org/10.3389/fcvm.2022.1025705 Text en Copyright © 2022 Xi, Wang and Sun. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Cardiovascular Medicine Xi, Yang Wang, Hongyi Sun, Ningling Machine learning outperforms traditional logistic regression and offers new possibilities for cardiovascular risk prediction: A study involving 143,043 Chinese patients with hypertension |
title | Machine learning outperforms traditional logistic regression and offers new possibilities for cardiovascular risk prediction: A study involving 143,043 Chinese patients with hypertension |
title_full | Machine learning outperforms traditional logistic regression and offers new possibilities for cardiovascular risk prediction: A study involving 143,043 Chinese patients with hypertension |
title_fullStr | Machine learning outperforms traditional logistic regression and offers new possibilities for cardiovascular risk prediction: A study involving 143,043 Chinese patients with hypertension |
title_full_unstemmed | Machine learning outperforms traditional logistic regression and offers new possibilities for cardiovascular risk prediction: A study involving 143,043 Chinese patients with hypertension |
title_short | Machine learning outperforms traditional logistic regression and offers new possibilities for cardiovascular risk prediction: A study involving 143,043 Chinese patients with hypertension |
title_sort | machine learning outperforms traditional logistic regression and offers new possibilities for cardiovascular risk prediction: a study involving 143,043 chinese patients with hypertension |
topic | Cardiovascular Medicine |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9701715/ https://www.ncbi.nlm.nih.gov/pubmed/36451926 http://dx.doi.org/10.3389/fcvm.2022.1025705 |
work_keys_str_mv | AT xiyang machinelearningoutperformstraditionallogisticregressionandoffersnewpossibilitiesforcardiovascularriskpredictionastudyinvolving143043chinesepatientswithhypertension AT wanghongyi machinelearningoutperformstraditionallogisticregressionandoffersnewpossibilitiesforcardiovascularriskpredictionastudyinvolving143043chinesepatientswithhypertension AT sunningling machinelearningoutperformstraditionallogisticregressionandoffersnewpossibilitiesforcardiovascularriskpredictionastudyinvolving143043chinesepatientswithhypertension |