Cargando…
A Cardiovascular Disease Prediction Model Based on Routine Physical Examination Indicators Using Machine Learning Methods: A Cohort Study
BACKGROUND: Cardiovascular diseases (CVD) are currently the leading cause of premature death worldwide. Model-based early detection of high-risk populations for CVD is the key to CVD prevention. Thus, this research aimed to use machine learning (ML) algorithms to establish a CVD prediction model bas...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9247206/ https://www.ncbi.nlm.nih.gov/pubmed/35783868 http://dx.doi.org/10.3389/fcvm.2022.854287 |
_version_ | 1784739102468341760 |
---|---|
author | Qian, Xin Li, Yu Zhang, Xianghui Guo, Heng He, Jia Wang, Xinping Yan, Yizhong Ma, Jiaolong Ma, Rulin Guo, Shuxia |
author_facet | Qian, Xin Li, Yu Zhang, Xianghui Guo, Heng He, Jia Wang, Xinping Yan, Yizhong Ma, Jiaolong Ma, Rulin Guo, Shuxia |
author_sort | Qian, Xin |
collection | PubMed |
description | BACKGROUND: Cardiovascular diseases (CVD) are currently the leading cause of premature death worldwide. Model-based early detection of high-risk populations for CVD is the key to CVD prevention. Thus, this research aimed to use machine learning (ML) algorithms to establish a CVD prediction model based on routine physical examination indicators suitable for the Xinjiang rural population. METHOD: The research cohort data collection was divided into two stages. The first stage involved a baseline survey from 2010 to 2012, with follow-up ending in December 2017. The second-phase baseline survey was conducted from September to December 2016, and follow-up ended in August 2021. A total of 12,692 participants (10,407 Uyghur and 2,285 Kazak) were included in the study. Screening predictors and establishing variable subsets were based on least absolute shrinkage and selection operator (Lasso) regression, logistic regression forward partial likelihood estimation (FLR), random forest (RF) feature importance, and RF variable importance. The selected subset of variables was compared with L1 regularized logistic regression (L1-LR), RF, support vector machine (SVM), and AdaBoost algorithm to establish a CVD prediction model suitable for this population. The incidence of CVD in this population was then analyzed. RESULT: After 4.94 years of follow-up, a total of 1,176 people were diagnosed with CVD (cumulative incidence: 9.27%). In the comparison of discrimination and calibration, the prediction performance of the subset of variables selected based on FLR was better than that of other models. Combining the results of discrimination, calibration, and clinical validity, the prediction model based on L1-LR had the best prediction performance. Age, systolic blood pressure, low-density lipoprotein-L/high-density lipoproteins-C, triglyceride blood glucose index, body mass index, and body adiposity index were all important predictors of the onset of CVD in the Xinjiang rural population. CONCLUSION: In the Xinjiang rural population, the prediction model based on L1-LR had the best prediction performance. |
format | Online Article Text |
id | pubmed-9247206 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-92472062022-07-02 A Cardiovascular Disease Prediction Model Based on Routine Physical Examination Indicators Using Machine Learning Methods: A Cohort Study Qian, Xin Li, Yu Zhang, Xianghui Guo, Heng He, Jia Wang, Xinping Yan, Yizhong Ma, Jiaolong Ma, Rulin Guo, Shuxia Front Cardiovasc Med Cardiovascular Medicine BACKGROUND: Cardiovascular diseases (CVD) are currently the leading cause of premature death worldwide. Model-based early detection of high-risk populations for CVD is the key to CVD prevention. Thus, this research aimed to use machine learning (ML) algorithms to establish a CVD prediction model based on routine physical examination indicators suitable for the Xinjiang rural population. METHOD: The research cohort data collection was divided into two stages. The first stage involved a baseline survey from 2010 to 2012, with follow-up ending in December 2017. The second-phase baseline survey was conducted from September to December 2016, and follow-up ended in August 2021. A total of 12,692 participants (10,407 Uyghur and 2,285 Kazak) were included in the study. Screening predictors and establishing variable subsets were based on least absolute shrinkage and selection operator (Lasso) regression, logistic regression forward partial likelihood estimation (FLR), random forest (RF) feature importance, and RF variable importance. The selected subset of variables was compared with L1 regularized logistic regression (L1-LR), RF, support vector machine (SVM), and AdaBoost algorithm to establish a CVD prediction model suitable for this population. The incidence of CVD in this population was then analyzed. RESULT: After 4.94 years of follow-up, a total of 1,176 people were diagnosed with CVD (cumulative incidence: 9.27%). In the comparison of discrimination and calibration, the prediction performance of the subset of variables selected based on FLR was better than that of other models. Combining the results of discrimination, calibration, and clinical validity, the prediction model based on L1-LR had the best prediction performance. Age, systolic blood pressure, low-density lipoprotein-L/high-density lipoproteins-C, triglyceride blood glucose index, body mass index, and body adiposity index were all important predictors of the onset of CVD in the Xinjiang rural population. CONCLUSION: In the Xinjiang rural population, the prediction model based on L1-LR had the best prediction performance. Frontiers Media S.A. 2022-06-17 /pmc/articles/PMC9247206/ /pubmed/35783868 http://dx.doi.org/10.3389/fcvm.2022.854287 Text en Copyright © 2022 Qian, Li, Zhang, Guo, He, Wang, Yan, Ma, Ma and Guo. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Cardiovascular Medicine Qian, Xin Li, Yu Zhang, Xianghui Guo, Heng He, Jia Wang, Xinping Yan, Yizhong Ma, Jiaolong Ma, Rulin Guo, Shuxia A Cardiovascular Disease Prediction Model Based on Routine Physical Examination Indicators Using Machine Learning Methods: A Cohort Study |
title | A Cardiovascular Disease Prediction Model Based on Routine Physical Examination Indicators Using Machine Learning Methods: A Cohort Study |
title_full | A Cardiovascular Disease Prediction Model Based on Routine Physical Examination Indicators Using Machine Learning Methods: A Cohort Study |
title_fullStr | A Cardiovascular Disease Prediction Model Based on Routine Physical Examination Indicators Using Machine Learning Methods: A Cohort Study |
title_full_unstemmed | A Cardiovascular Disease Prediction Model Based on Routine Physical Examination Indicators Using Machine Learning Methods: A Cohort Study |
title_short | A Cardiovascular Disease Prediction Model Based on Routine Physical Examination Indicators Using Machine Learning Methods: A Cohort Study |
title_sort | cardiovascular disease prediction model based on routine physical examination indicators using machine learning methods: a cohort study |
topic | Cardiovascular Medicine |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9247206/ https://www.ncbi.nlm.nih.gov/pubmed/35783868 http://dx.doi.org/10.3389/fcvm.2022.854287 |
work_keys_str_mv | AT qianxin acardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy AT liyu acardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy AT zhangxianghui acardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy AT guoheng acardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy AT hejia acardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy AT wangxinping acardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy AT yanyizhong acardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy AT majiaolong acardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy AT marulin acardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy AT guoshuxia acardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy AT qianxin cardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy AT liyu cardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy AT zhangxianghui cardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy AT guoheng cardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy AT hejia cardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy AT wangxinping cardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy AT yanyizhong cardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy AT majiaolong cardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy AT marulin cardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy AT guoshuxia cardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy |