Cargando…

A Cardiovascular Disease Prediction Model Based on Routine Physical Examination Indicators Using Machine Learning Methods: A Cohort Study

BACKGROUND: Cardiovascular diseases (CVD) are currently the leading cause of premature death worldwide. Model-based early detection of high-risk populations for CVD is the key to CVD prevention. Thus, this research aimed to use machine learning (ML) algorithms to establish a CVD prediction model bas...

Descripción completa

Detalles Bibliográficos
Autores principales: Qian, Xin, Li, Yu, Zhang, Xianghui, Guo, Heng, He, Jia, Wang, Xinping, Yan, Yizhong, Ma, Jiaolong, Ma, Rulin, Guo, Shuxia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9247206/
https://www.ncbi.nlm.nih.gov/pubmed/35783868
http://dx.doi.org/10.3389/fcvm.2022.854287
_version_ 1784739102468341760
author Qian, Xin
Li, Yu
Zhang, Xianghui
Guo, Heng
He, Jia
Wang, Xinping
Yan, Yizhong
Ma, Jiaolong
Ma, Rulin
Guo, Shuxia
author_facet Qian, Xin
Li, Yu
Zhang, Xianghui
Guo, Heng
He, Jia
Wang, Xinping
Yan, Yizhong
Ma, Jiaolong
Ma, Rulin
Guo, Shuxia
author_sort Qian, Xin
collection PubMed
description BACKGROUND: Cardiovascular diseases (CVD) are currently the leading cause of premature death worldwide. Model-based early detection of high-risk populations for CVD is the key to CVD prevention. Thus, this research aimed to use machine learning (ML) algorithms to establish a CVD prediction model based on routine physical examination indicators suitable for the Xinjiang rural population. METHOD: The research cohort data collection was divided into two stages. The first stage involved a baseline survey from 2010 to 2012, with follow-up ending in December 2017. The second-phase baseline survey was conducted from September to December 2016, and follow-up ended in August 2021. A total of 12,692 participants (10,407 Uyghur and 2,285 Kazak) were included in the study. Screening predictors and establishing variable subsets were based on least absolute shrinkage and selection operator (Lasso) regression, logistic regression forward partial likelihood estimation (FLR), random forest (RF) feature importance, and RF variable importance. The selected subset of variables was compared with L1 regularized logistic regression (L1-LR), RF, support vector machine (SVM), and AdaBoost algorithm to establish a CVD prediction model suitable for this population. The incidence of CVD in this population was then analyzed. RESULT: After 4.94 years of follow-up, a total of 1,176 people were diagnosed with CVD (cumulative incidence: 9.27%). In the comparison of discrimination and calibration, the prediction performance of the subset of variables selected based on FLR was better than that of other models. Combining the results of discrimination, calibration, and clinical validity, the prediction model based on L1-LR had the best prediction performance. Age, systolic blood pressure, low-density lipoprotein-L/high-density lipoproteins-C, triglyceride blood glucose index, body mass index, and body adiposity index were all important predictors of the onset of CVD in the Xinjiang rural population. CONCLUSION: In the Xinjiang rural population, the prediction model based on L1-LR had the best prediction performance.
format Online
Article
Text
id pubmed-9247206
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-92472062022-07-02 A Cardiovascular Disease Prediction Model Based on Routine Physical Examination Indicators Using Machine Learning Methods: A Cohort Study Qian, Xin Li, Yu Zhang, Xianghui Guo, Heng He, Jia Wang, Xinping Yan, Yizhong Ma, Jiaolong Ma, Rulin Guo, Shuxia Front Cardiovasc Med Cardiovascular Medicine BACKGROUND: Cardiovascular diseases (CVD) are currently the leading cause of premature death worldwide. Model-based early detection of high-risk populations for CVD is the key to CVD prevention. Thus, this research aimed to use machine learning (ML) algorithms to establish a CVD prediction model based on routine physical examination indicators suitable for the Xinjiang rural population. METHOD: The research cohort data collection was divided into two stages. The first stage involved a baseline survey from 2010 to 2012, with follow-up ending in December 2017. The second-phase baseline survey was conducted from September to December 2016, and follow-up ended in August 2021. A total of 12,692 participants (10,407 Uyghur and 2,285 Kazak) were included in the study. Screening predictors and establishing variable subsets were based on least absolute shrinkage and selection operator (Lasso) regression, logistic regression forward partial likelihood estimation (FLR), random forest (RF) feature importance, and RF variable importance. The selected subset of variables was compared with L1 regularized logistic regression (L1-LR), RF, support vector machine (SVM), and AdaBoost algorithm to establish a CVD prediction model suitable for this population. The incidence of CVD in this population was then analyzed. RESULT: After 4.94 years of follow-up, a total of 1,176 people were diagnosed with CVD (cumulative incidence: 9.27%). In the comparison of discrimination and calibration, the prediction performance of the subset of variables selected based on FLR was better than that of other models. Combining the results of discrimination, calibration, and clinical validity, the prediction model based on L1-LR had the best prediction performance. Age, systolic blood pressure, low-density lipoprotein-L/high-density lipoproteins-C, triglyceride blood glucose index, body mass index, and body adiposity index were all important predictors of the onset of CVD in the Xinjiang rural population. CONCLUSION: In the Xinjiang rural population, the prediction model based on L1-LR had the best prediction performance. Frontiers Media S.A. 2022-06-17 /pmc/articles/PMC9247206/ /pubmed/35783868 http://dx.doi.org/10.3389/fcvm.2022.854287 Text en Copyright © 2022 Qian, Li, Zhang, Guo, He, Wang, Yan, Ma, Ma and Guo. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Cardiovascular Medicine
Qian, Xin
Li, Yu
Zhang, Xianghui
Guo, Heng
He, Jia
Wang, Xinping
Yan, Yizhong
Ma, Jiaolong
Ma, Rulin
Guo, Shuxia
A Cardiovascular Disease Prediction Model Based on Routine Physical Examination Indicators Using Machine Learning Methods: A Cohort Study
title A Cardiovascular Disease Prediction Model Based on Routine Physical Examination Indicators Using Machine Learning Methods: A Cohort Study
title_full A Cardiovascular Disease Prediction Model Based on Routine Physical Examination Indicators Using Machine Learning Methods: A Cohort Study
title_fullStr A Cardiovascular Disease Prediction Model Based on Routine Physical Examination Indicators Using Machine Learning Methods: A Cohort Study
title_full_unstemmed A Cardiovascular Disease Prediction Model Based on Routine Physical Examination Indicators Using Machine Learning Methods: A Cohort Study
title_short A Cardiovascular Disease Prediction Model Based on Routine Physical Examination Indicators Using Machine Learning Methods: A Cohort Study
title_sort cardiovascular disease prediction model based on routine physical examination indicators using machine learning methods: a cohort study
topic Cardiovascular Medicine
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9247206/
https://www.ncbi.nlm.nih.gov/pubmed/35783868
http://dx.doi.org/10.3389/fcvm.2022.854287
work_keys_str_mv AT qianxin acardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy
AT liyu acardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy
AT zhangxianghui acardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy
AT guoheng acardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy
AT hejia acardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy
AT wangxinping acardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy
AT yanyizhong acardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy
AT majiaolong acardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy
AT marulin acardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy
AT guoshuxia acardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy
AT qianxin cardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy
AT liyu cardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy
AT zhangxianghui cardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy
AT guoheng cardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy
AT hejia cardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy
AT wangxinping cardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy
AT yanyizhong cardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy
AT majiaolong cardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy
AT marulin cardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy
AT guoshuxia cardiovasculardiseasepredictionmodelbasedonroutinephysicalexaminationindicatorsusingmachinelearningmethodsacohortstudy