Cargando…

Machine learning identifies prominent factors associated with cardiovascular disease: findings from two million adults in the Kashgar Prospective Cohort Study (KPCS)

BACKGROUND: Identifying factors associated with cardiovascular disease (CVD) is critical for its prevention, but this topic is scarcely investigated in Kashgar prefecture, Xinjiang, northwestern China. We thus explored the CVD epidemiology and identified prominent factors associated with CVD in this...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Jia-Xin, Li, Li, Zhong, Xuemei, Fan, Shu-Jun, Cen, Tao, Wang, Jianquan, He, Chuanjiang, Zhang, Zhoubin, Luo, Ya-Na, Liu, Xiao-Xuan, Hu, Li-Xin, Zhang, Yi-Dan, Qiu, Hui-Ling, Dong, Guang-Hui, Zou, Xiao-Guang, Yang, Bo-Yi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9724436/
https://www.ncbi.nlm.nih.gov/pubmed/36474302
http://dx.doi.org/10.1186/s41256-022-00282-y
_version_ 1784844415612747776
author Li, Jia-Xin
Li, Li
Zhong, Xuemei
Fan, Shu-Jun
Cen, Tao
Wang, Jianquan
He, Chuanjiang
Zhang, Zhoubin
Luo, Ya-Na
Liu, Xiao-Xuan
Hu, Li-Xin
Zhang, Yi-Dan
Qiu, Hui-Ling
Dong, Guang-Hui
Zou, Xiao-Guang
Yang, Bo-Yi
author_facet Li, Jia-Xin
Li, Li
Zhong, Xuemei
Fan, Shu-Jun
Cen, Tao
Wang, Jianquan
He, Chuanjiang
Zhang, Zhoubin
Luo, Ya-Na
Liu, Xiao-Xuan
Hu, Li-Xin
Zhang, Yi-Dan
Qiu, Hui-Ling
Dong, Guang-Hui
Zou, Xiao-Guang
Yang, Bo-Yi
author_sort Li, Jia-Xin
collection PubMed
description BACKGROUND: Identifying factors associated with cardiovascular disease (CVD) is critical for its prevention, but this topic is scarcely investigated in Kashgar prefecture, Xinjiang, northwestern China. We thus explored the CVD epidemiology and identified prominent factors associated with CVD in this region. METHODS: A total of 1,887,710 adults at baseline (in 2017) of the Kashgar Prospective Cohort Study were included in the analysis. Sixteen candidate factors, including seven demographic factors, 4 lifestyle factors, and 5 clinical factors, were collected from a questionnaire and health examination records. CVD was defined according to International Clinical Diagnosis (ICD-10) codes. We first used logistic regression models to investigate the association between each of the candidate factors and CVD. Then, we employed 3 machine learning methods—Random Forest, Random Ferns, and Extreme Gradient Boosting—to rank and identify prominent factors associated with CVD. Stratification analyses by sex, ethnicity, education level, economic status, and residential setting were also performed to test the consistency of the ranking. RESULTS: The prevalence of CVD in Kashgar prefecture was 8.1%. All the 16 candidate factors were confirmed to be significantly associated with CVD (odds ratios ranged from 1.03 to 2.99, all p values < 0.05) in logistic regression models. Further machine learning-based analysis suggested that age, occupation, hypertension, exercise frequency, and dietary pattern were the five most prominent factors associated with CVD. The ranking of relative importance for prominent factors in stratification analyses showed that the factor importance generally followed the same pattern as that in the overall sample. CONCLUSIONS: CVD is a major public health concern in Kashgar prefecture. Age, occupation, hypertension, exercise frequency, and dietary pattern might be the prominent factors associated with CVD in this region.In the future, these factors should be given priority in preventing CVD in future. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s41256-022-00282-y.
format Online
Article
Text
id pubmed-9724436
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-97244362022-12-07 Machine learning identifies prominent factors associated with cardiovascular disease: findings from two million adults in the Kashgar Prospective Cohort Study (KPCS) Li, Jia-Xin Li, Li Zhong, Xuemei Fan, Shu-Jun Cen, Tao Wang, Jianquan He, Chuanjiang Zhang, Zhoubin Luo, Ya-Na Liu, Xiao-Xuan Hu, Li-Xin Zhang, Yi-Dan Qiu, Hui-Ling Dong, Guang-Hui Zou, Xiao-Guang Yang, Bo-Yi Glob Health Res Policy Research BACKGROUND: Identifying factors associated with cardiovascular disease (CVD) is critical for its prevention, but this topic is scarcely investigated in Kashgar prefecture, Xinjiang, northwestern China. We thus explored the CVD epidemiology and identified prominent factors associated with CVD in this region. METHODS: A total of 1,887,710 adults at baseline (in 2017) of the Kashgar Prospective Cohort Study were included in the analysis. Sixteen candidate factors, including seven demographic factors, 4 lifestyle factors, and 5 clinical factors, were collected from a questionnaire and health examination records. CVD was defined according to International Clinical Diagnosis (ICD-10) codes. We first used logistic regression models to investigate the association between each of the candidate factors and CVD. Then, we employed 3 machine learning methods—Random Forest, Random Ferns, and Extreme Gradient Boosting—to rank and identify prominent factors associated with CVD. Stratification analyses by sex, ethnicity, education level, economic status, and residential setting were also performed to test the consistency of the ranking. RESULTS: The prevalence of CVD in Kashgar prefecture was 8.1%. All the 16 candidate factors were confirmed to be significantly associated with CVD (odds ratios ranged from 1.03 to 2.99, all p values < 0.05) in logistic regression models. Further machine learning-based analysis suggested that age, occupation, hypertension, exercise frequency, and dietary pattern were the five most prominent factors associated with CVD. The ranking of relative importance for prominent factors in stratification analyses showed that the factor importance generally followed the same pattern as that in the overall sample. CONCLUSIONS: CVD is a major public health concern in Kashgar prefecture. Age, occupation, hypertension, exercise frequency, and dietary pattern might be the prominent factors associated with CVD in this region.In the future, these factors should be given priority in preventing CVD in future. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s41256-022-00282-y. BioMed Central 2022-12-06 /pmc/articles/PMC9724436/ /pubmed/36474302 http://dx.doi.org/10.1186/s41256-022-00282-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research
Li, Jia-Xin
Li, Li
Zhong, Xuemei
Fan, Shu-Jun
Cen, Tao
Wang, Jianquan
He, Chuanjiang
Zhang, Zhoubin
Luo, Ya-Na
Liu, Xiao-Xuan
Hu, Li-Xin
Zhang, Yi-Dan
Qiu, Hui-Ling
Dong, Guang-Hui
Zou, Xiao-Guang
Yang, Bo-Yi
Machine learning identifies prominent factors associated with cardiovascular disease: findings from two million adults in the Kashgar Prospective Cohort Study (KPCS)
title Machine learning identifies prominent factors associated with cardiovascular disease: findings from two million adults in the Kashgar Prospective Cohort Study (KPCS)
title_full Machine learning identifies prominent factors associated with cardiovascular disease: findings from two million adults in the Kashgar Prospective Cohort Study (KPCS)
title_fullStr Machine learning identifies prominent factors associated with cardiovascular disease: findings from two million adults in the Kashgar Prospective Cohort Study (KPCS)
title_full_unstemmed Machine learning identifies prominent factors associated with cardiovascular disease: findings from two million adults in the Kashgar Prospective Cohort Study (KPCS)
title_short Machine learning identifies prominent factors associated with cardiovascular disease: findings from two million adults in the Kashgar Prospective Cohort Study (KPCS)
title_sort machine learning identifies prominent factors associated with cardiovascular disease: findings from two million adults in the kashgar prospective cohort study (kpcs)
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9724436/
https://www.ncbi.nlm.nih.gov/pubmed/36474302
http://dx.doi.org/10.1186/s41256-022-00282-y
work_keys_str_mv AT lijiaxin machinelearningidentifiesprominentfactorsassociatedwithcardiovasculardiseasefindingsfromtwomillionadultsinthekashgarprospectivecohortstudykpcs
AT lili machinelearningidentifiesprominentfactorsassociatedwithcardiovasculardiseasefindingsfromtwomillionadultsinthekashgarprospectivecohortstudykpcs
AT zhongxuemei machinelearningidentifiesprominentfactorsassociatedwithcardiovasculardiseasefindingsfromtwomillionadultsinthekashgarprospectivecohortstudykpcs
AT fanshujun machinelearningidentifiesprominentfactorsassociatedwithcardiovasculardiseasefindingsfromtwomillionadultsinthekashgarprospectivecohortstudykpcs
AT centao machinelearningidentifiesprominentfactorsassociatedwithcardiovasculardiseasefindingsfromtwomillionadultsinthekashgarprospectivecohortstudykpcs
AT wangjianquan machinelearningidentifiesprominentfactorsassociatedwithcardiovasculardiseasefindingsfromtwomillionadultsinthekashgarprospectivecohortstudykpcs
AT hechuanjiang machinelearningidentifiesprominentfactorsassociatedwithcardiovasculardiseasefindingsfromtwomillionadultsinthekashgarprospectivecohortstudykpcs
AT zhangzhoubin machinelearningidentifiesprominentfactorsassociatedwithcardiovasculardiseasefindingsfromtwomillionadultsinthekashgarprospectivecohortstudykpcs
AT luoyana machinelearningidentifiesprominentfactorsassociatedwithcardiovasculardiseasefindingsfromtwomillionadultsinthekashgarprospectivecohortstudykpcs
AT liuxiaoxuan machinelearningidentifiesprominentfactorsassociatedwithcardiovasculardiseasefindingsfromtwomillionadultsinthekashgarprospectivecohortstudykpcs
AT hulixin machinelearningidentifiesprominentfactorsassociatedwithcardiovasculardiseasefindingsfromtwomillionadultsinthekashgarprospectivecohortstudykpcs
AT zhangyidan machinelearningidentifiesprominentfactorsassociatedwithcardiovasculardiseasefindingsfromtwomillionadultsinthekashgarprospectivecohortstudykpcs
AT qiuhuiling machinelearningidentifiesprominentfactorsassociatedwithcardiovasculardiseasefindingsfromtwomillionadultsinthekashgarprospectivecohortstudykpcs
AT dongguanghui machinelearningidentifiesprominentfactorsassociatedwithcardiovasculardiseasefindingsfromtwomillionadultsinthekashgarprospectivecohortstudykpcs
AT zouxiaoguang machinelearningidentifiesprominentfactorsassociatedwithcardiovasculardiseasefindingsfromtwomillionadultsinthekashgarprospectivecohortstudykpcs
AT yangboyi machinelearningidentifiesprominentfactorsassociatedwithcardiovasculardiseasefindingsfromtwomillionadultsinthekashgarprospectivecohortstudykpcs