Cargando…

Cardiovascular Disease Prediction by Machine Learning Algorithms Based on Cytokines in Kazakhs of China

BACKGROUND: Cardiovascular disease (CVD) is the leading cause of mortality worldwide. Accurately identifying subjects at high-risk of CVD may improve CVD outcomes. We sought to systematically examine the feasibility and performance of 7 widely used machine learning (ML) algorithms in predicting CVD...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Yunxing, Zhang, Xianghui, Ma, Rulin, Wang, Xinping, Liu, Jiaming, Keerman, Mulatibieke, Yan, Yizhong, Ma, Jiaolong, Song, Yanpeng, Zhang, Jingyu, He, Jia, Guo, Shuxia, Guo, Heng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Dove 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8200454/
https://www.ncbi.nlm.nih.gov/pubmed/34135637
http://dx.doi.org/10.2147/CLEP.S313343
_version_ 1783707606561325056
author Jiang, Yunxing
Zhang, Xianghui
Ma, Rulin
Wang, Xinping
Liu, Jiaming
Keerman, Mulatibieke
Yan, Yizhong
Ma, Jiaolong
Song, Yanpeng
Zhang, Jingyu
He, Jia
Guo, Shuxia
Guo, Heng
author_facet Jiang, Yunxing
Zhang, Xianghui
Ma, Rulin
Wang, Xinping
Liu, Jiaming
Keerman, Mulatibieke
Yan, Yizhong
Ma, Jiaolong
Song, Yanpeng
Zhang, Jingyu
He, Jia
Guo, Shuxia
Guo, Heng
author_sort Jiang, Yunxing
collection PubMed
description BACKGROUND: Cardiovascular disease (CVD) is the leading cause of mortality worldwide. Accurately identifying subjects at high-risk of CVD may improve CVD outcomes. We sought to systematically examine the feasibility and performance of 7 widely used machine learning (ML) algorithms in predicting CVD risks. METHODS: The final analysis included 1508 Kazakh subjects in China without CVD at baseline who completed follow-up. All subjects were randomly divided into the training set (80%) and the test set (20%). L1-penalized logistic regression (LR), support vector machine with radial basis function (SVM), decision tree (DT), random forest (RF), k-nearest neighbors (KNN), Gaussian naive Bayes (NB), and extreme gradient boosting (XGB) were employed for prediction CVD outcomes. Ten-fold cross-validation was used during model developing and hyperparameters tuning in the training set. Model performance was evaluated in the test set in light of discrimination, calibration, and clinical usefulness. RF was applied to obtain the variable importance of included variables. Twenty-two variables, including sociodemographic characteristics, medical history, cytokines, and synthetic indices, were used for model development. RESULTS: Among 1508 subjects, 203 were diagnosed with CVD over a median follow-up of 5.17 years. All 7 models had moderate to excellent discrimination (AUC ranged from 0.770 to 0.872) and were well calibrated. LR and SVM performed identically with an AUC of 0.872 (95% CI: 0.829–0.907) and 0.868 (95% CI: 0.825–0.904), respectively. LR had the lowest Brier score (0.078) and the highest sensitivity (97.1%). Decision curve analysis indicated that SVM was slightly better than LR. The inflammatory cytokines, such as hs-CRP and IL-6, were identified as strong predictors of CVD. CONCLUSION: SVM and LR can be applied to guide clinical decision-making in the Kazakh Chinese population, and further study is required to ensure their accuracies.
format Online
Article
Text
id pubmed-8200454
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Dove
record_format MEDLINE/PubMed
spelling pubmed-82004542021-06-15 Cardiovascular Disease Prediction by Machine Learning Algorithms Based on Cytokines in Kazakhs of China Jiang, Yunxing Zhang, Xianghui Ma, Rulin Wang, Xinping Liu, Jiaming Keerman, Mulatibieke Yan, Yizhong Ma, Jiaolong Song, Yanpeng Zhang, Jingyu He, Jia Guo, Shuxia Guo, Heng Clin Epidemiol Original Research BACKGROUND: Cardiovascular disease (CVD) is the leading cause of mortality worldwide. Accurately identifying subjects at high-risk of CVD may improve CVD outcomes. We sought to systematically examine the feasibility and performance of 7 widely used machine learning (ML) algorithms in predicting CVD risks. METHODS: The final analysis included 1508 Kazakh subjects in China without CVD at baseline who completed follow-up. All subjects were randomly divided into the training set (80%) and the test set (20%). L1-penalized logistic regression (LR), support vector machine with radial basis function (SVM), decision tree (DT), random forest (RF), k-nearest neighbors (KNN), Gaussian naive Bayes (NB), and extreme gradient boosting (XGB) were employed for prediction CVD outcomes. Ten-fold cross-validation was used during model developing and hyperparameters tuning in the training set. Model performance was evaluated in the test set in light of discrimination, calibration, and clinical usefulness. RF was applied to obtain the variable importance of included variables. Twenty-two variables, including sociodemographic characteristics, medical history, cytokines, and synthetic indices, were used for model development. RESULTS: Among 1508 subjects, 203 were diagnosed with CVD over a median follow-up of 5.17 years. All 7 models had moderate to excellent discrimination (AUC ranged from 0.770 to 0.872) and were well calibrated. LR and SVM performed identically with an AUC of 0.872 (95% CI: 0.829–0.907) and 0.868 (95% CI: 0.825–0.904), respectively. LR had the lowest Brier score (0.078) and the highest sensitivity (97.1%). Decision curve analysis indicated that SVM was slightly better than LR. The inflammatory cytokines, such as hs-CRP and IL-6, were identified as strong predictors of CVD. CONCLUSION: SVM and LR can be applied to guide clinical decision-making in the Kazakh Chinese population, and further study is required to ensure their accuracies. Dove 2021-06-09 /pmc/articles/PMC8200454/ /pubmed/34135637 http://dx.doi.org/10.2147/CLEP.S313343 Text en © 2021 Jiang et al. https://creativecommons.org/licenses/by-nc/3.0/This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution – Non Commercial (unported, v3.0) License (http://creativecommons.org/licenses/by-nc/3.0/ (https://creativecommons.org/licenses/by-nc/3.0/) ). By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms (https://www.dovepress.com/terms.php).
spellingShingle Original Research
Jiang, Yunxing
Zhang, Xianghui
Ma, Rulin
Wang, Xinping
Liu, Jiaming
Keerman, Mulatibieke
Yan, Yizhong
Ma, Jiaolong
Song, Yanpeng
Zhang, Jingyu
He, Jia
Guo, Shuxia
Guo, Heng
Cardiovascular Disease Prediction by Machine Learning Algorithms Based on Cytokines in Kazakhs of China
title Cardiovascular Disease Prediction by Machine Learning Algorithms Based on Cytokines in Kazakhs of China
title_full Cardiovascular Disease Prediction by Machine Learning Algorithms Based on Cytokines in Kazakhs of China
title_fullStr Cardiovascular Disease Prediction by Machine Learning Algorithms Based on Cytokines in Kazakhs of China
title_full_unstemmed Cardiovascular Disease Prediction by Machine Learning Algorithms Based on Cytokines in Kazakhs of China
title_short Cardiovascular Disease Prediction by Machine Learning Algorithms Based on Cytokines in Kazakhs of China
title_sort cardiovascular disease prediction by machine learning algorithms based on cytokines in kazakhs of china
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8200454/
https://www.ncbi.nlm.nih.gov/pubmed/34135637
http://dx.doi.org/10.2147/CLEP.S313343
work_keys_str_mv AT jiangyunxing cardiovasculardiseasepredictionbymachinelearningalgorithmsbasedoncytokinesinkazakhsofchina
AT zhangxianghui cardiovasculardiseasepredictionbymachinelearningalgorithmsbasedoncytokinesinkazakhsofchina
AT marulin cardiovasculardiseasepredictionbymachinelearningalgorithmsbasedoncytokinesinkazakhsofchina
AT wangxinping cardiovasculardiseasepredictionbymachinelearningalgorithmsbasedoncytokinesinkazakhsofchina
AT liujiaming cardiovasculardiseasepredictionbymachinelearningalgorithmsbasedoncytokinesinkazakhsofchina
AT keermanmulatibieke cardiovasculardiseasepredictionbymachinelearningalgorithmsbasedoncytokinesinkazakhsofchina
AT yanyizhong cardiovasculardiseasepredictionbymachinelearningalgorithmsbasedoncytokinesinkazakhsofchina
AT majiaolong cardiovasculardiseasepredictionbymachinelearningalgorithmsbasedoncytokinesinkazakhsofchina
AT songyanpeng cardiovasculardiseasepredictionbymachinelearningalgorithmsbasedoncytokinesinkazakhsofchina
AT zhangjingyu cardiovasculardiseasepredictionbymachinelearningalgorithmsbasedoncytokinesinkazakhsofchina
AT hejia cardiovasculardiseasepredictionbymachinelearningalgorithmsbasedoncytokinesinkazakhsofchina
AT guoshuxia cardiovasculardiseasepredictionbymachinelearningalgorithmsbasedoncytokinesinkazakhsofchina
AT guoheng cardiovasculardiseasepredictionbymachinelearningalgorithmsbasedoncytokinesinkazakhsofchina