Cargando…

Tree-Based Risk Factor Identification and Stroke Level Prediction in Stroke Cohort Study

Objective. This study focuses on the identification of risk factors, classification of stroke level, and evaluation of the importance and interactions of various patient characteristics using cohort data from the Second Hospital of Lanzhou University. Methodology. Risk factors are identified by eval...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Junyao, Luo, Yuxiang, Dong, Meina, Liang, Yating, Zhao, Xuejing, Zhang, Yafeng, Ge, Zhaoming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10110369/
https://www.ncbi.nlm.nih.gov/pubmed/37078009
http://dx.doi.org/10.1155/2023/7352191
_version_ 1785027248649142272
author Li, Junyao
Luo, Yuxiang
Dong, Meina
Liang, Yating
Zhao, Xuejing
Zhang, Yafeng
Ge, Zhaoming
author_facet Li, Junyao
Luo, Yuxiang
Dong, Meina
Liang, Yating
Zhao, Xuejing
Zhang, Yafeng
Ge, Zhaoming
author_sort Li, Junyao
collection PubMed
description Objective. This study focuses on the identification of risk factors, classification of stroke level, and evaluation of the importance and interactions of various patient characteristics using cohort data from the Second Hospital of Lanzhou University. Methodology. Risk factors are identified by evaluation of the relationships between factors and response, as well as by ranking the importance of characteristics. Then, after discarding negligible factors, some well-known multicategorical classification algorithms are used to predict the level of stroke. In addition, using the Shapley additive explanation method (SHAP), factors with positive and negative effects are identified, and some important interactions for classifying the level of stroke are proposed. A waterfall plot for a specific patient is presented and used to determine the risk degree of that patient. Results and Conclusion. The results show that (1) the most important risk factors for stroke are hypertension, history of transient ischemia, and history of stroke; age and gender have a negligible impact. (2) The XGBoost model shows the best performance in predicting stroke risk; it also gives a ranking of risk factors based on their impact. (3) A combination of SHAP and XGBoost can be used to identify positive and negative factors and their interactions in stroke prediction, thereby providing helpful guidance for diagnosis.
format Online
Article
Text
id pubmed-10110369
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-101103692023-04-18 Tree-Based Risk Factor Identification and Stroke Level Prediction in Stroke Cohort Study Li, Junyao Luo, Yuxiang Dong, Meina Liang, Yating Zhao, Xuejing Zhang, Yafeng Ge, Zhaoming Biomed Res Int Research Article Objective. This study focuses on the identification of risk factors, classification of stroke level, and evaluation of the importance and interactions of various patient characteristics using cohort data from the Second Hospital of Lanzhou University. Methodology. Risk factors are identified by evaluation of the relationships between factors and response, as well as by ranking the importance of characteristics. Then, after discarding negligible factors, some well-known multicategorical classification algorithms are used to predict the level of stroke. In addition, using the Shapley additive explanation method (SHAP), factors with positive and negative effects are identified, and some important interactions for classifying the level of stroke are proposed. A waterfall plot for a specific patient is presented and used to determine the risk degree of that patient. Results and Conclusion. The results show that (1) the most important risk factors for stroke are hypertension, history of transient ischemia, and history of stroke; age and gender have a negligible impact. (2) The XGBoost model shows the best performance in predicting stroke risk; it also gives a ranking of risk factors based on their impact. (3) A combination of SHAP and XGBoost can be used to identify positive and negative factors and their interactions in stroke prediction, thereby providing helpful guidance for diagnosis. Hindawi 2023-04-10 /pmc/articles/PMC10110369/ /pubmed/37078009 http://dx.doi.org/10.1155/2023/7352191 Text en Copyright © 2023 Junyao Li et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Li, Junyao
Luo, Yuxiang
Dong, Meina
Liang, Yating
Zhao, Xuejing
Zhang, Yafeng
Ge, Zhaoming
Tree-Based Risk Factor Identification and Stroke Level Prediction in Stroke Cohort Study
title Tree-Based Risk Factor Identification and Stroke Level Prediction in Stroke Cohort Study
title_full Tree-Based Risk Factor Identification and Stroke Level Prediction in Stroke Cohort Study
title_fullStr Tree-Based Risk Factor Identification and Stroke Level Prediction in Stroke Cohort Study
title_full_unstemmed Tree-Based Risk Factor Identification and Stroke Level Prediction in Stroke Cohort Study
title_short Tree-Based Risk Factor Identification and Stroke Level Prediction in Stroke Cohort Study
title_sort tree-based risk factor identification and stroke level prediction in stroke cohort study
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10110369/
https://www.ncbi.nlm.nih.gov/pubmed/37078009
http://dx.doi.org/10.1155/2023/7352191
work_keys_str_mv AT lijunyao treebasedriskfactoridentificationandstrokelevelpredictioninstrokecohortstudy
AT luoyuxiang treebasedriskfactoridentificationandstrokelevelpredictioninstrokecohortstudy
AT dongmeina treebasedriskfactoridentificationandstrokelevelpredictioninstrokecohortstudy
AT liangyating treebasedriskfactoridentificationandstrokelevelpredictioninstrokecohortstudy
AT zhaoxuejing treebasedriskfactoridentificationandstrokelevelpredictioninstrokecohortstudy
AT zhangyafeng treebasedriskfactoridentificationandstrokelevelpredictioninstrokecohortstudy
AT gezhaoming treebasedriskfactoridentificationandstrokelevelpredictioninstrokecohortstudy