Cargando…
Tree-Based Risk Factor Identification and Stroke Level Prediction in Stroke Cohort Study
Objective. This study focuses on the identification of risk factors, classification of stroke level, and evaluation of the importance and interactions of various patient characteristics using cohort data from the Second Hospital of Lanzhou University. Methodology. Risk factors are identified by eval...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10110369/ https://www.ncbi.nlm.nih.gov/pubmed/37078009 http://dx.doi.org/10.1155/2023/7352191 |
_version_ | 1785027248649142272 |
---|---|
author | Li, Junyao Luo, Yuxiang Dong, Meina Liang, Yating Zhao, Xuejing Zhang, Yafeng Ge, Zhaoming |
author_facet | Li, Junyao Luo, Yuxiang Dong, Meina Liang, Yating Zhao, Xuejing Zhang, Yafeng Ge, Zhaoming |
author_sort | Li, Junyao |
collection | PubMed |
description | Objective. This study focuses on the identification of risk factors, classification of stroke level, and evaluation of the importance and interactions of various patient characteristics using cohort data from the Second Hospital of Lanzhou University. Methodology. Risk factors are identified by evaluation of the relationships between factors and response, as well as by ranking the importance of characteristics. Then, after discarding negligible factors, some well-known multicategorical classification algorithms are used to predict the level of stroke. In addition, using the Shapley additive explanation method (SHAP), factors with positive and negative effects are identified, and some important interactions for classifying the level of stroke are proposed. A waterfall plot for a specific patient is presented and used to determine the risk degree of that patient. Results and Conclusion. The results show that (1) the most important risk factors for stroke are hypertension, history of transient ischemia, and history of stroke; age and gender have a negligible impact. (2) The XGBoost model shows the best performance in predicting stroke risk; it also gives a ranking of risk factors based on their impact. (3) A combination of SHAP and XGBoost can be used to identify positive and negative factors and their interactions in stroke prediction, thereby providing helpful guidance for diagnosis. |
format | Online Article Text |
id | pubmed-10110369 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-101103692023-04-18 Tree-Based Risk Factor Identification and Stroke Level Prediction in Stroke Cohort Study Li, Junyao Luo, Yuxiang Dong, Meina Liang, Yating Zhao, Xuejing Zhang, Yafeng Ge, Zhaoming Biomed Res Int Research Article Objective. This study focuses on the identification of risk factors, classification of stroke level, and evaluation of the importance and interactions of various patient characteristics using cohort data from the Second Hospital of Lanzhou University. Methodology. Risk factors are identified by evaluation of the relationships between factors and response, as well as by ranking the importance of characteristics. Then, after discarding negligible factors, some well-known multicategorical classification algorithms are used to predict the level of stroke. In addition, using the Shapley additive explanation method (SHAP), factors with positive and negative effects are identified, and some important interactions for classifying the level of stroke are proposed. A waterfall plot for a specific patient is presented and used to determine the risk degree of that patient. Results and Conclusion. The results show that (1) the most important risk factors for stroke are hypertension, history of transient ischemia, and history of stroke; age and gender have a negligible impact. (2) The XGBoost model shows the best performance in predicting stroke risk; it also gives a ranking of risk factors based on their impact. (3) A combination of SHAP and XGBoost can be used to identify positive and negative factors and their interactions in stroke prediction, thereby providing helpful guidance for diagnosis. Hindawi 2023-04-10 /pmc/articles/PMC10110369/ /pubmed/37078009 http://dx.doi.org/10.1155/2023/7352191 Text en Copyright © 2023 Junyao Li et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Li, Junyao Luo, Yuxiang Dong, Meina Liang, Yating Zhao, Xuejing Zhang, Yafeng Ge, Zhaoming Tree-Based Risk Factor Identification and Stroke Level Prediction in Stroke Cohort Study |
title | Tree-Based Risk Factor Identification and Stroke Level Prediction in Stroke Cohort Study |
title_full | Tree-Based Risk Factor Identification and Stroke Level Prediction in Stroke Cohort Study |
title_fullStr | Tree-Based Risk Factor Identification and Stroke Level Prediction in Stroke Cohort Study |
title_full_unstemmed | Tree-Based Risk Factor Identification and Stroke Level Prediction in Stroke Cohort Study |
title_short | Tree-Based Risk Factor Identification and Stroke Level Prediction in Stroke Cohort Study |
title_sort | tree-based risk factor identification and stroke level prediction in stroke cohort study |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10110369/ https://www.ncbi.nlm.nih.gov/pubmed/37078009 http://dx.doi.org/10.1155/2023/7352191 |
work_keys_str_mv | AT lijunyao treebasedriskfactoridentificationandstrokelevelpredictioninstrokecohortstudy AT luoyuxiang treebasedriskfactoridentificationandstrokelevelpredictioninstrokecohortstudy AT dongmeina treebasedriskfactoridentificationandstrokelevelpredictioninstrokecohortstudy AT liangyating treebasedriskfactoridentificationandstrokelevelpredictioninstrokecohortstudy AT zhaoxuejing treebasedriskfactoridentificationandstrokelevelpredictioninstrokecohortstudy AT zhangyafeng treebasedriskfactoridentificationandstrokelevelpredictioninstrokecohortstudy AT gezhaoming treebasedriskfactoridentificationandstrokelevelpredictioninstrokecohortstudy |