Cargando…
Developing and externally validating a machine learning risk prediction model for 30-day mortality after stroke using national stroke registers in the UK and Sweden
OBJECTIVES: We aimed to develop and externally validate a generalisable risk prediction model for 30-day stroke mortality suitable for supporting quality improvement analytics in stroke care using large nationwide stroke registers in the UK and Sweden. DESIGN: Registry-based cohort study. SETTING: S...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BMJ Publishing Group
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10660948/ https://www.ncbi.nlm.nih.gov/pubmed/37968001 http://dx.doi.org/10.1136/bmjopen-2022-069811 |
_version_ | 1785148442810515456 |
---|---|
author | Wang, Wenjuan Otieno, Josline A Eriksson, Marie Wolfe, Charles D Curcin, Vasa Bray, Benjamin D |
author_facet | Wang, Wenjuan Otieno, Josline A Eriksson, Marie Wolfe, Charles D Curcin, Vasa Bray, Benjamin D |
author_sort | Wang, Wenjuan |
collection | PubMed |
description | OBJECTIVES: We aimed to develop and externally validate a generalisable risk prediction model for 30-day stroke mortality suitable for supporting quality improvement analytics in stroke care using large nationwide stroke registers in the UK and Sweden. DESIGN: Registry-based cohort study. SETTING: Stroke registries including the Sentinel Stroke National Audit Programme (SSNAP) in England, Wales and Northern Ireland (2013–2019) and the national Swedish stroke register (Riksstroke 2015–2020). PARTICIPANTS AND METHODS: Data from SSNAP were used for developing and temporally validating the model, and data from Riksstroke were used for external validation. Models were developed with the variables available in both registries using logistic regression (LR), LR with elastic net and interaction terms and eXtreme Gradient Boosting (XGBoost). Performances were evaluated with discrimination, calibration and decision curves. OUTCOME MEASURES: The primary outcome was all-cause 30-day in-hospital mortality after stroke. RESULTS: In total, 488 497 patients who had a stroke with 12.4% 30-day in-hospital mortality were used for developing and temporally validating the model in the UK. A total of 128 360 patients who had a stroke with 10.8% 30-day in-hospital mortality and 13.1% all mortality were used for external validation in Sweden. In the SSNAP temporal validation set, the final XGBoost model achieved the highest area under the receiver operating characteristic curve (AUC) (0.852 (95% CI 0.848 to 0.855)) and was well calibrated. The performances on the external validation in Riksstroke were as good and achieved AUC at 0.861 (95% CI 0.858 to 0.865) for in-hospital mortality. For Riksstroke, the models slightly overestimated the risk for in-hospital mortality, while they were better calibrated at the risk for all mortality. CONCLUSION: The risk prediction model was accurate and externally validated using high quality registry data. This is potentially suitable to be deployed as part of quality improvement analytics in stroke care to enable the fair comparison of stroke mortality outcomes across hospitals and health systems across countries |
format | Online Article Text |
id | pubmed-10660948 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BMJ Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-106609482023-11-15 Developing and externally validating a machine learning risk prediction model for 30-day mortality after stroke using national stroke registers in the UK and Sweden Wang, Wenjuan Otieno, Josline A Eriksson, Marie Wolfe, Charles D Curcin, Vasa Bray, Benjamin D BMJ Open Neurology OBJECTIVES: We aimed to develop and externally validate a generalisable risk prediction model for 30-day stroke mortality suitable for supporting quality improvement analytics in stroke care using large nationwide stroke registers in the UK and Sweden. DESIGN: Registry-based cohort study. SETTING: Stroke registries including the Sentinel Stroke National Audit Programme (SSNAP) in England, Wales and Northern Ireland (2013–2019) and the national Swedish stroke register (Riksstroke 2015–2020). PARTICIPANTS AND METHODS: Data from SSNAP were used for developing and temporally validating the model, and data from Riksstroke were used for external validation. Models were developed with the variables available in both registries using logistic regression (LR), LR with elastic net and interaction terms and eXtreme Gradient Boosting (XGBoost). Performances were evaluated with discrimination, calibration and decision curves. OUTCOME MEASURES: The primary outcome was all-cause 30-day in-hospital mortality after stroke. RESULTS: In total, 488 497 patients who had a stroke with 12.4% 30-day in-hospital mortality were used for developing and temporally validating the model in the UK. A total of 128 360 patients who had a stroke with 10.8% 30-day in-hospital mortality and 13.1% all mortality were used for external validation in Sweden. In the SSNAP temporal validation set, the final XGBoost model achieved the highest area under the receiver operating characteristic curve (AUC) (0.852 (95% CI 0.848 to 0.855)) and was well calibrated. The performances on the external validation in Riksstroke were as good and achieved AUC at 0.861 (95% CI 0.858 to 0.865) for in-hospital mortality. For Riksstroke, the models slightly overestimated the risk for in-hospital mortality, while they were better calibrated at the risk for all mortality. CONCLUSION: The risk prediction model was accurate and externally validated using high quality registry data. This is potentially suitable to be deployed as part of quality improvement analytics in stroke care to enable the fair comparison of stroke mortality outcomes across hospitals and health systems across countries BMJ Publishing Group 2023-11-15 /pmc/articles/PMC10660948/ /pubmed/37968001 http://dx.doi.org/10.1136/bmjopen-2022-069811 Text en © Author(s) (or their employer(s)) 2023. Re-use permitted under CC BY. Published by BMJ. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution 4.0 Unported (CC BY 4.0) license, which permits others to copy, redistribute, remix, transform and build upon this work for any purpose, provided the original work is properly cited, a link to the licence is given, and indication of whether changes were made. See: https://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Neurology Wang, Wenjuan Otieno, Josline A Eriksson, Marie Wolfe, Charles D Curcin, Vasa Bray, Benjamin D Developing and externally validating a machine learning risk prediction model for 30-day mortality after stroke using national stroke registers in the UK and Sweden |
title | Developing and externally validating a machine learning risk prediction model for 30-day mortality after stroke using national stroke registers in the UK and Sweden |
title_full | Developing and externally validating a machine learning risk prediction model for 30-day mortality after stroke using national stroke registers in the UK and Sweden |
title_fullStr | Developing and externally validating a machine learning risk prediction model for 30-day mortality after stroke using national stroke registers in the UK and Sweden |
title_full_unstemmed | Developing and externally validating a machine learning risk prediction model for 30-day mortality after stroke using national stroke registers in the UK and Sweden |
title_short | Developing and externally validating a machine learning risk prediction model for 30-day mortality after stroke using national stroke registers in the UK and Sweden |
title_sort | developing and externally validating a machine learning risk prediction model for 30-day mortality after stroke using national stroke registers in the uk and sweden |
topic | Neurology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10660948/ https://www.ncbi.nlm.nih.gov/pubmed/37968001 http://dx.doi.org/10.1136/bmjopen-2022-069811 |
work_keys_str_mv | AT wangwenjuan developingandexternallyvalidatingamachinelearningriskpredictionmodelfor30daymortalityafterstrokeusingnationalstrokeregistersintheukandsweden AT otienojoslinea developingandexternallyvalidatingamachinelearningriskpredictionmodelfor30daymortalityafterstrokeusingnationalstrokeregistersintheukandsweden AT erikssonmarie developingandexternallyvalidatingamachinelearningriskpredictionmodelfor30daymortalityafterstrokeusingnationalstrokeregistersintheukandsweden AT wolfecharlesd developingandexternallyvalidatingamachinelearningriskpredictionmodelfor30daymortalityafterstrokeusingnationalstrokeregistersintheukandsweden AT curcinvasa developingandexternallyvalidatingamachinelearningriskpredictionmodelfor30daymortalityafterstrokeusingnationalstrokeregistersintheukandsweden AT braybenjamind developingandexternallyvalidatingamachinelearningriskpredictionmodelfor30daymortalityafterstrokeusingnationalstrokeregistersintheukandsweden |