Cargando…

Using explainable machine learning to characterise data drift and detect emergent health risks for emergency department admissions during COVID-19

A key task of emergency departments is to promptly identify patients who require hospital admission. Early identification ensures patient safety and aids organisational planning. Supervised machine learning algorithms can use data describing historical episodes to make ahead-of-time predictions of c...

Descripción completa

Detalles Bibliográficos
Autores principales: Duckworth, Christopher, Chmiel, Francis P., Burns, Dan K., Zlatev, Zlatko D., White, Neil M., Daniels, Thomas W. V., Kiuber, Michael, Boniface, Michael J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8626460/
https://www.ncbi.nlm.nih.gov/pubmed/34837021
http://dx.doi.org/10.1038/s41598-021-02481-y
_version_ 1784606661715951616
author Duckworth, Christopher
Chmiel, Francis P.
Burns, Dan K.
Zlatev, Zlatko D.
White, Neil M.
Daniels, Thomas W. V.
Kiuber, Michael
Boniface, Michael J.
author_facet Duckworth, Christopher
Chmiel, Francis P.
Burns, Dan K.
Zlatev, Zlatko D.
White, Neil M.
Daniels, Thomas W. V.
Kiuber, Michael
Boniface, Michael J.
author_sort Duckworth, Christopher
collection PubMed
description A key task of emergency departments is to promptly identify patients who require hospital admission. Early identification ensures patient safety and aids organisational planning. Supervised machine learning algorithms can use data describing historical episodes to make ahead-of-time predictions of clinical outcomes. Despite this, clinical settings are dynamic environments and the underlying data distributions characterising episodes can change with time (data drift), and so can the relationship between episode characteristics and associated clinical outcomes (concept drift). Practically this means deployed algorithms must be monitored to ensure their safety. We demonstrate how explainable machine learning can be used to monitor data drift, using the COVID-19 pandemic as a severe example. We present a machine learning classifier trained using (pre-COVID-19) data, to identify patients at high risk of admission during an emergency department attendance. We then evaluate our model’s performance on attendances occurring pre-pandemic (AUROC of 0.856 with 95%CI [0.852, 0.859]) and during the COVID-19 pandemic (AUROC of 0.826 with 95%CI [0.814, 0.837]). We demonstrate two benefits of explainable machine learning (SHAP) for models deployed in healthcare settings: (1) By tracking the variation in a feature’s SHAP value relative to its global importance, a complimentary measure of data drift is found which highlights the need to retrain a predictive model. (2) By observing the relative changes in feature importance emergent health risks can be identified.
format Online
Article
Text
id pubmed-8626460
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-86264602021-11-29 Using explainable machine learning to characterise data drift and detect emergent health risks for emergency department admissions during COVID-19 Duckworth, Christopher Chmiel, Francis P. Burns, Dan K. Zlatev, Zlatko D. White, Neil M. Daniels, Thomas W. V. Kiuber, Michael Boniface, Michael J. Sci Rep Article A key task of emergency departments is to promptly identify patients who require hospital admission. Early identification ensures patient safety and aids organisational planning. Supervised machine learning algorithms can use data describing historical episodes to make ahead-of-time predictions of clinical outcomes. Despite this, clinical settings are dynamic environments and the underlying data distributions characterising episodes can change with time (data drift), and so can the relationship between episode characteristics and associated clinical outcomes (concept drift). Practically this means deployed algorithms must be monitored to ensure their safety. We demonstrate how explainable machine learning can be used to monitor data drift, using the COVID-19 pandemic as a severe example. We present a machine learning classifier trained using (pre-COVID-19) data, to identify patients at high risk of admission during an emergency department attendance. We then evaluate our model’s performance on attendances occurring pre-pandemic (AUROC of 0.856 with 95%CI [0.852, 0.859]) and during the COVID-19 pandemic (AUROC of 0.826 with 95%CI [0.814, 0.837]). We demonstrate two benefits of explainable machine learning (SHAP) for models deployed in healthcare settings: (1) By tracking the variation in a feature’s SHAP value relative to its global importance, a complimentary measure of data drift is found which highlights the need to retrain a predictive model. (2) By observing the relative changes in feature importance emergent health risks can be identified. Nature Publishing Group UK 2021-11-26 /pmc/articles/PMC8626460/ /pubmed/34837021 http://dx.doi.org/10.1038/s41598-021-02481-y Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Duckworth, Christopher
Chmiel, Francis P.
Burns, Dan K.
Zlatev, Zlatko D.
White, Neil M.
Daniels, Thomas W. V.
Kiuber, Michael
Boniface, Michael J.
Using explainable machine learning to characterise data drift and detect emergent health risks for emergency department admissions during COVID-19
title Using explainable machine learning to characterise data drift and detect emergent health risks for emergency department admissions during COVID-19
title_full Using explainable machine learning to characterise data drift and detect emergent health risks for emergency department admissions during COVID-19
title_fullStr Using explainable machine learning to characterise data drift and detect emergent health risks for emergency department admissions during COVID-19
title_full_unstemmed Using explainable machine learning to characterise data drift and detect emergent health risks for emergency department admissions during COVID-19
title_short Using explainable machine learning to characterise data drift and detect emergent health risks for emergency department admissions during COVID-19
title_sort using explainable machine learning to characterise data drift and detect emergent health risks for emergency department admissions during covid-19
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8626460/
https://www.ncbi.nlm.nih.gov/pubmed/34837021
http://dx.doi.org/10.1038/s41598-021-02481-y
work_keys_str_mv AT duckworthchristopher usingexplainablemachinelearningtocharacterisedatadriftanddetectemergenthealthrisksforemergencydepartmentadmissionsduringcovid19
AT chmielfrancisp usingexplainablemachinelearningtocharacterisedatadriftanddetectemergenthealthrisksforemergencydepartmentadmissionsduringcovid19
AT burnsdank usingexplainablemachinelearningtocharacterisedatadriftanddetectemergenthealthrisksforemergencydepartmentadmissionsduringcovid19
AT zlatevzlatkod usingexplainablemachinelearningtocharacterisedatadriftanddetectemergenthealthrisksforemergencydepartmentadmissionsduringcovid19
AT whiteneilm usingexplainablemachinelearningtocharacterisedatadriftanddetectemergenthealthrisksforemergencydepartmentadmissionsduringcovid19
AT danielsthomaswv usingexplainablemachinelearningtocharacterisedatadriftanddetectemergenthealthrisksforemergencydepartmentadmissionsduringcovid19
AT kiubermichael usingexplainablemachinelearningtocharacterisedatadriftanddetectemergenthealthrisksforemergencydepartmentadmissionsduringcovid19
AT bonifacemichaelj usingexplainablemachinelearningtocharacterisedatadriftanddetectemergenthealthrisksforemergencydepartmentadmissionsduringcovid19