Cargando…

Identifying mortality factors from Machine Learning using Shapley values – a case of COVID19

In this paper we apply a series of Machine Learning models to a recently published unique dataset on the mortality of COVID19 patients. We use a dataset consisting of blood samples of 375 patients admitted to a hospital in the region of Wuhan, China. There are 201 patients who survived hospitalisati...

Descripción completa

Detalles Bibliográficos
Autores principales: Smith, Matthew, Alvarez, Francisco
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier Ltd. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7948528/
https://www.ncbi.nlm.nih.gov/pubmed/33723478
http://dx.doi.org/10.1016/j.eswa.2021.114832
_version_ 1783663427852435456
author Smith, Matthew
Alvarez, Francisco
author_facet Smith, Matthew
Alvarez, Francisco
author_sort Smith, Matthew
collection PubMed
description In this paper we apply a series of Machine Learning models to a recently published unique dataset on the mortality of COVID19 patients. We use a dataset consisting of blood samples of 375 patients admitted to a hospital in the region of Wuhan, China. There are 201 patients who survived hospitalisation and 174 patients who died whilst in hospital. The focus of the paper is not only on seeing which Machine Learning model is able to obtain the absolute highest accuracy but more on the interpretation of what the Machine Learning models provides. We find that age, days in hospital, Lymphocyte and Neutrophils are important and robust predictors when predicting a patients mortality. Furthermore, the algorithms we use allows us to observe the marginal impact of each variable on a case-by-case patient level, which might help practicioneers to easily detect anomalous patterns. This paper analyses the global and local interpretation of the Machine Learning models on patients with COVID19.
format Online
Article
Text
id pubmed-7948528
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier Ltd.
record_format MEDLINE/PubMed
spelling pubmed-79485282021-03-11 Identifying mortality factors from Machine Learning using Shapley values – a case of COVID19 Smith, Matthew Alvarez, Francisco Expert Syst Appl Short Communication In this paper we apply a series of Machine Learning models to a recently published unique dataset on the mortality of COVID19 patients. We use a dataset consisting of blood samples of 375 patients admitted to a hospital in the region of Wuhan, China. There are 201 patients who survived hospitalisation and 174 patients who died whilst in hospital. The focus of the paper is not only on seeing which Machine Learning model is able to obtain the absolute highest accuracy but more on the interpretation of what the Machine Learning models provides. We find that age, days in hospital, Lymphocyte and Neutrophils are important and robust predictors when predicting a patients mortality. Furthermore, the algorithms we use allows us to observe the marginal impact of each variable on a case-by-case patient level, which might help practicioneers to easily detect anomalous patterns. This paper analyses the global and local interpretation of the Machine Learning models on patients with COVID19. Elsevier Ltd. 2021-08-15 2021-03-11 /pmc/articles/PMC7948528/ /pubmed/33723478 http://dx.doi.org/10.1016/j.eswa.2021.114832 Text en © 2021 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Short Communication
Smith, Matthew
Alvarez, Francisco
Identifying mortality factors from Machine Learning using Shapley values – a case of COVID19
title Identifying mortality factors from Machine Learning using Shapley values – a case of COVID19
title_full Identifying mortality factors from Machine Learning using Shapley values – a case of COVID19
title_fullStr Identifying mortality factors from Machine Learning using Shapley values – a case of COVID19
title_full_unstemmed Identifying mortality factors from Machine Learning using Shapley values – a case of COVID19
title_short Identifying mortality factors from Machine Learning using Shapley values – a case of COVID19
title_sort identifying mortality factors from machine learning using shapley values – a case of covid19
topic Short Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7948528/
https://www.ncbi.nlm.nih.gov/pubmed/33723478
http://dx.doi.org/10.1016/j.eswa.2021.114832
work_keys_str_mv AT smithmatthew identifyingmortalityfactorsfrommachinelearningusingshapleyvaluesacaseofcovid19
AT alvarezfrancisco identifyingmortalityfactorsfrommachinelearningusingshapleyvaluesacaseofcovid19