Cargando…
Identifying mortality factors from Machine Learning using Shapley values – a case of COVID19
In this paper we apply a series of Machine Learning models to a recently published unique dataset on the mortality of COVID19 patients. We use a dataset consisting of blood samples of 375 patients admitted to a hospital in the region of Wuhan, China. There are 201 patients who survived hospitalisati...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier Ltd.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7948528/ https://www.ncbi.nlm.nih.gov/pubmed/33723478 http://dx.doi.org/10.1016/j.eswa.2021.114832 |
_version_ | 1783663427852435456 |
---|---|
author | Smith, Matthew Alvarez, Francisco |
author_facet | Smith, Matthew Alvarez, Francisco |
author_sort | Smith, Matthew |
collection | PubMed |
description | In this paper we apply a series of Machine Learning models to a recently published unique dataset on the mortality of COVID19 patients. We use a dataset consisting of blood samples of 375 patients admitted to a hospital in the region of Wuhan, China. There are 201 patients who survived hospitalisation and 174 patients who died whilst in hospital. The focus of the paper is not only on seeing which Machine Learning model is able to obtain the absolute highest accuracy but more on the interpretation of what the Machine Learning models provides. We find that age, days in hospital, Lymphocyte and Neutrophils are important and robust predictors when predicting a patients mortality. Furthermore, the algorithms we use allows us to observe the marginal impact of each variable on a case-by-case patient level, which might help practicioneers to easily detect anomalous patterns. This paper analyses the global and local interpretation of the Machine Learning models on patients with COVID19. |
format | Online Article Text |
id | pubmed-7948528 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Elsevier Ltd. |
record_format | MEDLINE/PubMed |
spelling | pubmed-79485282021-03-11 Identifying mortality factors from Machine Learning using Shapley values – a case of COVID19 Smith, Matthew Alvarez, Francisco Expert Syst Appl Short Communication In this paper we apply a series of Machine Learning models to a recently published unique dataset on the mortality of COVID19 patients. We use a dataset consisting of blood samples of 375 patients admitted to a hospital in the region of Wuhan, China. There are 201 patients who survived hospitalisation and 174 patients who died whilst in hospital. The focus of the paper is not only on seeing which Machine Learning model is able to obtain the absolute highest accuracy but more on the interpretation of what the Machine Learning models provides. We find that age, days in hospital, Lymphocyte and Neutrophils are important and robust predictors when predicting a patients mortality. Furthermore, the algorithms we use allows us to observe the marginal impact of each variable on a case-by-case patient level, which might help practicioneers to easily detect anomalous patterns. This paper analyses the global and local interpretation of the Machine Learning models on patients with COVID19. Elsevier Ltd. 2021-08-15 2021-03-11 /pmc/articles/PMC7948528/ /pubmed/33723478 http://dx.doi.org/10.1016/j.eswa.2021.114832 Text en © 2021 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Short Communication Smith, Matthew Alvarez, Francisco Identifying mortality factors from Machine Learning using Shapley values – a case of COVID19 |
title | Identifying mortality factors from Machine Learning using Shapley values – a case of COVID19 |
title_full | Identifying mortality factors from Machine Learning using Shapley values – a case of COVID19 |
title_fullStr | Identifying mortality factors from Machine Learning using Shapley values – a case of COVID19 |
title_full_unstemmed | Identifying mortality factors from Machine Learning using Shapley values – a case of COVID19 |
title_short | Identifying mortality factors from Machine Learning using Shapley values – a case of COVID19 |
title_sort | identifying mortality factors from machine learning using shapley values – a case of covid19 |
topic | Short Communication |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7948528/ https://www.ncbi.nlm.nih.gov/pubmed/33723478 http://dx.doi.org/10.1016/j.eswa.2021.114832 |
work_keys_str_mv | AT smithmatthew identifyingmortalityfactorsfrommachinelearningusingshapleyvaluesacaseofcovid19 AT alvarezfrancisco identifyingmortalityfactorsfrommachinelearningusingshapleyvaluesacaseofcovid19 |