Cargando…
Causal connections between socioeconomic disparities and COVID-19 in the USA
With the increasing use of machine learning models in computational socioeconomics, the development of methods for explaining these models and understanding the causal connections is gradually gaining importance. In this work, we advocate the use of an explanatory framework from cooperative game the...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9499932/ https://www.ncbi.nlm.nih.gov/pubmed/36138106 http://dx.doi.org/10.1038/s41598-022-18725-4 |
_version_ | 1784795107190374400 |
---|---|
author | Banerjee, Tannista Paul, Ayan Srikanth, Vishak Strümke, Inga |
author_facet | Banerjee, Tannista Paul, Ayan Srikanth, Vishak Strümke, Inga |
author_sort | Banerjee, Tannista |
collection | PubMed |
description | With the increasing use of machine learning models in computational socioeconomics, the development of methods for explaining these models and understanding the causal connections is gradually gaining importance. In this work, we advocate the use of an explanatory framework from cooperative game theory augmented with do calculus, namely causal Shapley values. Using causal Shapley values, we analyze socioeconomic disparities that have a causal link to the spread of COVID-19 in the USA. We study several phases of the disease spread to show how the causal connections change over time. We perform a causal analysis using random effects models and discuss the correspondence between the two methods to verify our results. We show the distinct advantages a non-linear machine learning models have over linear models when performing a multivariate analysis, especially since the machine learning models can map out non-linear correlations in the data. In addition, the causal Shapley values allow for including the causal structure in the variable importance computed for the machine learning model. |
format | Online Article Text |
id | pubmed-9499932 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-94999322022-09-24 Causal connections between socioeconomic disparities and COVID-19 in the USA Banerjee, Tannista Paul, Ayan Srikanth, Vishak Strümke, Inga Sci Rep Article With the increasing use of machine learning models in computational socioeconomics, the development of methods for explaining these models and understanding the causal connections is gradually gaining importance. In this work, we advocate the use of an explanatory framework from cooperative game theory augmented with do calculus, namely causal Shapley values. Using causal Shapley values, we analyze socioeconomic disparities that have a causal link to the spread of COVID-19 in the USA. We study several phases of the disease spread to show how the causal connections change over time. We perform a causal analysis using random effects models and discuss the correspondence between the two methods to verify our results. We show the distinct advantages a non-linear machine learning models have over linear models when performing a multivariate analysis, especially since the machine learning models can map out non-linear correlations in the data. In addition, the causal Shapley values allow for including the causal structure in the variable importance computed for the machine learning model. Nature Publishing Group UK 2022-09-22 /pmc/articles/PMC9499932/ /pubmed/36138106 http://dx.doi.org/10.1038/s41598-022-18725-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Banerjee, Tannista Paul, Ayan Srikanth, Vishak Strümke, Inga Causal connections between socioeconomic disparities and COVID-19 in the USA |
title | Causal connections between socioeconomic disparities and COVID-19 in the USA |
title_full | Causal connections between socioeconomic disparities and COVID-19 in the USA |
title_fullStr | Causal connections between socioeconomic disparities and COVID-19 in the USA |
title_full_unstemmed | Causal connections between socioeconomic disparities and COVID-19 in the USA |
title_short | Causal connections between socioeconomic disparities and COVID-19 in the USA |
title_sort | causal connections between socioeconomic disparities and covid-19 in the usa |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9499932/ https://www.ncbi.nlm.nih.gov/pubmed/36138106 http://dx.doi.org/10.1038/s41598-022-18725-4 |
work_keys_str_mv | AT banerjeetannista causalconnectionsbetweensocioeconomicdisparitiesandcovid19intheusa AT paulayan causalconnectionsbetweensocioeconomicdisparitiesandcovid19intheusa AT srikanthvishak causalconnectionsbetweensocioeconomicdisparitiesandcovid19intheusa AT strumkeinga causalconnectionsbetweensocioeconomicdisparitiesandcovid19intheusa |