Cargando…
Social disparities in the first wave of COVID-19 incidence rates in Germany: a county-scale explainable machine learning approach
OBJECTIVES: Knowledge about the socioeconomic spread of the first wave of COVID-19 infections in Germany is scattered across different studies. We explored whether COVID-19 incidence rates differed between counties according to their socioeconomic characteristics using a wide range of indicators. DA...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BMJ Publishing Group
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8852237/ https://www.ncbi.nlm.nih.gov/pubmed/35172994 http://dx.doi.org/10.1136/bmjopen-2021-049852 |
_version_ | 1784653002441752576 |
---|---|
author | Doblhammer, Gabriele Reinke, Constantin Kreft, Daniel |
author_facet | Doblhammer, Gabriele Reinke, Constantin Kreft, Daniel |
author_sort | Doblhammer, Gabriele |
collection | PubMed |
description | OBJECTIVES: Knowledge about the socioeconomic spread of the first wave of COVID-19 infections in Germany is scattered across different studies. We explored whether COVID-19 incidence rates differed between counties according to their socioeconomic characteristics using a wide range of indicators. DATA AND METHOD: We used data from the Robert Koch-Institute (RKI) on 204 217 COVID-19 diagnoses in the total German population of 83.1 million, distinguishing five distinct periods between 1 January and 23 July 2020. For each period, we calculated age-standardised incidence rates of COVID-19 diagnoses on the county level and characterised the counties by 166 macro variables. We trained gradient boosting models to predict the age-standardised incidence rates with the macrostructures of the counties and used SHapley Additive exPlanations (SHAP) values to characterise the 20 most prominent features in terms of negative/positive correlations with the outcome variable. RESULTS: The first COVID-19 wave started as a disease in wealthy rural counties in southern Germany and ventured into poorer urban and agricultural counties during the course of the first wave. High age-standardised incidence in low socioeconomic status (SES) counties became more pronounced from the second lockdown period onwards, when wealthy counties appeared to be better protected. Features related to economic and educational characteristics of the young population in a county played an important role at the beginning of the pandemic up to the second lockdown phase, as did features related to the population living in nursing homes; those related to international migration and a large proportion of foreigners living in a county became important in the postlockdown period. CONCLUSION: High mobility of high SES groups may drive the pandemic at the beginning of waves, while mitigation measures and beliefs about the seriousness of the pandemic as well as the compliance with mitigation measures may put lower SES groups at higher risks later on. |
format | Online Article Text |
id | pubmed-8852237 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BMJ Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-88522372022-02-18 Social disparities in the first wave of COVID-19 incidence rates in Germany: a county-scale explainable machine learning approach Doblhammer, Gabriele Reinke, Constantin Kreft, Daniel BMJ Open Infectious Diseases OBJECTIVES: Knowledge about the socioeconomic spread of the first wave of COVID-19 infections in Germany is scattered across different studies. We explored whether COVID-19 incidence rates differed between counties according to their socioeconomic characteristics using a wide range of indicators. DATA AND METHOD: We used data from the Robert Koch-Institute (RKI) on 204 217 COVID-19 diagnoses in the total German population of 83.1 million, distinguishing five distinct periods between 1 January and 23 July 2020. For each period, we calculated age-standardised incidence rates of COVID-19 diagnoses on the county level and characterised the counties by 166 macro variables. We trained gradient boosting models to predict the age-standardised incidence rates with the macrostructures of the counties and used SHapley Additive exPlanations (SHAP) values to characterise the 20 most prominent features in terms of negative/positive correlations with the outcome variable. RESULTS: The first COVID-19 wave started as a disease in wealthy rural counties in southern Germany and ventured into poorer urban and agricultural counties during the course of the first wave. High age-standardised incidence in low socioeconomic status (SES) counties became more pronounced from the second lockdown period onwards, when wealthy counties appeared to be better protected. Features related to economic and educational characteristics of the young population in a county played an important role at the beginning of the pandemic up to the second lockdown phase, as did features related to the population living in nursing homes; those related to international migration and a large proportion of foreigners living in a county became important in the postlockdown period. CONCLUSION: High mobility of high SES groups may drive the pandemic at the beginning of waves, while mitigation measures and beliefs about the seriousness of the pandemic as well as the compliance with mitigation measures may put lower SES groups at higher risks later on. BMJ Publishing Group 2022-02-15 /pmc/articles/PMC8852237/ /pubmed/35172994 http://dx.doi.org/10.1136/bmjopen-2021-049852 Text en © Author(s) (or their employer(s)) 2022. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) . |
spellingShingle | Infectious Diseases Doblhammer, Gabriele Reinke, Constantin Kreft, Daniel Social disparities in the first wave of COVID-19 incidence rates in Germany: a county-scale explainable machine learning approach |
title | Social disparities in the first wave of COVID-19 incidence rates in Germany: a county-scale explainable machine learning approach |
title_full | Social disparities in the first wave of COVID-19 incidence rates in Germany: a county-scale explainable machine learning approach |
title_fullStr | Social disparities in the first wave of COVID-19 incidence rates in Germany: a county-scale explainable machine learning approach |
title_full_unstemmed | Social disparities in the first wave of COVID-19 incidence rates in Germany: a county-scale explainable machine learning approach |
title_short | Social disparities in the first wave of COVID-19 incidence rates in Germany: a county-scale explainable machine learning approach |
title_sort | social disparities in the first wave of covid-19 incidence rates in germany: a county-scale explainable machine learning approach |
topic | Infectious Diseases |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8852237/ https://www.ncbi.nlm.nih.gov/pubmed/35172994 http://dx.doi.org/10.1136/bmjopen-2021-049852 |
work_keys_str_mv | AT doblhammergabriele socialdisparitiesinthefirstwaveofcovid19incidenceratesingermanyacountyscaleexplainablemachinelearningapproach AT reinkeconstantin socialdisparitiesinthefirstwaveofcovid19incidenceratesingermanyacountyscaleexplainablemachinelearningapproach AT kreftdaniel socialdisparitiesinthefirstwaveofcovid19incidenceratesingermanyacountyscaleexplainablemachinelearningapproach |