Cargando…

Social disparities in the first wave of COVID-19 incidence rates in Germany: a county-scale explainable machine learning approach

OBJECTIVES: Knowledge about the socioeconomic spread of the first wave of COVID-19 infections in Germany is scattered across different studies. We explored whether COVID-19 incidence rates differed between counties according to their socioeconomic characteristics using a wide range of indicators. DA...

Descripción completa

Detalles Bibliográficos
Autores principales: Doblhammer, Gabriele, Reinke, Constantin, Kreft, Daniel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8852237/
https://www.ncbi.nlm.nih.gov/pubmed/35172994
http://dx.doi.org/10.1136/bmjopen-2021-049852
_version_ 1784653002441752576
author Doblhammer, Gabriele
Reinke, Constantin
Kreft, Daniel
author_facet Doblhammer, Gabriele
Reinke, Constantin
Kreft, Daniel
author_sort Doblhammer, Gabriele
collection PubMed
description OBJECTIVES: Knowledge about the socioeconomic spread of the first wave of COVID-19 infections in Germany is scattered across different studies. We explored whether COVID-19 incidence rates differed between counties according to their socioeconomic characteristics using a wide range of indicators. DATA AND METHOD: We used data from the Robert Koch-Institute (RKI) on 204 217 COVID-19 diagnoses in the total German population of 83.1 million, distinguishing five distinct periods between 1 January and 23 July 2020. For each period, we calculated age-standardised incidence rates of COVID-19 diagnoses on the county level and characterised the counties by 166 macro variables. We trained gradient boosting models to predict the age-standardised incidence rates with the macrostructures of the counties and used SHapley Additive exPlanations (SHAP) values to characterise the 20 most prominent features in terms of negative/positive correlations with the outcome variable. RESULTS: The first COVID-19 wave started as a disease in wealthy rural counties in southern Germany and ventured into poorer urban and agricultural counties during the course of the first wave. High age-standardised incidence in low socioeconomic status (SES) counties became more pronounced from the second lockdown period onwards, when wealthy counties appeared to be better protected. Features related to economic and educational characteristics of the young population in a county played an important role at the beginning of the pandemic up to the second lockdown phase, as did features related to the population living in nursing homes; those related to international migration and a large proportion of foreigners living in a county became important in the postlockdown period. CONCLUSION: High mobility of high SES groups may drive the pandemic at the beginning of waves, while mitigation measures and beliefs about the seriousness of the pandemic as well as the compliance with mitigation measures may put lower SES groups at higher risks later on.
format Online
Article
Text
id pubmed-8852237
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-88522372022-02-18 Social disparities in the first wave of COVID-19 incidence rates in Germany: a county-scale explainable machine learning approach Doblhammer, Gabriele Reinke, Constantin Kreft, Daniel BMJ Open Infectious Diseases OBJECTIVES: Knowledge about the socioeconomic spread of the first wave of COVID-19 infections in Germany is scattered across different studies. We explored whether COVID-19 incidence rates differed between counties according to their socioeconomic characteristics using a wide range of indicators. DATA AND METHOD: We used data from the Robert Koch-Institute (RKI) on 204 217 COVID-19 diagnoses in the total German population of 83.1 million, distinguishing five distinct periods between 1 January and 23 July 2020. For each period, we calculated age-standardised incidence rates of COVID-19 diagnoses on the county level and characterised the counties by 166 macro variables. We trained gradient boosting models to predict the age-standardised incidence rates with the macrostructures of the counties and used SHapley Additive exPlanations (SHAP) values to characterise the 20 most prominent features in terms of negative/positive correlations with the outcome variable. RESULTS: The first COVID-19 wave started as a disease in wealthy rural counties in southern Germany and ventured into poorer urban and agricultural counties during the course of the first wave. High age-standardised incidence in low socioeconomic status (SES) counties became more pronounced from the second lockdown period onwards, when wealthy counties appeared to be better protected. Features related to economic and educational characteristics of the young population in a county played an important role at the beginning of the pandemic up to the second lockdown phase, as did features related to the population living in nursing homes; those related to international migration and a large proportion of foreigners living in a county became important in the postlockdown period. CONCLUSION: High mobility of high SES groups may drive the pandemic at the beginning of waves, while mitigation measures and beliefs about the seriousness of the pandemic as well as the compliance with mitigation measures may put lower SES groups at higher risks later on. BMJ Publishing Group 2022-02-15 /pmc/articles/PMC8852237/ /pubmed/35172994 http://dx.doi.org/10.1136/bmjopen-2021-049852 Text en © Author(s) (or their employer(s)) 2022. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Infectious Diseases
Doblhammer, Gabriele
Reinke, Constantin
Kreft, Daniel
Social disparities in the first wave of COVID-19 incidence rates in Germany: a county-scale explainable machine learning approach
title Social disparities in the first wave of COVID-19 incidence rates in Germany: a county-scale explainable machine learning approach
title_full Social disparities in the first wave of COVID-19 incidence rates in Germany: a county-scale explainable machine learning approach
title_fullStr Social disparities in the first wave of COVID-19 incidence rates in Germany: a county-scale explainable machine learning approach
title_full_unstemmed Social disparities in the first wave of COVID-19 incidence rates in Germany: a county-scale explainable machine learning approach
title_short Social disparities in the first wave of COVID-19 incidence rates in Germany: a county-scale explainable machine learning approach
title_sort social disparities in the first wave of covid-19 incidence rates in germany: a county-scale explainable machine learning approach
topic Infectious Diseases
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8852237/
https://www.ncbi.nlm.nih.gov/pubmed/35172994
http://dx.doi.org/10.1136/bmjopen-2021-049852
work_keys_str_mv AT doblhammergabriele socialdisparitiesinthefirstwaveofcovid19incidenceratesingermanyacountyscaleexplainablemachinelearningapproach
AT reinkeconstantin socialdisparitiesinthefirstwaveofcovid19incidenceratesingermanyacountyscaleexplainablemachinelearningapproach
AT kreftdaniel socialdisparitiesinthefirstwaveofcovid19incidenceratesingermanyacountyscaleexplainablemachinelearningapproach