Cargando…
Better data for decision-making through Bayesian imputation of suppressed provisional COVID-19 death counts
PURPOSE: To facilitate use of timely, granular, and publicly available data on COVID-19 mortality, we provide a method for imputing suppressed COVID-19 death counts in the National Center for Health Statistic’s 2020 provisional mortality data by quarter, county, and age. METHODS: We used a Bayesian...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10399909/ https://www.ncbi.nlm.nih.gov/pubmed/37535647 http://dx.doi.org/10.1371/journal.pone.0288961 |
_version_ | 1785084351713640448 |
---|---|
author | Kao, Szu-Yu Zoe Tutwiler, M. Shane Ekwueme, Donatus U. Truman, Benedict I. |
author_facet | Kao, Szu-Yu Zoe Tutwiler, M. Shane Ekwueme, Donatus U. Truman, Benedict I. |
author_sort | Kao, Szu-Yu Zoe |
collection | PubMed |
description | PURPOSE: To facilitate use of timely, granular, and publicly available data on COVID-19 mortality, we provide a method for imputing suppressed COVID-19 death counts in the National Center for Health Statistic’s 2020 provisional mortality data by quarter, county, and age. METHODS: We used a Bayesian approach to impute suppressed COVID-19 death counts by quarter, county, and age in provisional data for 3,138 US counties. Our model accounts for multilevel data structures; numerous zero death counts among persons aged <50 years, rural counties, early quarters in 2020; highly right-skewed distributions; and different levels of data granularity (county, state or locality, and national levels). We compared three models with different prior assumptions of suppressed COVID-19 deaths, including noninformative priors (M1), the same weakly informative priors for all age groups (M2), and weakly informative priors that differ by age (M3) to impute the suppressed death counts. After the imputed suppressed counts were available, we assessed three prior assumptions at the national, state/locality, and county level, respectively. Finally, we compared US counties by two types of COVID-19 death rates, crude (CDR) and age-standardized death rates (ASDR), which can be estimated only through imputing suppressed death counts. RESULTS: Without imputation, the total COVID-19 death counts estimated from the raw data underestimated the reported national COVID-19 deaths by 18.60%. Using imputed data, we overestimated the national COVID-19 deaths by 3.57% (95% CI: 3.37%-3.80%) in model M1, 2.23% (95% CI: 2.04%-2.43%) in model M2, and 2.96% (95% CI: 2.76%-3.16%) in model M3 compared with the national report. The top 20 counties that were most affected by COVID-19 mortality were different between CDR and ASDR. CONCLUSIONS: Bayesian imputation of suppressed county-level, age-specific COVID-19 deaths in US provisional data can improve county ASDR estimates and aid public health officials in identifying disparities in deaths from COVID-19. |
format | Online Article Text |
id | pubmed-10399909 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-103999092023-08-04 Better data for decision-making through Bayesian imputation of suppressed provisional COVID-19 death counts Kao, Szu-Yu Zoe Tutwiler, M. Shane Ekwueme, Donatus U. Truman, Benedict I. PLoS One Research Article PURPOSE: To facilitate use of timely, granular, and publicly available data on COVID-19 mortality, we provide a method for imputing suppressed COVID-19 death counts in the National Center for Health Statistic’s 2020 provisional mortality data by quarter, county, and age. METHODS: We used a Bayesian approach to impute suppressed COVID-19 death counts by quarter, county, and age in provisional data for 3,138 US counties. Our model accounts for multilevel data structures; numerous zero death counts among persons aged <50 years, rural counties, early quarters in 2020; highly right-skewed distributions; and different levels of data granularity (county, state or locality, and national levels). We compared three models with different prior assumptions of suppressed COVID-19 deaths, including noninformative priors (M1), the same weakly informative priors for all age groups (M2), and weakly informative priors that differ by age (M3) to impute the suppressed death counts. After the imputed suppressed counts were available, we assessed three prior assumptions at the national, state/locality, and county level, respectively. Finally, we compared US counties by two types of COVID-19 death rates, crude (CDR) and age-standardized death rates (ASDR), which can be estimated only through imputing suppressed death counts. RESULTS: Without imputation, the total COVID-19 death counts estimated from the raw data underestimated the reported national COVID-19 deaths by 18.60%. Using imputed data, we overestimated the national COVID-19 deaths by 3.57% (95% CI: 3.37%-3.80%) in model M1, 2.23% (95% CI: 2.04%-2.43%) in model M2, and 2.96% (95% CI: 2.76%-3.16%) in model M3 compared with the national report. The top 20 counties that were most affected by COVID-19 mortality were different between CDR and ASDR. CONCLUSIONS: Bayesian imputation of suppressed county-level, age-specific COVID-19 deaths in US provisional data can improve county ASDR estimates and aid public health officials in identifying disparities in deaths from COVID-19. Public Library of Science 2023-08-03 /pmc/articles/PMC10399909/ /pubmed/37535647 http://dx.doi.org/10.1371/journal.pone.0288961 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication. |
spellingShingle | Research Article Kao, Szu-Yu Zoe Tutwiler, M. Shane Ekwueme, Donatus U. Truman, Benedict I. Better data for decision-making through Bayesian imputation of suppressed provisional COVID-19 death counts |
title | Better data for decision-making through Bayesian imputation of suppressed provisional COVID-19 death counts |
title_full | Better data for decision-making through Bayesian imputation of suppressed provisional COVID-19 death counts |
title_fullStr | Better data for decision-making through Bayesian imputation of suppressed provisional COVID-19 death counts |
title_full_unstemmed | Better data for decision-making through Bayesian imputation of suppressed provisional COVID-19 death counts |
title_short | Better data for decision-making through Bayesian imputation of suppressed provisional COVID-19 death counts |
title_sort | better data for decision-making through bayesian imputation of suppressed provisional covid-19 death counts |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10399909/ https://www.ncbi.nlm.nih.gov/pubmed/37535647 http://dx.doi.org/10.1371/journal.pone.0288961 |
work_keys_str_mv | AT kaoszuyuzoe betterdatafordecisionmakingthroughbayesianimputationofsuppressedprovisionalcovid19deathcounts AT tutwilermshane betterdatafordecisionmakingthroughbayesianimputationofsuppressedprovisionalcovid19deathcounts AT ekwuemedonatusu betterdatafordecisionmakingthroughbayesianimputationofsuppressedprovisionalcovid19deathcounts AT trumanbenedicti betterdatafordecisionmakingthroughbayesianimputationofsuppressedprovisionalcovid19deathcounts |