Cargando…

Pseudo-likelihood based logistic regression for estimating COVID-19 infection and case fatality rates by gender, race, and age in California

In emerging epidemics, early estimates of key epidemiological characteristics of the disease are critical for guiding public policy. In particular, identifying high-risk population subgroups aids policymakers and health officials in combating the epidemic. This has been challenging during the corona...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiong, Di, Zhang, Lu, Watson, Gregory L., Sundin, Phillip, Bufford, Teresa, Zoller, Joseph A., Shamshoian, John, Suchard, Marc A., Ramirez, Christina M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Authors. Published by Elsevier B.V. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7837024/
https://www.ncbi.nlm.nih.gov/pubmed/33221671
http://dx.doi.org/10.1016/j.epidem.2020.100418
_version_ 1783642872303583232
author Xiong, Di
Zhang, Lu
Watson, Gregory L.
Sundin, Phillip
Bufford, Teresa
Zoller, Joseph A.
Shamshoian, John
Suchard, Marc A.
Ramirez, Christina M.
author_facet Xiong, Di
Zhang, Lu
Watson, Gregory L.
Sundin, Phillip
Bufford, Teresa
Zoller, Joseph A.
Shamshoian, John
Suchard, Marc A.
Ramirez, Christina M.
author_sort Xiong, Di
collection PubMed
description In emerging epidemics, early estimates of key epidemiological characteristics of the disease are critical for guiding public policy. In particular, identifying high-risk population subgroups aids policymakers and health officials in combating the epidemic. This has been challenging during the coronavirus disease 2019 (COVID-19) pandemic because governmental agencies typically release aggregate COVID-19 data as summary statistics of patient demographics. These data may identify disparities in COVID-19 outcomes between broad population subgroups, but do not provide comparisons between more granular population subgroups defined by combinations of multiple demographics. We introduce a method that helps to overcome the limitations of aggregated summary statistics and yields estimates of COVID-19 infection and case fatality rates — key quantities for guiding public policy related to the control and prevention of COVID-19 — for population subgroups across combinations of demographic characteristics. Our approach uses pseudo-likelihood based logistic regression to combine aggregate COVID-19 case and fatality data with population-level demographic survey data to estimate infection and case fatality rates for population subgroups across combinations of demographic characteristics. We illustrate our method on California COVID-19 data to estimate test-based infection and case fatality rates for population subgroups defined by gender, age, and race/ethnicity. Our analysis indicates that in California, males have higher test-based infection rates and test-based case fatality rates across age and race/ethnicity groups, with the gender gap widening with increasing age. Although elderly infected with COVID-19 are at an elevated risk of mortality, the test-based infection rates do not increase monotonically with age. The workforce population, especially, has a higher test-based infection rate than children, adolescents, and other elderly people in their 60–80. LatinX and African Americans have higher test-based infection rates than other race/ethnicity groups. The subgroups with the highest 5 test-based case fatality rates are all-male groups with race as African American, Asian, Multi-race, LatinX, and White, followed by African American females, indicating that African Americans are an especially vulnerable California subpopulation.
format Online
Article
Text
id pubmed-7837024
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher The Authors. Published by Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-78370242021-01-26 Pseudo-likelihood based logistic regression for estimating COVID-19 infection and case fatality rates by gender, race, and age in California Xiong, Di Zhang, Lu Watson, Gregory L. Sundin, Phillip Bufford, Teresa Zoller, Joseph A. Shamshoian, John Suchard, Marc A. Ramirez, Christina M. Epidemics Article In emerging epidemics, early estimates of key epidemiological characteristics of the disease are critical for guiding public policy. In particular, identifying high-risk population subgroups aids policymakers and health officials in combating the epidemic. This has been challenging during the coronavirus disease 2019 (COVID-19) pandemic because governmental agencies typically release aggregate COVID-19 data as summary statistics of patient demographics. These data may identify disparities in COVID-19 outcomes between broad population subgroups, but do not provide comparisons between more granular population subgroups defined by combinations of multiple demographics. We introduce a method that helps to overcome the limitations of aggregated summary statistics and yields estimates of COVID-19 infection and case fatality rates — key quantities for guiding public policy related to the control and prevention of COVID-19 — for population subgroups across combinations of demographic characteristics. Our approach uses pseudo-likelihood based logistic regression to combine aggregate COVID-19 case and fatality data with population-level demographic survey data to estimate infection and case fatality rates for population subgroups across combinations of demographic characteristics. We illustrate our method on California COVID-19 data to estimate test-based infection and case fatality rates for population subgroups defined by gender, age, and race/ethnicity. Our analysis indicates that in California, males have higher test-based infection rates and test-based case fatality rates across age and race/ethnicity groups, with the gender gap widening with increasing age. Although elderly infected with COVID-19 are at an elevated risk of mortality, the test-based infection rates do not increase monotonically with age. The workforce population, especially, has a higher test-based infection rate than children, adolescents, and other elderly people in their 60–80. LatinX and African Americans have higher test-based infection rates than other race/ethnicity groups. The subgroups with the highest 5 test-based case fatality rates are all-male groups with race as African American, Asian, Multi-race, LatinX, and White, followed by African American females, indicating that African Americans are an especially vulnerable California subpopulation. The Authors. Published by Elsevier B.V. 2020-12 2020-11-09 /pmc/articles/PMC7837024/ /pubmed/33221671 http://dx.doi.org/10.1016/j.epidem.2020.100418 Text en © 2020 The Authors Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Xiong, Di
Zhang, Lu
Watson, Gregory L.
Sundin, Phillip
Bufford, Teresa
Zoller, Joseph A.
Shamshoian, John
Suchard, Marc A.
Ramirez, Christina M.
Pseudo-likelihood based logistic regression for estimating COVID-19 infection and case fatality rates by gender, race, and age in California
title Pseudo-likelihood based logistic regression for estimating COVID-19 infection and case fatality rates by gender, race, and age in California
title_full Pseudo-likelihood based logistic regression for estimating COVID-19 infection and case fatality rates by gender, race, and age in California
title_fullStr Pseudo-likelihood based logistic regression for estimating COVID-19 infection and case fatality rates by gender, race, and age in California
title_full_unstemmed Pseudo-likelihood based logistic regression for estimating COVID-19 infection and case fatality rates by gender, race, and age in California
title_short Pseudo-likelihood based logistic regression for estimating COVID-19 infection and case fatality rates by gender, race, and age in California
title_sort pseudo-likelihood based logistic regression for estimating covid-19 infection and case fatality rates by gender, race, and age in california
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7837024/
https://www.ncbi.nlm.nih.gov/pubmed/33221671
http://dx.doi.org/10.1016/j.epidem.2020.100418
work_keys_str_mv AT xiongdi pseudolikelihoodbasedlogisticregressionforestimatingcovid19infectionandcasefatalityratesbygenderraceandageincalifornia
AT zhanglu pseudolikelihoodbasedlogisticregressionforestimatingcovid19infectionandcasefatalityratesbygenderraceandageincalifornia
AT watsongregoryl pseudolikelihoodbasedlogisticregressionforestimatingcovid19infectionandcasefatalityratesbygenderraceandageincalifornia
AT sundinphillip pseudolikelihoodbasedlogisticregressionforestimatingcovid19infectionandcasefatalityratesbygenderraceandageincalifornia
AT buffordteresa pseudolikelihoodbasedlogisticregressionforestimatingcovid19infectionandcasefatalityratesbygenderraceandageincalifornia
AT zollerjosepha pseudolikelihoodbasedlogisticregressionforestimatingcovid19infectionandcasefatalityratesbygenderraceandageincalifornia
AT shamshoianjohn pseudolikelihoodbasedlogisticregressionforestimatingcovid19infectionandcasefatalityratesbygenderraceandageincalifornia
AT suchardmarca pseudolikelihoodbasedlogisticregressionforestimatingcovid19infectionandcasefatalityratesbygenderraceandageincalifornia
AT ramirezchristinam pseudolikelihoodbasedlogisticregressionforestimatingcovid19infectionandcasefatalityratesbygenderraceandageincalifornia