Cargando…

Data Source Concordance for Infectious Disease Epidemiology

BACKGROUND: As highlighted by the COVID-19 pandemic, researchers are eager to make use of a wide variety of data sources, both government-sponsored and alternative, to characterize the epidemiology of infectious diseases. To date, few studies have investigated the strengths and limitations of source...

Descripción completa

Detalles Bibliográficos
Autores principales: Majumder, Maimuna, Cusick, Marika Mae, Rose, Sherri
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9176660/
https://www.ncbi.nlm.nih.gov/pubmed/35677068
http://dx.doi.org/10.1101/2022.06.02.22275926
_version_ 1784722713820004352
author Majumder, Maimuna
Cusick, Marika Mae
Rose, Sherri
author_facet Majumder, Maimuna
Cusick, Marika Mae
Rose, Sherri
author_sort Majumder, Maimuna
collection PubMed
description BACKGROUND: As highlighted by the COVID-19 pandemic, researchers are eager to make use of a wide variety of data sources, both government-sponsored and alternative, to characterize the epidemiology of infectious diseases. To date, few studies have investigated the strengths and limitations of sources currently being used for such research. These are critical for policy makers to understand when interpreting study findings. METHODS: To fill this gap in the literature, we compared infectious disease reporting for three diseases (measles, mumps, and varicella) across four different data sources: Optum (health insurance billing claims data), HealthMap (online news surveillance data), Morbidity and Mortality Weekly Reports (official government reports), and National Notifiable Disease Surveillance System (government case surveillance data). We reported the yearly number of national- and state-level disease-specific case counts and disease clusters according to each of our sources during a five-year study period (2013–2017). FINDINGS: Our study demonstrated drastic differences in reported infectious disease incidence across data sources. When compared against the other three sources of interest, Optum data showed substantially higher, implausible standardized case counts for all three diseases. Although there was some concordance in identified state-level case counts and disease clusters, all four sources identified variations in state-level reporting. INTERPRETATION: Researchers should consider data source limitations when attempting to characterize the epidemiology of infectious diseases. Some data sources, such as billing claims data, may be unsuitable for epidemiological research within the infectious disease context.
format Online
Article
Text
id pubmed-9176660
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-91766602022-06-09 Data Source Concordance for Infectious Disease Epidemiology Majumder, Maimuna Cusick, Marika Mae Rose, Sherri medRxiv Article BACKGROUND: As highlighted by the COVID-19 pandemic, researchers are eager to make use of a wide variety of data sources, both government-sponsored and alternative, to characterize the epidemiology of infectious diseases. To date, few studies have investigated the strengths and limitations of sources currently being used for such research. These are critical for policy makers to understand when interpreting study findings. METHODS: To fill this gap in the literature, we compared infectious disease reporting for three diseases (measles, mumps, and varicella) across four different data sources: Optum (health insurance billing claims data), HealthMap (online news surveillance data), Morbidity and Mortality Weekly Reports (official government reports), and National Notifiable Disease Surveillance System (government case surveillance data). We reported the yearly number of national- and state-level disease-specific case counts and disease clusters according to each of our sources during a five-year study period (2013–2017). FINDINGS: Our study demonstrated drastic differences in reported infectious disease incidence across data sources. When compared against the other three sources of interest, Optum data showed substantially higher, implausible standardized case counts for all three diseases. Although there was some concordance in identified state-level case counts and disease clusters, all four sources identified variations in state-level reporting. INTERPRETATION: Researchers should consider data source limitations when attempting to characterize the epidemiology of infectious diseases. Some data sources, such as billing claims data, may be unsuitable for epidemiological research within the infectious disease context. Cold Spring Harbor Laboratory 2022-06-03 /pmc/articles/PMC9176660/ /pubmed/35677068 http://dx.doi.org/10.1101/2022.06.02.22275926 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Majumder, Maimuna
Cusick, Marika Mae
Rose, Sherri
Data Source Concordance for Infectious Disease Epidemiology
title Data Source Concordance for Infectious Disease Epidemiology
title_full Data Source Concordance for Infectious Disease Epidemiology
title_fullStr Data Source Concordance for Infectious Disease Epidemiology
title_full_unstemmed Data Source Concordance for Infectious Disease Epidemiology
title_short Data Source Concordance for Infectious Disease Epidemiology
title_sort data source concordance for infectious disease epidemiology
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9176660/
https://www.ncbi.nlm.nih.gov/pubmed/35677068
http://dx.doi.org/10.1101/2022.06.02.22275926
work_keys_str_mv AT majumdermaimuna datasourceconcordanceforinfectiousdiseaseepidemiology
AT cusickmarikamae datasourceconcordanceforinfectiousdiseaseepidemiology
AT rosesherri datasourceconcordanceforinfectiousdiseaseepidemiology