Cargando…
Data Source Concordance for Infectious Disease Epidemiology
BACKGROUND: As highlighted by the COVID-19 pandemic, researchers are eager to make use of a wide variety of data sources, both government-sponsored and alternative, to characterize the epidemiology of infectious diseases. To date, few studies have investigated the strengths and limitations of source...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9176660/ https://www.ncbi.nlm.nih.gov/pubmed/35677068 http://dx.doi.org/10.1101/2022.06.02.22275926 |
_version_ | 1784722713820004352 |
---|---|
author | Majumder, Maimuna Cusick, Marika Mae Rose, Sherri |
author_facet | Majumder, Maimuna Cusick, Marika Mae Rose, Sherri |
author_sort | Majumder, Maimuna |
collection | PubMed |
description | BACKGROUND: As highlighted by the COVID-19 pandemic, researchers are eager to make use of a wide variety of data sources, both government-sponsored and alternative, to characterize the epidemiology of infectious diseases. To date, few studies have investigated the strengths and limitations of sources currently being used for such research. These are critical for policy makers to understand when interpreting study findings. METHODS: To fill this gap in the literature, we compared infectious disease reporting for three diseases (measles, mumps, and varicella) across four different data sources: Optum (health insurance billing claims data), HealthMap (online news surveillance data), Morbidity and Mortality Weekly Reports (official government reports), and National Notifiable Disease Surveillance System (government case surveillance data). We reported the yearly number of national- and state-level disease-specific case counts and disease clusters according to each of our sources during a five-year study period (2013–2017). FINDINGS: Our study demonstrated drastic differences in reported infectious disease incidence across data sources. When compared against the other three sources of interest, Optum data showed substantially higher, implausible standardized case counts for all three diseases. Although there was some concordance in identified state-level case counts and disease clusters, all four sources identified variations in state-level reporting. INTERPRETATION: Researchers should consider data source limitations when attempting to characterize the epidemiology of infectious diseases. Some data sources, such as billing claims data, may be unsuitable for epidemiological research within the infectious disease context. |
format | Online Article Text |
id | pubmed-9176660 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-91766602022-06-09 Data Source Concordance for Infectious Disease Epidemiology Majumder, Maimuna Cusick, Marika Mae Rose, Sherri medRxiv Article BACKGROUND: As highlighted by the COVID-19 pandemic, researchers are eager to make use of a wide variety of data sources, both government-sponsored and alternative, to characterize the epidemiology of infectious diseases. To date, few studies have investigated the strengths and limitations of sources currently being used for such research. These are critical for policy makers to understand when interpreting study findings. METHODS: To fill this gap in the literature, we compared infectious disease reporting for three diseases (measles, mumps, and varicella) across four different data sources: Optum (health insurance billing claims data), HealthMap (online news surveillance data), Morbidity and Mortality Weekly Reports (official government reports), and National Notifiable Disease Surveillance System (government case surveillance data). We reported the yearly number of national- and state-level disease-specific case counts and disease clusters according to each of our sources during a five-year study period (2013–2017). FINDINGS: Our study demonstrated drastic differences in reported infectious disease incidence across data sources. When compared against the other three sources of interest, Optum data showed substantially higher, implausible standardized case counts for all three diseases. Although there was some concordance in identified state-level case counts and disease clusters, all four sources identified variations in state-level reporting. INTERPRETATION: Researchers should consider data source limitations when attempting to characterize the epidemiology of infectious diseases. Some data sources, such as billing claims data, may be unsuitable for epidemiological research within the infectious disease context. Cold Spring Harbor Laboratory 2022-06-03 /pmc/articles/PMC9176660/ /pubmed/35677068 http://dx.doi.org/10.1101/2022.06.02.22275926 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator. |
spellingShingle | Article Majumder, Maimuna Cusick, Marika Mae Rose, Sherri Data Source Concordance for Infectious Disease Epidemiology |
title | Data Source Concordance for Infectious Disease Epidemiology |
title_full | Data Source Concordance for Infectious Disease Epidemiology |
title_fullStr | Data Source Concordance for Infectious Disease Epidemiology |
title_full_unstemmed | Data Source Concordance for Infectious Disease Epidemiology |
title_short | Data Source Concordance for Infectious Disease Epidemiology |
title_sort | data source concordance for infectious disease epidemiology |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9176660/ https://www.ncbi.nlm.nih.gov/pubmed/35677068 http://dx.doi.org/10.1101/2022.06.02.22275926 |
work_keys_str_mv | AT majumdermaimuna datasourceconcordanceforinfectiousdiseaseepidemiology AT cusickmarikamae datasourceconcordanceforinfectiousdiseaseepidemiology AT rosesherri datasourceconcordanceforinfectiousdiseaseepidemiology |