Cargando…

COVID-19 data are messy: analytic methods for rigorous impact analyses with imperfect data

BACKGROUND: The COVID-19 pandemic has led to an avalanche of scientific studies, drawing on many different types of data. However, studies addressing the effectiveness of government actions against COVID-19, especially non-pharmaceutical interventions, often exhibit data problems that threaten the v...

Descripción completa

Detalles Bibliográficos
Autores principales:	Stoto, Michael A., Woolverton, Abbey, Kraemer, John, Barlow, Pepita, Clarke, Michael
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2022
Materias:	Commentary
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8733906/ https://www.ncbi.nlm.nih.gov/pubmed/34991622 http://dx.doi.org/10.1186/s12992-021-00795-0

_version_	1784627900638560256
author	Stoto, Michael A. Woolverton, Abbey Kraemer, John Barlow, Pepita Clarke, Michael
author_facet	Stoto, Michael A. Woolverton, Abbey Kraemer, John Barlow, Pepita Clarke, Michael
author_sort	Stoto, Michael A.
collection	PubMed
description	BACKGROUND: The COVID-19 pandemic has led to an avalanche of scientific studies, drawing on many different types of data. However, studies addressing the effectiveness of government actions against COVID-19, especially non-pharmaceutical interventions, often exhibit data problems that threaten the validity of their results. This review is thus intended to help epidemiologists and other researchers identify a set of data issues that, in our view, must be addressed in order for their work to be credible. We further intend to help journal editors and peer reviewers when evaluating studies, to apprise policy-makers, journalists, and other research consumers about the strengths and weaknesses of published studies, and to inform the wider debate about the scientific quality of COVID-19 research. RESULTS: To this end, we describe common challenges in the collection, reporting, and use of epidemiologic, policy, and other data, including completeness and representativeness of outcomes data; their comparability over time and among jurisdictions; the adequacy of policy variables and data on intermediate outcomes such as mobility and mask use; and a mismatch between level of intervention and outcome variables. We urge researchers to think critically about potential problems with the COVID-19 data sources over the specific time periods and particular locations they have chosen to analyze, and to choose not only appropriate study designs but also to conduct appropriate checks and sensitivity analyses to investigate the impact(s) of potential threats on study findings. CONCLUSIONS: In an effort to encourage high quality research, we provide recommendations on how to address the issues we identify. Our first recommendation is for researchers to choose an appropriate design (and the data it requires). This review describes considerations and issues in order to identify the strongest analytical designs and demonstrates how interrupted time-series and comparative longitudinal studies can be particularly useful. Furthermore, we recommend that researchers conduct checks or sensitivity analyses of the results to data source and design choices, which we illustrate. Regardless of the approaches taken, researchers should be explicit about the kind of data problems or other biases that the design choice and sensitivity analyses are addressing.
format	Online Article Text
id	pubmed-8733906
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-87339062022-01-06 COVID-19 data are messy: analytic methods for rigorous impact analyses with imperfect data Stoto, Michael A. Woolverton, Abbey Kraemer, John Barlow, Pepita Clarke, Michael Global Health Commentary BACKGROUND: The COVID-19 pandemic has led to an avalanche of scientific studies, drawing on many different types of data. However, studies addressing the effectiveness of government actions against COVID-19, especially non-pharmaceutical interventions, often exhibit data problems that threaten the validity of their results. This review is thus intended to help epidemiologists and other researchers identify a set of data issues that, in our view, must be addressed in order for their work to be credible. We further intend to help journal editors and peer reviewers when evaluating studies, to apprise policy-makers, journalists, and other research consumers about the strengths and weaknesses of published studies, and to inform the wider debate about the scientific quality of COVID-19 research. RESULTS: To this end, we describe common challenges in the collection, reporting, and use of epidemiologic, policy, and other data, including completeness and representativeness of outcomes data; their comparability over time and among jurisdictions; the adequacy of policy variables and data on intermediate outcomes such as mobility and mask use; and a mismatch between level of intervention and outcome variables. We urge researchers to think critically about potential problems with the COVID-19 data sources over the specific time periods and particular locations they have chosen to analyze, and to choose not only appropriate study designs but also to conduct appropriate checks and sensitivity analyses to investigate the impact(s) of potential threats on study findings. CONCLUSIONS: In an effort to encourage high quality research, we provide recommendations on how to address the issues we identify. Our first recommendation is for researchers to choose an appropriate design (and the data it requires). This review describes considerations and issues in order to identify the strongest analytical designs and demonstrates how interrupted time-series and comparative longitudinal studies can be particularly useful. Furthermore, we recommend that researchers conduct checks or sensitivity analyses of the results to data source and design choices, which we illustrate. Regardless of the approaches taken, researchers should be explicit about the kind of data problems or other biases that the design choice and sensitivity analyses are addressing. BioMed Central 2022-01-06 /pmc/articles/PMC8733906/ /pubmed/34991622 http://dx.doi.org/10.1186/s12992-021-00795-0 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Commentary Stoto, Michael A. Woolverton, Abbey Kraemer, John Barlow, Pepita Clarke, Michael COVID-19 data are messy: analytic methods for rigorous impact analyses with imperfect data
title	COVID-19 data are messy: analytic methods for rigorous impact analyses with imperfect data
title_full	COVID-19 data are messy: analytic methods for rigorous impact analyses with imperfect data
title_fullStr	COVID-19 data are messy: analytic methods for rigorous impact analyses with imperfect data
title_full_unstemmed	COVID-19 data are messy: analytic methods for rigorous impact analyses with imperfect data
title_short	COVID-19 data are messy: analytic methods for rigorous impact analyses with imperfect data
title_sort	covid-19 data are messy: analytic methods for rigorous impact analyses with imperfect data
topic	Commentary
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8733906/ https://www.ncbi.nlm.nih.gov/pubmed/34991622 http://dx.doi.org/10.1186/s12992-021-00795-0
work_keys_str_mv	AT stotomichaela covid19dataaremessyanalyticmethodsforrigorousimpactanalyseswithimperfectdata AT woolvertonabbey covid19dataaremessyanalyticmethodsforrigorousimpactanalyseswithimperfectdata AT kraemerjohn covid19dataaremessyanalyticmethodsforrigorousimpactanalyseswithimperfectdata AT barlowpepita covid19dataaremessyanalyticmethodsforrigorousimpactanalyseswithimperfectdata AT clarkemichael covid19dataaremessyanalyticmethodsforrigorousimpactanalyseswithimperfectdata

COVID-19 data are messy: analytic methods for rigorous impact analyses with imperfect data

Ejemplares similares