Cargando…

Hidden analyses: a review of reporting practice and recommendations for more transparent reporting of initial data analyses

BACKGROUND: In the data pipeline from the data collection process to the planned statistical analyses, initial data analysis (IDA) typically takes place between the end of the data collection and do not touch the research questions. A systematic process for IDA and clear reporting of the findings wo...

Descripción completa

Detalles Bibliográficos
Autores principales: Huebner, Marianne, Vach, Werner, le Cessie, Saskia, Schmidt, Carsten Oliver, Lusa, Lara
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7071755/
https://www.ncbi.nlm.nih.gov/pubmed/32169053
http://dx.doi.org/10.1186/s12874-020-00942-y
_version_ 1783506274183282688
author Huebner, Marianne
Vach, Werner
le Cessie, Saskia
Schmidt, Carsten Oliver
Lusa, Lara
author_facet Huebner, Marianne
Vach, Werner
le Cessie, Saskia
Schmidt, Carsten Oliver
Lusa, Lara
author_sort Huebner, Marianne
collection PubMed
description BACKGROUND: In the data pipeline from the data collection process to the planned statistical analyses, initial data analysis (IDA) typically takes place between the end of the data collection and do not touch the research questions. A systematic process for IDA and clear reporting of the findings would help to understand the potential shortcomings of a dataset, such as missing values, or subgroups with small sample sizes, or shortcomings in the collection process, and to evaluate the impact of these shortcomings on the research results. A clear reporting of findings is also relevant when making datasets available to other researchers. Initial data analyses can provide valuable insights into the suitability of a data set for a future research study. Our aim was to describe the practice of reporting of initial data analyses in observational studies in five highly ranked medical journals with focus on data cleaning, screening, and reporting of findings which led to a potential change in the analysis plan. METHODS: This review was carried out using systematic search strategies with eligibility criteria for articles to be reviewed. A total of 25 papers about observational studies were selected from five medical journals published in 2018. Each paper was reviewed by two reviewers and IDA statements were further discussed by all authors. The consensus was reported. RESULTS: IDA statements were reported in the methods, results, discussion, and supplement of papers. Ten out of 25 papers (40%) included a statement about data cleaning. Data screening statements were included in all articles, and 18 (72%) indicated the methods used to describe them. Item missingness was reported in 11 papers (44%), unit missingness in 15 papers (60%). Eleven papers (44%) mentioned some changes in the analysis plan. Reported changes referred to missing data treatment, unexpected values, population heterogeneity and aspects related to variable distributions or data properties. CONCLUSION: Reporting of initial data analyses were sparse, and statements on IDA were located throughout the research articles. There is a lack of systematic reporting of IDA. We conclude the article with recommendations on how to overcome shortcomings in the practice of IDA reporting in observational studies.
format Online
Article
Text
id pubmed-7071755
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-70717552020-03-18 Hidden analyses: a review of reporting practice and recommendations for more transparent reporting of initial data analyses Huebner, Marianne Vach, Werner le Cessie, Saskia Schmidt, Carsten Oliver Lusa, Lara BMC Med Res Methodol Research Article BACKGROUND: In the data pipeline from the data collection process to the planned statistical analyses, initial data analysis (IDA) typically takes place between the end of the data collection and do not touch the research questions. A systematic process for IDA and clear reporting of the findings would help to understand the potential shortcomings of a dataset, such as missing values, or subgroups with small sample sizes, or shortcomings in the collection process, and to evaluate the impact of these shortcomings on the research results. A clear reporting of findings is also relevant when making datasets available to other researchers. Initial data analyses can provide valuable insights into the suitability of a data set for a future research study. Our aim was to describe the practice of reporting of initial data analyses in observational studies in five highly ranked medical journals with focus on data cleaning, screening, and reporting of findings which led to a potential change in the analysis plan. METHODS: This review was carried out using systematic search strategies with eligibility criteria for articles to be reviewed. A total of 25 papers about observational studies were selected from five medical journals published in 2018. Each paper was reviewed by two reviewers and IDA statements were further discussed by all authors. The consensus was reported. RESULTS: IDA statements were reported in the methods, results, discussion, and supplement of papers. Ten out of 25 papers (40%) included a statement about data cleaning. Data screening statements were included in all articles, and 18 (72%) indicated the methods used to describe them. Item missingness was reported in 11 papers (44%), unit missingness in 15 papers (60%). Eleven papers (44%) mentioned some changes in the analysis plan. Reported changes referred to missing data treatment, unexpected values, population heterogeneity and aspects related to variable distributions or data properties. CONCLUSION: Reporting of initial data analyses were sparse, and statements on IDA were located throughout the research articles. There is a lack of systematic reporting of IDA. We conclude the article with recommendations on how to overcome shortcomings in the practice of IDA reporting in observational studies. BioMed Central 2020-03-13 /pmc/articles/PMC7071755/ /pubmed/32169053 http://dx.doi.org/10.1186/s12874-020-00942-y Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Huebner, Marianne
Vach, Werner
le Cessie, Saskia
Schmidt, Carsten Oliver
Lusa, Lara
Hidden analyses: a review of reporting practice and recommendations for more transparent reporting of initial data analyses
title Hidden analyses: a review of reporting practice and recommendations for more transparent reporting of initial data analyses
title_full Hidden analyses: a review of reporting practice and recommendations for more transparent reporting of initial data analyses
title_fullStr Hidden analyses: a review of reporting practice and recommendations for more transparent reporting of initial data analyses
title_full_unstemmed Hidden analyses: a review of reporting practice and recommendations for more transparent reporting of initial data analyses
title_short Hidden analyses: a review of reporting practice and recommendations for more transparent reporting of initial data analyses
title_sort hidden analyses: a review of reporting practice and recommendations for more transparent reporting of initial data analyses
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7071755/
https://www.ncbi.nlm.nih.gov/pubmed/32169053
http://dx.doi.org/10.1186/s12874-020-00942-y
work_keys_str_mv AT huebnermarianne hiddenanalysesareviewofreportingpracticeandrecommendationsformoretransparentreportingofinitialdataanalyses
AT vachwerner hiddenanalysesareviewofreportingpracticeandrecommendationsformoretransparentreportingofinitialdataanalyses
AT lecessiesaskia hiddenanalysesareviewofreportingpracticeandrecommendationsformoretransparentreportingofinitialdataanalyses
AT schmidtcarstenoliver hiddenanalysesareviewofreportingpracticeandrecommendationsformoretransparentreportingofinitialdataanalyses
AT lusalara hiddenanalysesareviewofreportingpracticeandrecommendationsformoretransparentreportingofinitialdataanalyses
AT hiddenanalysesareviewofreportingpracticeandrecommendationsformoretransparentreportingofinitialdataanalyses