Testing for baseline differences in randomized controlled trials: an unhealthy research behavior that is hard to eradicate

BACKGROUND: According to the CONSORT statement, significance testing of baseline differences in randomized controlled trials should not be performed. In fact, this practice has been discouraged by numerous authors throughout the last forty years. During that time span, reporting of baseline differen...

Bibliographic Details
Main Authors:	de Boer, Michiel R; Waterlander, Wilma E; Kuijper, Lothar DJ; Steenhuis, Ingrid HM; Twisk, Jos WR
Format:	Online Article Text
Language:	English
Published:	BioMed Central 2015
Subjects:	Debate
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4310023/ https://www.ncbi.nlm.nih.gov/pubmed/25616598 http://dx.doi.org/10.1186/s12966-015-0162-z

author	de Boer, Michiel R; Waterlander, Wilma E; Kuijper, Lothar DJ; Steenhuis, Ingrid HM; Twisk, Jos WR
collection	PubMed
description	BACKGROUND: According to the CONSORT statement, significance testing of baseline differences in randomized controlled trials should not be performed. In fact, this practice has been discouraged by numerous authors throughout the last forty years. During that time span, reporting of baseline differences has substantially decreased in the leading general medical journals. Our own experience in the field of nutrition behavior research however, is that co-authors, reviewers and even editors are still very persistent in their demand for these tests. The aim of this paper is therefore to negate this demand by providing clear evidence as to why testing for baseline differences between intervention groups statistically is superfluous and why such results should not be published. DISCUSSION: Testing for baseline differences is often propagated because of the belief that it shows whether randomization was successful and it identifies real or important differences between treatment arms that should be accounted for in the statistical analyses. Especially the latter argument is flawed, because it ignores the fact that the prognostic strength of a variable is also important when the interest is in adjustment for confounding. In addition, including prognostic variables as covariates can increase the precision of the effect estimate. This means that choosing covariates based on significance tests for baseline differences might lead to omissions of important covariates and, less importantly, to inclusion of irrelevant covariates in the analysis. We used data from four supermarket trials on the effects of pricing strategies on fruit and vegetables purchases, to show that results from fully adjusted analyses sometimes do appreciably differ from results from analyses adjusted for significant baseline differences only. We propose to adjust for known or anticipated important prognostic variables. These could or should be pre-specified in trial protocols. Subsequently, authors should report results from the fully adjusted as well as crude analyses, especially for dichotomous and time to event data. SUMMARY: Based on our arguments, which were illustrated by our findings, we propose that journals in and outside the field of nutrition behavior actively adopt the CONSORT 2010 statement on this topic by not publishing significance tests for baseline differences anymore.
format	Online Article Text
id	pubmed-4310023
institution	National Center for Biotechnology Information
language	English
publishDate	2015
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-4310023 2015-01-30 Testing for baseline differences in randomized controlled trials: an unhealthy research behavior that is hard to eradicate de Boer, Michiel R Waterlander, Wilma E Kuijper, Lothar DJ Steenhuis, Ingrid HM Twisk, Jos WR Int J Behav Nutr Phys Act Debate BioMed Central 2015-01-24 /pmc/articles/PMC4310023/ /pubmed/25616598 http://dx.doi.org/10.1186/s12966-015-0162-z Text en © de Boer et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title	Testing for baseline differences in randomized controlled trials: an unhealthy research behavior that is hard to eradicate
topic	Debate
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4310023/ https://www.ncbi.nlm.nih.gov/pubmed/25616598 http://dx.doi.org/10.1186/s12966-015-0162-z
