Cargando…

Towards evidence-based computational statistics: lessons from clinical research on the role and design of real-data benchmark studies

BACKGROUND: The goal of medical research is to develop interventions that are in some sense superior, with respect to patient outcome, to interventions currently in use. Similarly, the goal of research in methodological computational statistics is to develop data analysis tools that are themselves s...

Descripción completa

Detalles Bibliográficos
Autores principales: Boulesteix, Anne-Laure, Wilson, Rory, Hapfelmeier, Alexander
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5591542/
https://www.ncbi.nlm.nih.gov/pubmed/28888225
http://dx.doi.org/10.1186/s12874-017-0417-2
_version_ 1783262735246557184
author Boulesteix, Anne-Laure
Wilson, Rory
Hapfelmeier, Alexander
author_facet Boulesteix, Anne-Laure
Wilson, Rory
Hapfelmeier, Alexander
author_sort Boulesteix, Anne-Laure
collection PubMed
description BACKGROUND: The goal of medical research is to develop interventions that are in some sense superior, with respect to patient outcome, to interventions currently in use. Similarly, the goal of research in methodological computational statistics is to develop data analysis tools that are themselves superior to the existing tools. The methodology of the evaluation of medical interventions continues to be discussed extensively in the literature and it is now well accepted that medicine should be at least partly “evidence-based”. Although we statisticians are convinced of the importance of unbiased, well-thought-out study designs and evidence-based approaches in the context of clinical research, we tend to ignore these principles when designing our own studies for evaluating statistical methods in the context of our methodological research. MAIN MESSAGE: In this paper, we draw an analogy between clinical trials and real-data-based benchmarking experiments in methodological statistical science, with datasets playing the role of patients and methods playing the role of medical interventions. Through this analogy, we suggest directions for improvement in the design and interpretation of studies which use real data to evaluate statistical methods, in particular with respect to dataset inclusion criteria and the reduction of various forms of bias. More generally, we discuss the concept of “evidence-based” statistical research, its limitations and its impact on the design and interpretation of real-data-based benchmark experiments. CONCLUSION: We suggest that benchmark studies—a method of assessment of statistical methods using real-world datasets—might benefit from adopting (some) concepts from evidence-based medicine towards the goal of more evidence-based statistical research.
format Online
Article
Text
id pubmed-5591542
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-55915422017-09-13 Towards evidence-based computational statistics: lessons from clinical research on the role and design of real-data benchmark studies Boulesteix, Anne-Laure Wilson, Rory Hapfelmeier, Alexander BMC Med Res Methodol Debate BACKGROUND: The goal of medical research is to develop interventions that are in some sense superior, with respect to patient outcome, to interventions currently in use. Similarly, the goal of research in methodological computational statistics is to develop data analysis tools that are themselves superior to the existing tools. The methodology of the evaluation of medical interventions continues to be discussed extensively in the literature and it is now well accepted that medicine should be at least partly “evidence-based”. Although we statisticians are convinced of the importance of unbiased, well-thought-out study designs and evidence-based approaches in the context of clinical research, we tend to ignore these principles when designing our own studies for evaluating statistical methods in the context of our methodological research. MAIN MESSAGE: In this paper, we draw an analogy between clinical trials and real-data-based benchmarking experiments in methodological statistical science, with datasets playing the role of patients and methods playing the role of medical interventions. Through this analogy, we suggest directions for improvement in the design and interpretation of studies which use real data to evaluate statistical methods, in particular with respect to dataset inclusion criteria and the reduction of various forms of bias. More generally, we discuss the concept of “evidence-based” statistical research, its limitations and its impact on the design and interpretation of real-data-based benchmark experiments. CONCLUSION: We suggest that benchmark studies—a method of assessment of statistical methods using real-world datasets—might benefit from adopting (some) concepts from evidence-based medicine towards the goal of more evidence-based statistical research. BioMed Central 2017-09-09 /pmc/articles/PMC5591542/ /pubmed/28888225 http://dx.doi.org/10.1186/s12874-017-0417-2 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Debate
Boulesteix, Anne-Laure
Wilson, Rory
Hapfelmeier, Alexander
Towards evidence-based computational statistics: lessons from clinical research on the role and design of real-data benchmark studies
title Towards evidence-based computational statistics: lessons from clinical research on the role and design of real-data benchmark studies
title_full Towards evidence-based computational statistics: lessons from clinical research on the role and design of real-data benchmark studies
title_fullStr Towards evidence-based computational statistics: lessons from clinical research on the role and design of real-data benchmark studies
title_full_unstemmed Towards evidence-based computational statistics: lessons from clinical research on the role and design of real-data benchmark studies
title_short Towards evidence-based computational statistics: lessons from clinical research on the role and design of real-data benchmark studies
title_sort towards evidence-based computational statistics: lessons from clinical research on the role and design of real-data benchmark studies
topic Debate
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5591542/
https://www.ncbi.nlm.nih.gov/pubmed/28888225
http://dx.doi.org/10.1186/s12874-017-0417-2
work_keys_str_mv AT boulesteixannelaure towardsevidencebasedcomputationalstatisticslessonsfromclinicalresearchontheroleanddesignofrealdatabenchmarkstudies
AT wilsonrory towardsevidencebasedcomputationalstatisticslessonsfromclinicalresearchontheroleanddesignofrealdatabenchmarkstudies
AT hapfelmeieralexander towardsevidencebasedcomputationalstatisticslessonsfromclinicalresearchontheroleanddesignofrealdatabenchmarkstudies