Cargando…

Statistical Significance Versus Clinical Importance of Observed Effect Sizes: What Do P Values and Confidence Intervals Really Represent?

Effect size measures are used to quantify treatment effects or associations between variables. Such measures, of which >70 have been described in the literature, include unstandardized and standardized differences in means, risk differences, risk ratios, odds ratios, or correlations. While null h...

Descripción completa

Detalles Bibliográficos
Autores principales: Schober, Patrick, Bossers, Sebastiaan M., Schwarte, Lothar A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Lippincott Williams & Wilkins 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5811238/
https://www.ncbi.nlm.nih.gov/pubmed/29337724
http://dx.doi.org/10.1213/ANE.0000000000002798
_version_ 1783299835922743296
author Schober, Patrick
Bossers, Sebastiaan M.
Schwarte, Lothar A.
author_facet Schober, Patrick
Bossers, Sebastiaan M.
Schwarte, Lothar A.
author_sort Schober, Patrick
collection PubMed
description Effect size measures are used to quantify treatment effects or associations between variables. Such measures, of which >70 have been described in the literature, include unstandardized and standardized differences in means, risk differences, risk ratios, odds ratios, or correlations. While null hypothesis significance testing is the predominant approach to statistical inference on effect sizes, results of such tests are often misinterpreted, provide no information on the magnitude of the estimate, and tell us nothing about the clinically importance of an effect. Hence, researchers should not merely focus on statistical significance but should also report the observed effect size. However, all samples are to some degree affected by randomness, such that there is a certain uncertainty on how well the observed effect size represents the actual magnitude and direction of the effect in the population. Therefore, point estimates of effect sizes should be accompanied by the entire range of plausible values to quantify this uncertainty. This facilitates assessment of how large or small the observed effect could actually be in the population of interest, and hence how clinically important it could be. This tutorial reviews different effect size measures and describes how confidence intervals can be used to address not only the statistical significance but also the clinical significance of the observed effect or association. Moreover, we discuss what P values actually represent, and how they provide supplemental information about the significant versus nonsignificant dichotomy. This tutorial intentionally focuses on an intuitive explanation of concepts and interpretation of results, rather than on the underlying mathematical theory or concepts.
format Online
Article
Text
id pubmed-5811238
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Lippincott Williams & Wilkins
record_format MEDLINE/PubMed
spelling pubmed-58112382018-03-01 Statistical Significance Versus Clinical Importance of Observed Effect Sizes: What Do P Values and Confidence Intervals Really Represent? Schober, Patrick Bossers, Sebastiaan M. Schwarte, Lothar A. Anesth Analg General Articles Effect size measures are used to quantify treatment effects or associations between variables. Such measures, of which >70 have been described in the literature, include unstandardized and standardized differences in means, risk differences, risk ratios, odds ratios, or correlations. While null hypothesis significance testing is the predominant approach to statistical inference on effect sizes, results of such tests are often misinterpreted, provide no information on the magnitude of the estimate, and tell us nothing about the clinically importance of an effect. Hence, researchers should not merely focus on statistical significance but should also report the observed effect size. However, all samples are to some degree affected by randomness, such that there is a certain uncertainty on how well the observed effect size represents the actual magnitude and direction of the effect in the population. Therefore, point estimates of effect sizes should be accompanied by the entire range of plausible values to quantify this uncertainty. This facilitates assessment of how large or small the observed effect could actually be in the population of interest, and hence how clinically important it could be. This tutorial reviews different effect size measures and describes how confidence intervals can be used to address not only the statistical significance but also the clinical significance of the observed effect or association. Moreover, we discuss what P values actually represent, and how they provide supplemental information about the significant versus nonsignificant dichotomy. This tutorial intentionally focuses on an intuitive explanation of concepts and interpretation of results, rather than on the underlying mathematical theory or concepts. Lippincott Williams & Wilkins 2018-03 2018-01-15 /pmc/articles/PMC5811238/ /pubmed/29337724 http://dx.doi.org/10.1213/ANE.0000000000002798 Text en Copyright © 2018 The Author(s). Published by Wolters Kluwer Health, Inc. on behalf of the International Anesthesia Research Society. This is an open-access article distributed under the terms of the Creative Commons Attribution-Non Commercial-No Derivatives License 4.0 (CCBY-NC-ND) (http://creativecommons.org/licenses/by-nc-nd/4.0/) , where it is permissible to download and share the work provided it is properly cited. The work cannot be changed in any way or used commercially without permission from the journal.
spellingShingle General Articles
Schober, Patrick
Bossers, Sebastiaan M.
Schwarte, Lothar A.
Statistical Significance Versus Clinical Importance of Observed Effect Sizes: What Do P Values and Confidence Intervals Really Represent?
title Statistical Significance Versus Clinical Importance of Observed Effect Sizes: What Do P Values and Confidence Intervals Really Represent?
title_full Statistical Significance Versus Clinical Importance of Observed Effect Sizes: What Do P Values and Confidence Intervals Really Represent?
title_fullStr Statistical Significance Versus Clinical Importance of Observed Effect Sizes: What Do P Values and Confidence Intervals Really Represent?
title_full_unstemmed Statistical Significance Versus Clinical Importance of Observed Effect Sizes: What Do P Values and Confidence Intervals Really Represent?
title_short Statistical Significance Versus Clinical Importance of Observed Effect Sizes: What Do P Values and Confidence Intervals Really Represent?
title_sort statistical significance versus clinical importance of observed effect sizes: what do p values and confidence intervals really represent?
topic General Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5811238/
https://www.ncbi.nlm.nih.gov/pubmed/29337724
http://dx.doi.org/10.1213/ANE.0000000000002798
work_keys_str_mv AT schoberpatrick statisticalsignificanceversusclinicalimportanceofobservedeffectsizeswhatdopvaluesandconfidenceintervalsreallyrepresent
AT bosserssebastiaanm statisticalsignificanceversusclinicalimportanceofobservedeffectsizeswhatdopvaluesandconfidenceintervalsreallyrepresent
AT schwartelothara statisticalsignificanceversusclinicalimportanceofobservedeffectsizeswhatdopvaluesandconfidenceintervalsreallyrepresent