Cargando…

t-tests, non-parametric tests, and large studies—a paradox of statistical practice?

BACKGROUND: During the last 30 years, the median sample size of research studies published in high-impact medical journals has increased manyfold, while the use of non-parametric tests has increased at the expense of t-tests. This paper explores this paradoxical practice and illustrates its conseque...

Descripción completa

Detalles Bibliográficos
Autor principal: Fagerland, Morten W
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3445820/
https://www.ncbi.nlm.nih.gov/pubmed/22697476
http://dx.doi.org/10.1186/1471-2288-12-78
_version_ 1782243865668878336
author Fagerland, Morten W
author_facet Fagerland, Morten W
author_sort Fagerland, Morten W
collection PubMed
description BACKGROUND: During the last 30 years, the median sample size of research studies published in high-impact medical journals has increased manyfold, while the use of non-parametric tests has increased at the expense of t-tests. This paper explores this paradoxical practice and illustrates its consequences. METHODS: A simulation study is used to compare the rejection rates of the Wilcoxon-Mann-Whitney (WMW) test and the two-sample t-test for increasing sample size. Samples are drawn from skewed distributions with equal means and medians but with a small difference in spread. A hypothetical case study is used for illustration and motivation. RESULTS: The WMW test produces, on average, smaller p-values than the t-test. This discrepancy increases with increasing sample size, skewness, and difference in spread. For heavily skewed data, the proportion of p<0.05 with the WMW test can be greater than 90% if the standard deviations differ by 10% and the number of observations is 1000 in each group. The high rejection rates of the WMW test should be interpreted as the power to detect that the probability that a random sample from one of the distributions is less than a random sample from the other distribution is greater than 50%. CONCLUSIONS: Non-parametric tests are most useful for small studies. Using non-parametric tests in large studies may provide answers to the wrong question, thus confusing readers. For studies with a large sample size, t-tests and their corresponding confidence intervals can and should be used even for heavily skewed data.
format Online
Article
Text
id pubmed-3445820
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34458202012-09-20 t-tests, non-parametric tests, and large studies—a paradox of statistical practice? Fagerland, Morten W BMC Med Res Methodol Research Article BACKGROUND: During the last 30 years, the median sample size of research studies published in high-impact medical journals has increased manyfold, while the use of non-parametric tests has increased at the expense of t-tests. This paper explores this paradoxical practice and illustrates its consequences. METHODS: A simulation study is used to compare the rejection rates of the Wilcoxon-Mann-Whitney (WMW) test and the two-sample t-test for increasing sample size. Samples are drawn from skewed distributions with equal means and medians but with a small difference in spread. A hypothetical case study is used for illustration and motivation. RESULTS: The WMW test produces, on average, smaller p-values than the t-test. This discrepancy increases with increasing sample size, skewness, and difference in spread. For heavily skewed data, the proportion of p<0.05 with the WMW test can be greater than 90% if the standard deviations differ by 10% and the number of observations is 1000 in each group. The high rejection rates of the WMW test should be interpreted as the power to detect that the probability that a random sample from one of the distributions is less than a random sample from the other distribution is greater than 50%. CONCLUSIONS: Non-parametric tests are most useful for small studies. Using non-parametric tests in large studies may provide answers to the wrong question, thus confusing readers. For studies with a large sample size, t-tests and their corresponding confidence intervals can and should be used even for heavily skewed data. BioMed Central 2012-06-14 /pmc/articles/PMC3445820/ /pubmed/22697476 http://dx.doi.org/10.1186/1471-2288-12-78 Text en Copyright ©2012 Fagerland; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License(http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Fagerland, Morten W
t-tests, non-parametric tests, and large studies—a paradox of statistical practice?
title t-tests, non-parametric tests, and large studies—a paradox of statistical practice?
title_full t-tests, non-parametric tests, and large studies—a paradox of statistical practice?
title_fullStr t-tests, non-parametric tests, and large studies—a paradox of statistical practice?
title_full_unstemmed t-tests, non-parametric tests, and large studies—a paradox of statistical practice?
title_short t-tests, non-parametric tests, and large studies—a paradox of statistical practice?
title_sort t-tests, non-parametric tests, and large studies—a paradox of statistical practice?
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3445820/
https://www.ncbi.nlm.nih.gov/pubmed/22697476
http://dx.doi.org/10.1186/1471-2288-12-78
work_keys_str_mv AT fagerlandmortenw ttestsnonparametrictestsandlargestudiesaparadoxofstatisticalpractice