Cargando…

Optimal sample size planning for the Wilcoxon‐Mann‐Whitney test

There are many different proposed procedures for sample size planning for the Wilcoxon‐Mann‐Whitney test at given type‐I and type‐II error rates α and β, respectively. Most methods assume very specific models or types of data to simplify calculations (eg, ordered categorical or metric data, location...

Descripción completa

Detalles Bibliográficos
Autores principales:	Happ, Martin, Bathke, Arne C., Brunner, Edgar
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	John Wiley and Sons Inc. 2018
Materias:	Research Articles
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6491996/ https://www.ncbi.nlm.nih.gov/pubmed/30298671 http://dx.doi.org/10.1002/sim.7983

_version_	1783415061519269888
author	Happ, Martin Bathke, Arne C. Brunner, Edgar
author_facet	Happ, Martin Bathke, Arne C. Brunner, Edgar
author_sort	Happ, Martin
collection	PubMed
description	There are many different proposed procedures for sample size planning for the Wilcoxon‐Mann‐Whitney test at given type‐I and type‐II error rates α and β, respectively. Most methods assume very specific models or types of data to simplify calculations (eg, ordered categorical or metric data, location shift alternatives, etc). We present a unified approach that covers metric data with and without ties, count data, ordered categorical data, and even dichotomous data. For that, we calculate the unknown theoretical quantities such as the variances under the null and relevant alternative hypothesis by considering the following “synthetic data” approach. We evaluate data whose empirical distribution functions match the theoretical distribution functions involved in the computations of the unknown theoretical quantities. Then, well‐known relations for the ranks of the data are used for the calculations. In addition to computing the necessary sample size N for a fixed allocation proportion t = n (1)/N, where n (1) is the sample size in the first group and N = n (1) + n (2) is the total sample size, we provide an interval for the optimal allocation rate t, which minimizes the total sample size N. It turns out that, for certain distributions, a balanced design is optimal. We give a characterization of such distributions. Furthermore, we show that the optimal choice of t depends on the ratio of the two variances, which determine the variance of the Wilcoxon‐Mann‐Whitney statistic under the alternative. This is different from an optimal sample size allocation in case of the normal distribution model.
format	Online Article Text
id	pubmed-6491996
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	John Wiley and Sons Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-64919962019-05-06 Optimal sample size planning for the Wilcoxon‐Mann‐Whitney test Happ, Martin Bathke, Arne C. Brunner, Edgar Stat Med Research Articles There are many different proposed procedures for sample size planning for the Wilcoxon‐Mann‐Whitney test at given type‐I and type‐II error rates α and β, respectively. Most methods assume very specific models or types of data to simplify calculations (eg, ordered categorical or metric data, location shift alternatives, etc). We present a unified approach that covers metric data with and without ties, count data, ordered categorical data, and even dichotomous data. For that, we calculate the unknown theoretical quantities such as the variances under the null and relevant alternative hypothesis by considering the following “synthetic data” approach. We evaluate data whose empirical distribution functions match the theoretical distribution functions involved in the computations of the unknown theoretical quantities. Then, well‐known relations for the ranks of the data are used for the calculations. In addition to computing the necessary sample size N for a fixed allocation proportion t = n (1)/N, where n (1) is the sample size in the first group and N = n (1) + n (2) is the total sample size, we provide an interval for the optimal allocation rate t, which minimizes the total sample size N. It turns out that, for certain distributions, a balanced design is optimal. We give a characterization of such distributions. Furthermore, we show that the optimal choice of t depends on the ratio of the two variances, which determine the variance of the Wilcoxon‐Mann‐Whitney statistic under the alternative. This is different from an optimal sample size allocation in case of the normal distribution model. John Wiley and Sons Inc. 2018-10-08 2019-02-10 /pmc/articles/PMC6491996/ /pubmed/30298671 http://dx.doi.org/10.1002/sim.7983 Text en © 2018 The Authors. Statistics in Medicine Published by John Wiley & Sons Ltd. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Articles Happ, Martin Bathke, Arne C. Brunner, Edgar Optimal sample size planning for the Wilcoxon‐Mann‐Whitney test
title	Optimal sample size planning for the Wilcoxon‐Mann‐Whitney test
title_full	Optimal sample size planning for the Wilcoxon‐Mann‐Whitney test
title_fullStr	Optimal sample size planning for the Wilcoxon‐Mann‐Whitney test
title_full_unstemmed	Optimal sample size planning for the Wilcoxon‐Mann‐Whitney test
title_short	Optimal sample size planning for the Wilcoxon‐Mann‐Whitney test
title_sort	optimal sample size planning for the wilcoxon‐mann‐whitney test
topic	Research Articles
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6491996/ https://www.ncbi.nlm.nih.gov/pubmed/30298671 http://dx.doi.org/10.1002/sim.7983
work_keys_str_mv	AT happmartin optimalsamplesizeplanningforthewilcoxonmannwhitneytest AT bathkearnec optimalsamplesizeplanningforthewilcoxonmannwhitneytest AT brunneredgar optimalsamplesizeplanningforthewilcoxonmannwhitneytest

Optimal sample size planning for the Wilcoxon‐Mann‐Whitney test

Ejemplares similares