Cargando…
Small sample sizes: A big data problem in high-dimensional data analysis
In many experiments and especially in translational and preclinical research, sample sizes are (very) small. In addition, data designs are often high dimensional, i.e. more dependent than independent replications of the trial are observed. The present paper discusses the applicability of max t-test-...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8008424/ https://www.ncbi.nlm.nih.gov/pubmed/33228480 http://dx.doi.org/10.1177/0962280220970228 |
_version_ | 1783672692318142464 |
---|---|
author | Konietschke, Frank Schwab, Karima Pauly, Markus |
author_facet | Konietschke, Frank Schwab, Karima Pauly, Markus |
author_sort | Konietschke, Frank |
collection | PubMed |
description | In many experiments and especially in translational and preclinical research, sample sizes are (very) small. In addition, data designs are often high dimensional, i.e. more dependent than independent replications of the trial are observed. The present paper discusses the applicability of max t-test-type statistics (multiple contrast tests) in high-dimensional designs (repeated measures or multivariate) with small sample sizes. A randomization-based approach is developed to approximate the distribution of the maximum statistic. Extensive simulation studies confirm that the new method is particularly suitable for analyzing data sets with small sample sizes. A real data set illustrates the application of the methods. |
format | Online Article Text |
id | pubmed-8008424 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-80084242021-04-08 Small sample sizes: A big data problem in high-dimensional data analysis Konietschke, Frank Schwab, Karima Pauly, Markus Stat Methods Med Res Articles In many experiments and especially in translational and preclinical research, sample sizes are (very) small. In addition, data designs are often high dimensional, i.e. more dependent than independent replications of the trial are observed. The present paper discusses the applicability of max t-test-type statistics (multiple contrast tests) in high-dimensional designs (repeated measures or multivariate) with small sample sizes. A randomization-based approach is developed to approximate the distribution of the maximum statistic. Extensive simulation studies confirm that the new method is particularly suitable for analyzing data sets with small sample sizes. A real data set illustrates the application of the methods. SAGE Publications 2020-11-24 2021-03 /pmc/articles/PMC8008424/ /pubmed/33228480 http://dx.doi.org/10.1177/0962280220970228 Text en © The Author(s) 2020 https://creativecommons.org/licenses/by/4.0/ This article is distributed under the terms of the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
spellingShingle | Articles Konietschke, Frank Schwab, Karima Pauly, Markus Small sample sizes: A big data problem in high-dimensional data analysis |
title | Small sample sizes: A big data problem in high-dimensional data analysis |
title_full | Small sample sizes: A big data problem in high-dimensional data analysis |
title_fullStr | Small sample sizes: A big data problem in high-dimensional data analysis |
title_full_unstemmed | Small sample sizes: A big data problem in high-dimensional data analysis |
title_short | Small sample sizes: A big data problem in high-dimensional data analysis |
title_sort | small sample sizes: a big data problem in high-dimensional data analysis |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8008424/ https://www.ncbi.nlm.nih.gov/pubmed/33228480 http://dx.doi.org/10.1177/0962280220970228 |
work_keys_str_mv | AT konietschkefrank smallsamplesizesabigdataprobleminhighdimensionaldataanalysis AT schwabkarima smallsamplesizesabigdataprobleminhighdimensionaldataanalysis AT paulymarkus smallsamplesizesabigdataprobleminhighdimensionaldataanalysis |