Cargando…

Small sample sizes: A big data problem in high-dimensional data analysis

In many experiments and especially in translational and preclinical research, sample sizes are (very) small. In addition, data designs are often high dimensional, i.e. more dependent than independent replications of the trial are observed. The present paper discusses the applicability of max t-test-...

Descripción completa

Detalles Bibliográficos
Autores principales: Konietschke, Frank, Schwab, Karima, Pauly, Markus
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8008424/
https://www.ncbi.nlm.nih.gov/pubmed/33228480
http://dx.doi.org/10.1177/0962280220970228
_version_ 1783672692318142464
author Konietschke, Frank
Schwab, Karima
Pauly, Markus
author_facet Konietschke, Frank
Schwab, Karima
Pauly, Markus
author_sort Konietschke, Frank
collection PubMed
description In many experiments and especially in translational and preclinical research, sample sizes are (very) small. In addition, data designs are often high dimensional, i.e. more dependent than independent replications of the trial are observed. The present paper discusses the applicability of max t-test-type statistics (multiple contrast tests) in high-dimensional designs (repeated measures or multivariate) with small sample sizes. A randomization-based approach is developed to approximate the distribution of the maximum statistic. Extensive simulation studies confirm that the new method is particularly suitable for analyzing data sets with small sample sizes. A real data set illustrates the application of the methods.
format Online
Article
Text
id pubmed-8008424
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-80084242021-04-08 Small sample sizes: A big data problem in high-dimensional data analysis Konietschke, Frank Schwab, Karima Pauly, Markus Stat Methods Med Res Articles In many experiments and especially in translational and preclinical research, sample sizes are (very) small. In addition, data designs are often high dimensional, i.e. more dependent than independent replications of the trial are observed. The present paper discusses the applicability of max t-test-type statistics (multiple contrast tests) in high-dimensional designs (repeated measures or multivariate) with small sample sizes. A randomization-based approach is developed to approximate the distribution of the maximum statistic. Extensive simulation studies confirm that the new method is particularly suitable for analyzing data sets with small sample sizes. A real data set illustrates the application of the methods. SAGE Publications 2020-11-24 2021-03 /pmc/articles/PMC8008424/ /pubmed/33228480 http://dx.doi.org/10.1177/0962280220970228 Text en © The Author(s) 2020 https://creativecommons.org/licenses/by/4.0/ This article is distributed under the terms of the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Articles
Konietschke, Frank
Schwab, Karima
Pauly, Markus
Small sample sizes: A big data problem in high-dimensional data analysis
title Small sample sizes: A big data problem in high-dimensional data analysis
title_full Small sample sizes: A big data problem in high-dimensional data analysis
title_fullStr Small sample sizes: A big data problem in high-dimensional data analysis
title_full_unstemmed Small sample sizes: A big data problem in high-dimensional data analysis
title_short Small sample sizes: A big data problem in high-dimensional data analysis
title_sort small sample sizes: a big data problem in high-dimensional data analysis
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8008424/
https://www.ncbi.nlm.nih.gov/pubmed/33228480
http://dx.doi.org/10.1177/0962280220970228
work_keys_str_mv AT konietschkefrank smallsamplesizesabigdataprobleminhighdimensionaldataanalysis
AT schwabkarima smallsamplesizesabigdataprobleminhighdimensionaldataanalysis
AT paulymarkus smallsamplesizesabigdataprobleminhighdimensionaldataanalysis