Cargando…
Multivariate Welch t-test on distances
Motivation: Permutational non-Euclidean analysis of variance, PERMANOVA, is routinely used in exploratory analysis of multivariate datasets to draw conclusions about the significance of patterns visualized through dimension reduction. This method recognizes that pairwise distance matrix between obse...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5181538/ https://www.ncbi.nlm.nih.gov/pubmed/27515741 http://dx.doi.org/10.1093/bioinformatics/btw524 |
_version_ | 1782485728723206144 |
---|---|
author | Alekseyenko, Alexander V. |
author_facet | Alekseyenko, Alexander V. |
author_sort | Alekseyenko, Alexander V. |
collection | PubMed |
description | Motivation: Permutational non-Euclidean analysis of variance, PERMANOVA, is routinely used in exploratory analysis of multivariate datasets to draw conclusions about the significance of patterns visualized through dimension reduction. This method recognizes that pairwise distance matrix between observations is sufficient to compute within and between group sums of squares necessary to form the (pseudo) F statistic. Moreover, not only Euclidean, but arbitrary distances can be used. This method, however, suffers from loss of power and type I error inflation in the presence of heteroscedasticity and sample size imbalances. Results: We develop a solution in the form of a distance-based Welch t-test, [Formula: see text] , for two sample potentially unbalanced and heteroscedastic data. We demonstrate empirically the desirable type I error and power characteristics of the new test. We compare the performance of PERMANOVA and [Formula: see text] in reanalysis of two existing microbiome datasets, where the methodology has originated. Availability and Implementation: The source code for methods and analysis of this article is available at https://github.com/alekseyenko/Tw2. Further guidance on application of these methods can be obtained from the author. Contact: alekseye@musc.edu |
format | Online Article Text |
id | pubmed-5181538 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-51815382016-12-27 Multivariate Welch t-test on distances Alekseyenko, Alexander V. Bioinformatics Original Papers Motivation: Permutational non-Euclidean analysis of variance, PERMANOVA, is routinely used in exploratory analysis of multivariate datasets to draw conclusions about the significance of patterns visualized through dimension reduction. This method recognizes that pairwise distance matrix between observations is sufficient to compute within and between group sums of squares necessary to form the (pseudo) F statistic. Moreover, not only Euclidean, but arbitrary distances can be used. This method, however, suffers from loss of power and type I error inflation in the presence of heteroscedasticity and sample size imbalances. Results: We develop a solution in the form of a distance-based Welch t-test, [Formula: see text] , for two sample potentially unbalanced and heteroscedastic data. We demonstrate empirically the desirable type I error and power characteristics of the new test. We compare the performance of PERMANOVA and [Formula: see text] in reanalysis of two existing microbiome datasets, where the methodology has originated. Availability and Implementation: The source code for methods and analysis of this article is available at https://github.com/alekseyenko/Tw2. Further guidance on application of these methods can be obtained from the author. Contact: alekseye@musc.edu Oxford University Press 2016-12-01 2016-08-11 /pmc/articles/PMC5181538/ /pubmed/27515741 http://dx.doi.org/10.1093/bioinformatics/btw524 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Alekseyenko, Alexander V. Multivariate Welch t-test on distances |
title | Multivariate Welch t-test on distances |
title_full | Multivariate Welch t-test on distances |
title_fullStr | Multivariate Welch t-test on distances |
title_full_unstemmed | Multivariate Welch t-test on distances |
title_short | Multivariate Welch t-test on distances |
title_sort | multivariate welch t-test on distances |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5181538/ https://www.ncbi.nlm.nih.gov/pubmed/27515741 http://dx.doi.org/10.1093/bioinformatics/btw524 |
work_keys_str_mv | AT alekseyenkoalexanderv multivariatewelchttestondistances |