Cargando…

Multivariate Welch t-test on distances

Motivation: Permutational non-Euclidean analysis of variance, PERMANOVA, is routinely used in exploratory analysis of multivariate datasets to draw conclusions about the significance of patterns visualized through dimension reduction. This method recognizes that pairwise distance matrix between obse...

Descripción completa

Detalles Bibliográficos
Autor principal: Alekseyenko, Alexander V.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5181538/
https://www.ncbi.nlm.nih.gov/pubmed/27515741
http://dx.doi.org/10.1093/bioinformatics/btw524
_version_ 1782485728723206144
author Alekseyenko, Alexander V.
author_facet Alekseyenko, Alexander V.
author_sort Alekseyenko, Alexander V.
collection PubMed
description Motivation: Permutational non-Euclidean analysis of variance, PERMANOVA, is routinely used in exploratory analysis of multivariate datasets to draw conclusions about the significance of patterns visualized through dimension reduction. This method recognizes that pairwise distance matrix between observations is sufficient to compute within and between group sums of squares necessary to form the (pseudo) F statistic. Moreover, not only Euclidean, but arbitrary distances can be used. This method, however, suffers from loss of power and type I error inflation in the presence of heteroscedasticity and sample size imbalances. Results: We develop a solution in the form of a distance-based Welch t-test, [Formula: see text] , for two sample potentially unbalanced and heteroscedastic data. We demonstrate empirically the desirable type I error and power characteristics of the new test. We compare the performance of PERMANOVA and [Formula: see text] in reanalysis of two existing microbiome datasets, where the methodology has originated. Availability and Implementation: The source code for methods and analysis of this article is available at https://github.com/alekseyenko/Tw2. Further guidance on application of these methods can be obtained from the author. Contact: alekseye@musc.edu
format Online
Article
Text
id pubmed-5181538
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-51815382016-12-27 Multivariate Welch t-test on distances Alekseyenko, Alexander V. Bioinformatics Original Papers Motivation: Permutational non-Euclidean analysis of variance, PERMANOVA, is routinely used in exploratory analysis of multivariate datasets to draw conclusions about the significance of patterns visualized through dimension reduction. This method recognizes that pairwise distance matrix between observations is sufficient to compute within and between group sums of squares necessary to form the (pseudo) F statistic. Moreover, not only Euclidean, but arbitrary distances can be used. This method, however, suffers from loss of power and type I error inflation in the presence of heteroscedasticity and sample size imbalances. Results: We develop a solution in the form of a distance-based Welch t-test, [Formula: see text] , for two sample potentially unbalanced and heteroscedastic data. We demonstrate empirically the desirable type I error and power characteristics of the new test. We compare the performance of PERMANOVA and [Formula: see text] in reanalysis of two existing microbiome datasets, where the methodology has originated. Availability and Implementation: The source code for methods and analysis of this article is available at https://github.com/alekseyenko/Tw2. Further guidance on application of these methods can be obtained from the author. Contact: alekseye@musc.edu Oxford University Press 2016-12-01 2016-08-11 /pmc/articles/PMC5181538/ /pubmed/27515741 http://dx.doi.org/10.1093/bioinformatics/btw524 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Alekseyenko, Alexander V.
Multivariate Welch t-test on distances
title Multivariate Welch t-test on distances
title_full Multivariate Welch t-test on distances
title_fullStr Multivariate Welch t-test on distances
title_full_unstemmed Multivariate Welch t-test on distances
title_short Multivariate Welch t-test on distances
title_sort multivariate welch t-test on distances
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5181538/
https://www.ncbi.nlm.nih.gov/pubmed/27515741
http://dx.doi.org/10.1093/bioinformatics/btw524
work_keys_str_mv AT alekseyenkoalexanderv multivariatewelchttestondistances