Cargando…

A Comparison of Aggregate P-Value Methods and Multivariate Statistics for Self-Contained Tests of Metabolic Pathway Analysis

For pathway analysis of genomic data, the most common methods involve combining p-values from individual statistical tests. However, there are several multivariate statistical methods that can be used to test whether a pathway has changed. Because of the large number of variables and pathway sizes i...

Descripción completa

Detalles Bibliográficos
Autor principal: Mitchell, Matthew W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4415974/
https://www.ncbi.nlm.nih.gov/pubmed/25927705
http://dx.doi.org/10.1371/journal.pone.0125081
_version_ 1782369163054940160
author Mitchell, Matthew W.
author_facet Mitchell, Matthew W.
author_sort Mitchell, Matthew W.
collection PubMed
description For pathway analysis of genomic data, the most common methods involve combining p-values from individual statistical tests. However, there are several multivariate statistical methods that can be used to test whether a pathway has changed. Because of the large number of variables and pathway sizes in genomics data, some of these statistics cannot be computed. However, in metabolomics data, the number of variables and pathway sizes are typically much smaller, making such computations feasible. Of particular interest is being able to detect changes in pathways that may not be detected for the individual variables. We compare the performance of both the p-value methods and multivariate statistics for self-contained tests with an extensive simulation study and a human metabolomics study. Permutation tests, rather than asymptotic results are used to assess the statistical significance of the pathways. Furthermore, both one and two-sided alternatives hypotheses are examined. From the human metabolomic study, many pathways were statistically significant, although the majority of the individual variables in the pathway were not. Overall, the p-value methods perform at least as well as the multivariate statistics for these scenarios.
format Online
Article
Text
id pubmed-4415974
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-44159742015-05-07 A Comparison of Aggregate P-Value Methods and Multivariate Statistics for Self-Contained Tests of Metabolic Pathway Analysis Mitchell, Matthew W. PLoS One Research Article For pathway analysis of genomic data, the most common methods involve combining p-values from individual statistical tests. However, there are several multivariate statistical methods that can be used to test whether a pathway has changed. Because of the large number of variables and pathway sizes in genomics data, some of these statistics cannot be computed. However, in metabolomics data, the number of variables and pathway sizes are typically much smaller, making such computations feasible. Of particular interest is being able to detect changes in pathways that may not be detected for the individual variables. We compare the performance of both the p-value methods and multivariate statistics for self-contained tests with an extensive simulation study and a human metabolomics study. Permutation tests, rather than asymptotic results are used to assess the statistical significance of the pathways. Furthermore, both one and two-sided alternatives hypotheses are examined. From the human metabolomic study, many pathways were statistically significant, although the majority of the individual variables in the pathway were not. Overall, the p-value methods perform at least as well as the multivariate statistics for these scenarios. Public Library of Science 2015-04-30 /pmc/articles/PMC4415974/ /pubmed/25927705 http://dx.doi.org/10.1371/journal.pone.0125081 Text en © 2015 Matthew W. Mitchell http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Mitchell, Matthew W.
A Comparison of Aggregate P-Value Methods and Multivariate Statistics for Self-Contained Tests of Metabolic Pathway Analysis
title A Comparison of Aggregate P-Value Methods and Multivariate Statistics for Self-Contained Tests of Metabolic Pathway Analysis
title_full A Comparison of Aggregate P-Value Methods and Multivariate Statistics for Self-Contained Tests of Metabolic Pathway Analysis
title_fullStr A Comparison of Aggregate P-Value Methods and Multivariate Statistics for Self-Contained Tests of Metabolic Pathway Analysis
title_full_unstemmed A Comparison of Aggregate P-Value Methods and Multivariate Statistics for Self-Contained Tests of Metabolic Pathway Analysis
title_short A Comparison of Aggregate P-Value Methods and Multivariate Statistics for Self-Contained Tests of Metabolic Pathway Analysis
title_sort comparison of aggregate p-value methods and multivariate statistics for self-contained tests of metabolic pathway analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4415974/
https://www.ncbi.nlm.nih.gov/pubmed/25927705
http://dx.doi.org/10.1371/journal.pone.0125081
work_keys_str_mv AT mitchellmattheww acomparisonofaggregatepvaluemethodsandmultivariatestatisticsforselfcontainedtestsofmetabolicpathwayanalysis
AT mitchellmattheww comparisonofaggregatepvaluemethodsandmultivariatestatisticsforselfcontainedtestsofmetabolicpathwayanalysis