Cargando…
Combining p-values from various statistical methods for microbiome data
MOTIVATION: In the field of microbiome analysis, there exist various statistical methods that have been developed for identifying differentially expressed features, that account for the overdispersion and the high sparsity of microbiome data. However, due to the differences in statistical models or...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9686280/ https://www.ncbi.nlm.nih.gov/pubmed/36439799 http://dx.doi.org/10.3389/fmicb.2022.990870 |
_version_ | 1784835709007298560 |
---|---|
author | Ham, Hyeonjung Park, Taesung |
author_facet | Ham, Hyeonjung Park, Taesung |
author_sort | Ham, Hyeonjung |
collection | PubMed |
description | MOTIVATION: In the field of microbiome analysis, there exist various statistical methods that have been developed for identifying differentially expressed features, that account for the overdispersion and the high sparsity of microbiome data. However, due to the differences in statistical models or test formulations, it is quite often to have inconsistent significance results across statistical methods, that makes it difficult to determine the importance of microbiome taxa. Thus, it is practically important to have the integration of the result from all statistical methods to determine the importance of microbiome taxa. A standard meta-analysis is a powerful tool for integrative analysis and it provides a summary measure by combining p-values from various statistical methods. While there are many meta-analyses available, it is not easy to choose the best meta-analysis that is the most suitable for microbiome data. RESULTS: In this study, we investigated which meta-analysis method most adequately represents the importance of microbiome taxa. We considered Fisher’s method, minimum value of p method, Simes method, Stouffer’s method, Kost method, and Cauchy combination test. Through simulation studies, we showed that Cauchy combination test provides the best combined value of p in the sense that it performed the best among the examined methods while controlling the type 1 error rates. Furthermore, it produced high rank similarity with the true ranks. Through the real data application of colorectal cancer microbiome data, we demonstrated that the most highly ranked microbiome taxa by Cauchy combination test have been reported to be associated with colorectal cancer. |
format | Online Article Text |
id | pubmed-9686280 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-96862802022-11-25 Combining p-values from various statistical methods for microbiome data Ham, Hyeonjung Park, Taesung Front Microbiol Microbiology MOTIVATION: In the field of microbiome analysis, there exist various statistical methods that have been developed for identifying differentially expressed features, that account for the overdispersion and the high sparsity of microbiome data. However, due to the differences in statistical models or test formulations, it is quite often to have inconsistent significance results across statistical methods, that makes it difficult to determine the importance of microbiome taxa. Thus, it is practically important to have the integration of the result from all statistical methods to determine the importance of microbiome taxa. A standard meta-analysis is a powerful tool for integrative analysis and it provides a summary measure by combining p-values from various statistical methods. While there are many meta-analyses available, it is not easy to choose the best meta-analysis that is the most suitable for microbiome data. RESULTS: In this study, we investigated which meta-analysis method most adequately represents the importance of microbiome taxa. We considered Fisher’s method, minimum value of p method, Simes method, Stouffer’s method, Kost method, and Cauchy combination test. Through simulation studies, we showed that Cauchy combination test provides the best combined value of p in the sense that it performed the best among the examined methods while controlling the type 1 error rates. Furthermore, it produced high rank similarity with the true ranks. Through the real data application of colorectal cancer microbiome data, we demonstrated that the most highly ranked microbiome taxa by Cauchy combination test have been reported to be associated with colorectal cancer. Frontiers Media S.A. 2022-11-10 /pmc/articles/PMC9686280/ /pubmed/36439799 http://dx.doi.org/10.3389/fmicb.2022.990870 Text en Copyright © 2022 Ham and Park. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Microbiology Ham, Hyeonjung Park, Taesung Combining p-values from various statistical methods for microbiome data |
title | Combining p-values from various statistical methods for microbiome data |
title_full | Combining p-values from various statistical methods for microbiome data |
title_fullStr | Combining p-values from various statistical methods for microbiome data |
title_full_unstemmed | Combining p-values from various statistical methods for microbiome data |
title_short | Combining p-values from various statistical methods for microbiome data |
title_sort | combining p-values from various statistical methods for microbiome data |
topic | Microbiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9686280/ https://www.ncbi.nlm.nih.gov/pubmed/36439799 http://dx.doi.org/10.3389/fmicb.2022.990870 |
work_keys_str_mv | AT hamhyeonjung combiningpvaluesfromvariousstatisticalmethodsformicrobiomedata AT parktaesung combiningpvaluesfromvariousstatisticalmethodsformicrobiomedata |