Cargando…

Combining p-values from various statistical methods for microbiome data

MOTIVATION: In the field of microbiome analysis, there exist various statistical methods that have been developed for identifying differentially expressed features, that account for the overdispersion and the high sparsity of microbiome data. However, due to the differences in statistical models or...

Descripción completa

Detalles Bibliográficos
Autores principales: Ham, Hyeonjung, Park, Taesung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9686280/
https://www.ncbi.nlm.nih.gov/pubmed/36439799
http://dx.doi.org/10.3389/fmicb.2022.990870
_version_ 1784835709007298560
author Ham, Hyeonjung
Park, Taesung
author_facet Ham, Hyeonjung
Park, Taesung
author_sort Ham, Hyeonjung
collection PubMed
description MOTIVATION: In the field of microbiome analysis, there exist various statistical methods that have been developed for identifying differentially expressed features, that account for the overdispersion and the high sparsity of microbiome data. However, due to the differences in statistical models or test formulations, it is quite often to have inconsistent significance results across statistical methods, that makes it difficult to determine the importance of microbiome taxa. Thus, it is practically important to have the integration of the result from all statistical methods to determine the importance of microbiome taxa. A standard meta-analysis is a powerful tool for integrative analysis and it provides a summary measure by combining p-values from various statistical methods. While there are many meta-analyses available, it is not easy to choose the best meta-analysis that is the most suitable for microbiome data. RESULTS: In this study, we investigated which meta-analysis method most adequately represents the importance of microbiome taxa. We considered Fisher’s method, minimum value of p method, Simes method, Stouffer’s method, Kost method, and Cauchy combination test. Through simulation studies, we showed that Cauchy combination test provides the best combined value of p in the sense that it performed the best among the examined methods while controlling the type 1 error rates. Furthermore, it produced high rank similarity with the true ranks. Through the real data application of colorectal cancer microbiome data, we demonstrated that the most highly ranked microbiome taxa by Cauchy combination test have been reported to be associated with colorectal cancer.
format Online
Article
Text
id pubmed-9686280
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-96862802022-11-25 Combining p-values from various statistical methods for microbiome data Ham, Hyeonjung Park, Taesung Front Microbiol Microbiology MOTIVATION: In the field of microbiome analysis, there exist various statistical methods that have been developed for identifying differentially expressed features, that account for the overdispersion and the high sparsity of microbiome data. However, due to the differences in statistical models or test formulations, it is quite often to have inconsistent significance results across statistical methods, that makes it difficult to determine the importance of microbiome taxa. Thus, it is practically important to have the integration of the result from all statistical methods to determine the importance of microbiome taxa. A standard meta-analysis is a powerful tool for integrative analysis and it provides a summary measure by combining p-values from various statistical methods. While there are many meta-analyses available, it is not easy to choose the best meta-analysis that is the most suitable for microbiome data. RESULTS: In this study, we investigated which meta-analysis method most adequately represents the importance of microbiome taxa. We considered Fisher’s method, minimum value of p method, Simes method, Stouffer’s method, Kost method, and Cauchy combination test. Through simulation studies, we showed that Cauchy combination test provides the best combined value of p in the sense that it performed the best among the examined methods while controlling the type 1 error rates. Furthermore, it produced high rank similarity with the true ranks. Through the real data application of colorectal cancer microbiome data, we demonstrated that the most highly ranked microbiome taxa by Cauchy combination test have been reported to be associated with colorectal cancer. Frontiers Media S.A. 2022-11-10 /pmc/articles/PMC9686280/ /pubmed/36439799 http://dx.doi.org/10.3389/fmicb.2022.990870 Text en Copyright © 2022 Ham and Park. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Ham, Hyeonjung
Park, Taesung
Combining p-values from various statistical methods for microbiome data
title Combining p-values from various statistical methods for microbiome data
title_full Combining p-values from various statistical methods for microbiome data
title_fullStr Combining p-values from various statistical methods for microbiome data
title_full_unstemmed Combining p-values from various statistical methods for microbiome data
title_short Combining p-values from various statistical methods for microbiome data
title_sort combining p-values from various statistical methods for microbiome data
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9686280/
https://www.ncbi.nlm.nih.gov/pubmed/36439799
http://dx.doi.org/10.3389/fmicb.2022.990870
work_keys_str_mv AT hamhyeonjung combiningpvaluesfromvariousstatisticalmethodsformicrobiomedata
AT parktaesung combiningpvaluesfromvariousstatisticalmethodsformicrobiomedata