Powerful and robust non-parametric association testing for microbiome data via a zero-inflated quantile approach (ZINQ)
BACKGROUND: Identification of bacterial taxa associated with diseases, exposures, and other variables of interest offers a more comprehensive understanding of the role of microbes in many conditions. However, despite considerable research in statistical methods for association testing with microbiom...
Main authors: | Ling, Wodan; Zhao, Ni; Plantinga, Anna M.; Launer, Lenore J.; Fodor, Anthony A.; Meyer, Katie A.; Wu, Michael C. |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2021 |
Subjects: | Methodology |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8414689/ https://www.ncbi.nlm.nih.gov/pubmed/34474689 http://dx.doi.org/10.1186/s40168-021-01129-3 |
_version_ | 1783747829991211008 |
---|---|
author | Ling, Wodan Zhao, Ni Plantinga, Anna M. Launer, Lenore J. Fodor, Anthony A. Meyer, Katie A. Wu, Michael C. |
author_sort | Ling, Wodan |
collection | PubMed |
description | BACKGROUND: Identification of bacterial taxa associated with diseases, exposures, and other variables of interest offers a more comprehensive understanding of the role of microbes in many conditions. However, despite considerable research in statistical methods for association testing with microbiome data, approaches that are generally applicable remain elusive. Classical tests often do not accommodate the realities of microbiome data, leading to power loss. Approaches tailored for microbiome data depend highly upon the normalization strategies used to handle differential read depth and other data characteristics, and they often have unacceptably high false positive rates, generally due to unsatisfied distributional assumptions. On the other hand, many non-parametric tests suffer from loss of power and may also present difficulties in adjusting for potential covariates. Most extant approaches also fail in the presence of heterogeneous effects. The field needs new non-parametric approaches that are tailored to microbiome data, robust to distributional assumptions, and powerful under heterogeneous effects, while permitting adjustment for covariates. METHODS: As an alternative to existing approaches, we propose a zero-inflated quantile approach (ZINQ), which uses a two-part quantile regression model to accommodate the zero inflation in microbiome data. For a given taxon, ZINQ consists of a valid test in logistic regression to model the zero counts, followed by a series of quantile rank-score based tests on multiple quantiles of the non-zero part with adjustment for the zero inflation. As a regression and quantile-based approach, the method is non-parametric and robust to irregular distributions, while providing an allowance for covariate adjustment. Since no distributional assumptions are made, ZINQ can be applied to data that has been processed under any normalization strategy. RESULTS: Thorough simulations based on real data across a range of scenarios and application to real data sets show that ZINQ often has equivalent or higher power compared to existing tests even as it offers better control of false positives. CONCLUSIONS: We present ZINQ, a quantile-based association test between microbiota and dichotomous or quantitative clinical variables, providing a powerful and robust alternative for the current microbiome differential abundance analysis. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s40168-021-01129-3). |
format | Online Article Text |
id | pubmed-8414689 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-8414689 2021-09-09 Powerful and robust non-parametric association testing for microbiome data via a zero-inflated quantile approach (ZINQ) Ling, Wodan Zhao, Ni Plantinga, Anna M. Launer, Lenore J. Fodor, Anthony A. Meyer, Katie A. Wu, Michael C. Microbiome Methodology BioMed Central 2021-09-02 /pmc/articles/PMC8414689/ /pubmed/34474689 http://dx.doi.org/10.1186/s40168-021-01129-3 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/ Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Ling, Wodan Zhao, Ni Plantinga, Anna M. Launer, Lenore J. Fodor, Anthony A. Meyer, Katie A. Wu, Michael C. Powerful and robust non-parametric association testing for microbiome data via a zero-inflated quantile approach (ZINQ) |
title | Powerful and robust non-parametric association testing for microbiome data via a zero-inflated quantile approach (ZINQ) |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8414689/ https://www.ncbi.nlm.nih.gov/pubmed/34474689 http://dx.doi.org/10.1186/s40168-021-01129-3 |
work_keys_str_mv | AT lingwodan powerfulandrobustnonparametricassociationtestingformicrobiomedataviaazeroinflatedquantileapproachzinq AT zhaoni powerfulandrobustnonparametricassociationtestingformicrobiomedataviaazeroinflatedquantileapproachzinq AT plantingaannam powerfulandrobustnonparametricassociationtestingformicrobiomedataviaazeroinflatedquantileapproachzinq AT launerlenorej powerfulandrobustnonparametricassociationtestingformicrobiomedataviaazeroinflatedquantileapproachzinq AT fodoranthonya powerfulandrobustnonparametricassociationtestingformicrobiomedataviaazeroinflatedquantileapproachzinq AT meyerkatiea powerfulandrobustnonparametricassociationtestingformicrobiomedataviaazeroinflatedquantileapproachzinq AT wumichaelc powerfulandrobustnonparametricassociationtestingformicrobiomedataviaazeroinflatedquantileapproachzinq |
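
The METHODS summary in the description field outlines a two-part test: a logistic-regression test on the zero counts, then quantile rank-score tests at several quantiles of the non-zero part. The Python sketch below illustrates that general idea only and is not the authors' implementation. It assumes a single covariate of interest with no adjustment covariates, uses plain sample quantiles for the intercept-only null fit, and combines p-values with a Bonferroni minimum-p rule as a simple stand-in, since the record does not say how ZINQ combines them or how it adjusts for zero inflation. The names `two_part_test`, `score_stat`, and `chi2_1_pvalue` are hypothetical.

```python
import math


def chi2_1_pvalue(stat):
    """Upper-tail p-value of a chi-square statistic with 1 degree of freedom."""
    return math.erfc(math.sqrt(stat / 2.0))


def score_stat(x, resid, null_var):
    """1-df score statistic: (sum (x_i - xbar) * r_i)^2 / (null_var * sum (x_i - xbar)^2).

    Returns 0.0 when the statistic is undefined (no data or no variation).
    """
    n = len(x)
    if n == 0:
        return 0.0
    xbar = sum(x) / n
    u = sum((xi - xbar) * ri for xi, ri in zip(x, resid))
    sxx = sum((xi - xbar) ** 2 for xi in x)
    denom = null_var * sxx
    return (u * u / denom) if denom > 0 else 0.0


def two_part_test(abundance, x, taus=(0.25, 0.5, 0.75)):
    """Sketch of a two-part association test for one taxon (assumed design, see lead-in)."""
    # Part 1: score test for the covariate in a logistic model of presence/absence.
    present = [1 if a > 0 else 0 for a in abundance]
    pbar = sum(present) / len(present)
    p_zero = chi2_1_pvalue(
        score_stat(x, [d - pbar for d in present], pbar * (1.0 - pbar))
    )

    # Part 2: quantile rank-score tests on the non-zero observations only.
    nonzero = [(a, xi) for a, xi in zip(abundance, x) if a > 0]
    y = [a for a, _ in nonzero]
    x_nz = [xi for _, xi in nonzero]
    y_sorted = sorted(y)
    p_quantiles = {}
    for tau in taus:
        if y_sorted:
            # Intercept-only null fit: the sample tau-quantile of the non-zero values.
            idx = max(0, math.ceil(tau * len(y_sorted)) - 1)
            q = y_sorted[idx]
        else:
            q = 0.0
        # Rank score psi_tau(y - q) = tau - 1{y < q}; null variance is tau * (1 - tau).
        resid = [tau - (1.0 if yi < q else 0.0) for yi in y]
        p_quantiles[tau] = chi2_1_pvalue(score_stat(x_nz, resid, tau * (1.0 - tau)))

    # Stand-in combination (assumption): Bonferroni-corrected minimum p-value.
    all_p = [p_zero] + list(p_quantiles.values())
    p_combined = min(1.0, len(all_p) * min(all_p))
    return {"p_zero": p_zero, "p_quantiles": p_quantiles, "p_combined": p_combined}


if __name__ == "__main__":
    group = [0] * 10 + [1] * 10
    counts = [0] * 10 + [5] * 10  # taxon present only in the second group
    print(two_part_test(counts, group))
```

Microbiome counts are discrete and heavily tied, so a practical implementation would need to handle ties at the fitted quantile and support covariate adjustment, both of which this sketch leaves out.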