Powerful and robust non-parametric association testing for microbiome data via a zero-inflated quantile approach (ZINQ)

BACKGROUND: Identification of bacterial taxa associated with diseases, exposures, and other variables of interest offers a more comprehensive understanding of the role of microbes in many conditions. However, despite considerable research in statistical methods for association testing with microbiom...

Bibliographic Details
Main authors: Ling, Wodan; Zhao, Ni; Plantinga, Anna M.; Launer, Lenore J.; Fodor, Anthony A.; Meyer, Katie A.; Wu, Michael C.
Format: Online Article Text
Language: English
Published: BioMed Central 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8414689/
https://www.ncbi.nlm.nih.gov/pubmed/34474689
http://dx.doi.org/10.1186/s40168-021-01129-3
author Ling, Wodan
Zhao, Ni
Plantinga, Anna M.
Launer, Lenore J.
Fodor, Anthony A.
Meyer, Katie A.
Wu, Michael C.
collection PubMed
description BACKGROUND: Identification of bacterial taxa associated with diseases, exposures, and other variables of interest offers a more comprehensive understanding of the role of microbes in many conditions. However, despite considerable research in statistical methods for association testing with microbiome data, approaches that are generally applicable remain elusive. Classical tests often do not accommodate the realities of microbiome data, leading to power loss. Approaches tailored for microbiome data depend highly upon the normalization strategies used to handle differential read depth and other data characteristics, and they often have unacceptably high false positive rates, generally due to unsatisfied distributional assumptions. On the other hand, many non-parametric tests suffer from loss of power and may also present difficulties in adjusting for potential covariates. Most extant approaches also fail in the presence of heterogeneous effects. The field needs new non-parametric approaches that are tailored to microbiome data, robust to distributional assumptions, and powerful under heterogeneous effects, while permitting adjustment for covariates. METHODS: As an alternative to existing approaches, we propose a zero-inflated quantile approach (ZINQ), which uses a two-part quantile regression model to accommodate the zero inflation in microbiome data. For a given taxon, ZINQ consists of a valid test in logistic regression to model the zero counts, followed by a series of quantile rank-score based tests on multiple quantiles of the non-zero part with adjustment for the zero inflation. As a regression and quantile-based approach, the method is non-parametric and robust to irregular distributions, while providing an allowance for covariate adjustment. Since no distributional assumptions are made, ZINQ can be applied to data that has been processed under any normalization strategy. 
RESULTS: Thorough simulations based on real data across a range of scenarios and application to real data sets show that ZINQ often has equivalent or higher power compared to existing tests even as it offers better control of false positives. CONCLUSIONS: We present ZINQ, a quantile-based association test between microbiota and dichotomous or quantitative clinical variables, providing a powerful and robust alternative for the current microbiome differential abundance analysis. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s40168-021-01129-3).
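The two-part structure described under METHODS can be illustrated with a minimal sketch. It is not the authors' implementation and it simplifies the method in several ways, all of which are assumptions made here for illustration: a single covariate with no adjustment covariates, an intercept-only null model (so the logistic score test and the quantile rank-score tests reduce to closed forms), no adjustment of the quantile levels for the zero-inflation, and an equal-weight Cauchy combination of the marginal p-values. The function names and the quantile grid are hypothetical.

```python
import math


def chi2_1df_pvalue(stat):
    """Upper-tail p-value of a chi-square statistic with 1 degree of freedom."""
    return math.erfc(math.sqrt(stat / 2.0))


def score_test(x, residual, null_variance):
    """1-df score test for one covariate under an intercept-only null model."""
    x_bar = sum(x) / len(x)
    centered = [xi - x_bar for xi in x]
    score = sum(c * r for c, r in zip(centered, residual))
    variance = null_variance * sum(c * c for c in centered)
    if variance == 0:
        return 1.0  # no information (constant covariate or constant outcome)
    return chi2_1df_pvalue(score * score / variance)


def cauchy_combine(pvalues):
    """Equal-weight Cauchy combination of p-values (an assumption of this sketch)."""
    clipped = [min(max(p, 1e-15), 1.0 - 1e-15) for p in pvalues]
    stat = sum(math.tan((0.5 - p) * math.pi) for p in clipped) / len(clipped)
    return 0.5 - math.atan(stat) / math.pi


def zero_inflated_quantile_sketch(y, x, taus=(0.1, 0.25, 0.5, 0.75, 0.9)):
    """Return (combined p-value, list of marginal p-values) for one taxon."""
    # Part 1: score test in a logistic model for presence/absence of the taxon.
    present = [1.0 if yi > 0 else 0.0 for yi in y]
    p_hat = sum(present) / len(present)
    pvalues = [score_test(x, [d - p_hat for d in present], p_hat * (1.0 - p_hat))]

    # Part 2: quantile rank-score tests on the non-zero abundances.
    nonzero = [(yi, xi) for yi, xi in zip(y, x) if yi > 0]
    if len(nonzero) >= 2:
        y_sorted = sorted(yi for yi, _ in nonzero)
        x_nz = [xi for _, xi in nonzero]
        for tau in taus:
            # Intercept-only quantile fit: the empirical tau-th quantile.
            q = y_sorted[max(0, math.ceil(tau * len(y_sorted)) - 1)]
            rank_score = [tau - (1.0 if yi < q else 0.0) for yi, _ in nonzero]
            pvalues.append(score_test(x_nz, rank_score, tau * (1.0 - tau)))

    return cauchy_combine(pvalues), pvalues


# Toy data: group 1 has fewer zeros and larger non-zero abundances.
group = [0] * 50 + [1] * 50
abundance = [0] * 25 + list(range(1, 26)) + [0] * 5 + list(range(101, 146))
combined_p, marginal_p = zero_inflated_quantile_sketch(abundance, group)
print(combined_p, marginal_p)
```

Testing several quantiles rather than only the mean or median is what lets a procedure of this kind pick up heterogeneous effects, for example a shift that appears only in the upper tail of the non-zero abundances.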
format Online
Article
Text
id pubmed-8414689
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-8414689 2021-09-09 Powerful and robust non-parametric association testing for microbiome data via a zero-inflated quantile approach (ZINQ) Ling, Wodan; Zhao, Ni; Plantinga, Anna M.; Launer, Lenore J.; Fodor, Anthony A.; Meyer, Katie A.; Wu, Michael C. Microbiome Methodology BioMed Central 2021-09-02 /pmc/articles/PMC8414689/ /pubmed/34474689 http://dx.doi.org/10.1186/s40168-021-01129-3 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
title Powerful and robust non-parametric association testing for microbiome data via a zero-inflated quantile approach (ZINQ)
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8414689/
https://www.ncbi.nlm.nih.gov/pubmed/34474689
http://dx.doi.org/10.1186/s40168-021-01129-3