Cargando…

Associating microbiome composition with environmental covariates using generalized UniFrac distances

Motivation: The human microbiome plays an important role in human disease and health. Identification of factors that affect the microbiome composition can provide insights into disease mechanism as well as suggest ways to modulate the microbiome composition for therapeutical purposes. Distance-based...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Jun, Bittinger, Kyle, Charlson, Emily S., Hoffmann, Christian, Lewis, James, Wu, Gary D., Collman, Ronald G., Bushman, Frederic D., Li, Hongzhe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3413390/
https://www.ncbi.nlm.nih.gov/pubmed/22711789
http://dx.doi.org/10.1093/bioinformatics/bts342
_version_ 1782240053508964352
author Chen, Jun
Bittinger, Kyle
Charlson, Emily S.
Hoffmann, Christian
Lewis, James
Wu, Gary D.
Collman, Ronald G.
Bushman, Frederic D.
Li, Hongzhe
author_facet Chen, Jun
Bittinger, Kyle
Charlson, Emily S.
Hoffmann, Christian
Lewis, James
Wu, Gary D.
Collman, Ronald G.
Bushman, Frederic D.
Li, Hongzhe
author_sort Chen, Jun
collection PubMed
description Motivation: The human microbiome plays an important role in human disease and health. Identification of factors that affect the microbiome composition can provide insights into disease mechanism as well as suggest ways to modulate the microbiome composition for therapeutical purposes. Distance-based statistical tests have been applied to test the association of microbiome composition with environmental or biological covariates. The unweighted and weighted UniFrac distances are the most widely used distance measures. However, these two measures assign too much weight either to rare lineages or to most abundant lineages, which can lead to loss of power when the important composition change occurs in moderately abundant lineages. Results: We develop generalized UniFrac distances that extend the weighted and unweighted UniFrac distances for detecting a much wider range of biologically relevant changes. We evaluate the use of generalized UniFrac distances in associating microbiome composition with environmental covariates using extensive Monte Carlo simulations. Our results show that tests using the unweighted and weighted UniFrac distances are less powerful in detecting abundance change in moderately abundant lineages. In contrast, the generalized UniFrac distance is most powerful in detecting such changes, yet it retains nearly all its power for detecting rare and highly abundant lineages. The generalized UniFrac distance also has an overall better power than the joint use of unweighted/weighted UniFrac distances. Application to two real microbiome datasets has demonstrated gains in power in testing the associations between human microbiome and diet intakes and habitual smoking. Availability: http://cran.r-project.org/web/packages/GUniFrac Contact: hongzhe@upenn.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3413390
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-34133902012-08-07 Associating microbiome composition with environmental covariates using generalized UniFrac distances Chen, Jun Bittinger, Kyle Charlson, Emily S. Hoffmann, Christian Lewis, James Wu, Gary D. Collman, Ronald G. Bushman, Frederic D. Li, Hongzhe Bioinformatics Original Paper Motivation: The human microbiome plays an important role in human disease and health. Identification of factors that affect the microbiome composition can provide insights into disease mechanism as well as suggest ways to modulate the microbiome composition for therapeutical purposes. Distance-based statistical tests have been applied to test the association of microbiome composition with environmental or biological covariates. The unweighted and weighted UniFrac distances are the most widely used distance measures. However, these two measures assign too much weight either to rare lineages or to most abundant lineages, which can lead to loss of power when the important composition change occurs in moderately abundant lineages. Results: We develop generalized UniFrac distances that extend the weighted and unweighted UniFrac distances for detecting a much wider range of biologically relevant changes. We evaluate the use of generalized UniFrac distances in associating microbiome composition with environmental covariates using extensive Monte Carlo simulations. Our results show that tests using the unweighted and weighted UniFrac distances are less powerful in detecting abundance change in moderately abundant lineages. In contrast, the generalized UniFrac distance is most powerful in detecting such changes, yet it retains nearly all its power for detecting rare and highly abundant lineages. The generalized UniFrac distance also has an overall better power than the joint use of unweighted/weighted UniFrac distances. Application to two real microbiome datasets has demonstrated gains in power in testing the associations between human microbiome and diet intakes and habitual smoking. Availability: http://cran.r-project.org/web/packages/GUniFrac Contact: hongzhe@upenn.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2012-08-15 2012-06-17 /pmc/articles/PMC3413390/ /pubmed/22711789 http://dx.doi.org/10.1093/bioinformatics/bts342 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Chen, Jun
Bittinger, Kyle
Charlson, Emily S.
Hoffmann, Christian
Lewis, James
Wu, Gary D.
Collman, Ronald G.
Bushman, Frederic D.
Li, Hongzhe
Associating microbiome composition with environmental covariates using generalized UniFrac distances
title Associating microbiome composition with environmental covariates using generalized UniFrac distances
title_full Associating microbiome composition with environmental covariates using generalized UniFrac distances
title_fullStr Associating microbiome composition with environmental covariates using generalized UniFrac distances
title_full_unstemmed Associating microbiome composition with environmental covariates using generalized UniFrac distances
title_short Associating microbiome composition with environmental covariates using generalized UniFrac distances
title_sort associating microbiome composition with environmental covariates using generalized unifrac distances
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3413390/
https://www.ncbi.nlm.nih.gov/pubmed/22711789
http://dx.doi.org/10.1093/bioinformatics/bts342
work_keys_str_mv AT chenjun associatingmicrobiomecompositionwithenvironmentalcovariatesusinggeneralizedunifracdistances
AT bittingerkyle associatingmicrobiomecompositionwithenvironmentalcovariatesusinggeneralizedunifracdistances
AT charlsonemilys associatingmicrobiomecompositionwithenvironmentalcovariatesusinggeneralizedunifracdistances
AT hoffmannchristian associatingmicrobiomecompositionwithenvironmentalcovariatesusinggeneralizedunifracdistances
AT lewisjames associatingmicrobiomecompositionwithenvironmentalcovariatesusinggeneralizedunifracdistances
AT wugaryd associatingmicrobiomecompositionwithenvironmentalcovariatesusinggeneralizedunifracdistances
AT collmanronaldg associatingmicrobiomecompositionwithenvironmentalcovariatesusinggeneralizedunifracdistances
AT bushmanfredericd associatingmicrobiomecompositionwithenvironmentalcovariatesusinggeneralizedunifracdistances
AT lihongzhe associatingmicrobiomecompositionwithenvironmentalcovariatesusinggeneralizedunifracdistances