Cargando…
Associating microbiome composition with environmental covariates using generalized UniFrac distances
Motivation: The human microbiome plays an important role in human disease and health. Identification of factors that affect the microbiome composition can provide insights into disease mechanism as well as suggest ways to modulate the microbiome composition for therapeutical purposes. Distance-based...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3413390/ https://www.ncbi.nlm.nih.gov/pubmed/22711789 http://dx.doi.org/10.1093/bioinformatics/bts342 |
_version_ | 1782240053508964352 |
---|---|
author | Chen, Jun Bittinger, Kyle Charlson, Emily S. Hoffmann, Christian Lewis, James Wu, Gary D. Collman, Ronald G. Bushman, Frederic D. Li, Hongzhe |
author_facet | Chen, Jun Bittinger, Kyle Charlson, Emily S. Hoffmann, Christian Lewis, James Wu, Gary D. Collman, Ronald G. Bushman, Frederic D. Li, Hongzhe |
author_sort | Chen, Jun |
collection | PubMed |
description | Motivation: The human microbiome plays an important role in human disease and health. Identification of factors that affect the microbiome composition can provide insights into disease mechanism as well as suggest ways to modulate the microbiome composition for therapeutical purposes. Distance-based statistical tests have been applied to test the association of microbiome composition with environmental or biological covariates. The unweighted and weighted UniFrac distances are the most widely used distance measures. However, these two measures assign too much weight either to rare lineages or to most abundant lineages, which can lead to loss of power when the important composition change occurs in moderately abundant lineages. Results: We develop generalized UniFrac distances that extend the weighted and unweighted UniFrac distances for detecting a much wider range of biologically relevant changes. We evaluate the use of generalized UniFrac distances in associating microbiome composition with environmental covariates using extensive Monte Carlo simulations. Our results show that tests using the unweighted and weighted UniFrac distances are less powerful in detecting abundance change in moderately abundant lineages. In contrast, the generalized UniFrac distance is most powerful in detecting such changes, yet it retains nearly all its power for detecting rare and highly abundant lineages. The generalized UniFrac distance also has an overall better power than the joint use of unweighted/weighted UniFrac distances. Application to two real microbiome datasets has demonstrated gains in power in testing the associations between human microbiome and diet intakes and habitual smoking. Availability: http://cran.r-project.org/web/packages/GUniFrac Contact: hongzhe@upenn.edu Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-3413390 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-34133902012-08-07 Associating microbiome composition with environmental covariates using generalized UniFrac distances Chen, Jun Bittinger, Kyle Charlson, Emily S. Hoffmann, Christian Lewis, James Wu, Gary D. Collman, Ronald G. Bushman, Frederic D. Li, Hongzhe Bioinformatics Original Paper Motivation: The human microbiome plays an important role in human disease and health. Identification of factors that affect the microbiome composition can provide insights into disease mechanism as well as suggest ways to modulate the microbiome composition for therapeutical purposes. Distance-based statistical tests have been applied to test the association of microbiome composition with environmental or biological covariates. The unweighted and weighted UniFrac distances are the most widely used distance measures. However, these two measures assign too much weight either to rare lineages or to most abundant lineages, which can lead to loss of power when the important composition change occurs in moderately abundant lineages. Results: We develop generalized UniFrac distances that extend the weighted and unweighted UniFrac distances for detecting a much wider range of biologically relevant changes. We evaluate the use of generalized UniFrac distances in associating microbiome composition with environmental covariates using extensive Monte Carlo simulations. Our results show that tests using the unweighted and weighted UniFrac distances are less powerful in detecting abundance change in moderately abundant lineages. In contrast, the generalized UniFrac distance is most powerful in detecting such changes, yet it retains nearly all its power for detecting rare and highly abundant lineages. The generalized UniFrac distance also has an overall better power than the joint use of unweighted/weighted UniFrac distances. Application to two real microbiome datasets has demonstrated gains in power in testing the associations between human microbiome and diet intakes and habitual smoking. Availability: http://cran.r-project.org/web/packages/GUniFrac Contact: hongzhe@upenn.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2012-08-15 2012-06-17 /pmc/articles/PMC3413390/ /pubmed/22711789 http://dx.doi.org/10.1093/bioinformatics/bts342 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Chen, Jun Bittinger, Kyle Charlson, Emily S. Hoffmann, Christian Lewis, James Wu, Gary D. Collman, Ronald G. Bushman, Frederic D. Li, Hongzhe Associating microbiome composition with environmental covariates using generalized UniFrac distances |
title | Associating microbiome composition with environmental covariates using generalized UniFrac distances |
title_full | Associating microbiome composition with environmental covariates using generalized UniFrac distances |
title_fullStr | Associating microbiome composition with environmental covariates using generalized UniFrac distances |
title_full_unstemmed | Associating microbiome composition with environmental covariates using generalized UniFrac distances |
title_short | Associating microbiome composition with environmental covariates using generalized UniFrac distances |
title_sort | associating microbiome composition with environmental covariates using generalized unifrac distances |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3413390/ https://www.ncbi.nlm.nih.gov/pubmed/22711789 http://dx.doi.org/10.1093/bioinformatics/bts342 |
work_keys_str_mv | AT chenjun associatingmicrobiomecompositionwithenvironmentalcovariatesusinggeneralizedunifracdistances AT bittingerkyle associatingmicrobiomecompositionwithenvironmentalcovariatesusinggeneralizedunifracdistances AT charlsonemilys associatingmicrobiomecompositionwithenvironmentalcovariatesusinggeneralizedunifracdistances AT hoffmannchristian associatingmicrobiomecompositionwithenvironmentalcovariatesusinggeneralizedunifracdistances AT lewisjames associatingmicrobiomecompositionwithenvironmentalcovariatesusinggeneralizedunifracdistances AT wugaryd associatingmicrobiomecompositionwithenvironmentalcovariatesusinggeneralizedunifracdistances AT collmanronaldg associatingmicrobiomecompositionwithenvironmentalcovariatesusinggeneralizedunifracdistances AT bushmanfredericd associatingmicrobiomecompositionwithenvironmentalcovariatesusinggeneralizedunifracdistances AT lihongzhe associatingmicrobiomecompositionwithenvironmentalcovariatesusinggeneralizedunifracdistances |