Cargando…

Equivalent input produces different output in the UniFrac significance test

BACKGROUND: UniFrac is a well-known tool for comparing microbial communities and assessing statistically significant differences between communities. In this paper we identify a discrepancy in the UniFrac methodology that causes semantically equivalent inputs to produce different outputs in tests of...

Descripción completa

Detalles Bibliográficos
Autores principales: Long, Jeffrey R, Pittet, Vanessa, Trost, Brett, Yan, Qingxiang, Vickers, David, Haakensen, Monique, Kusalik, Anthony
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4141948/
https://www.ncbi.nlm.nih.gov/pubmed/25124232
http://dx.doi.org/10.1186/1471-2105-15-278
Descripción
Sumario:BACKGROUND: UniFrac is a well-known tool for comparing microbial communities and assessing statistically significant differences between communities. In this paper we identify a discrepancy in the UniFrac methodology that causes semantically equivalent inputs to produce different outputs in tests of statistical significance. RESULTS: The phylogenetic trees that are input into UniFrac may or may not contain abundance counts. An isomorphic transform can be defined that will convert trees between these two formats without altering the semantic meaning of the trees. UniFrac produces different outputs for these equivalent forms of the same input tree. This is illustrated using metagenomics data from a lake sediment study. CONCLUSIONS: Results from the UniFrac tool can vary greatly for the same input depending on the arbitrary choice of input format. Practitioners should be aware of this issue and use the tool with caution to ensure consistency and validity in their analyses. We provide a script to transform inputs between equivalent formats to help researchers achieve this consistency. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-278) contains supplementary material, which is available to authorized users.