Cargando…
EMPeror: a tool for visualizing high-throughput microbial community data
BACKGROUND: As microbial ecologists take advantage of high-throughput sequencing technologies to describe microbial communities across ever-increasing numbers of samples, new analysis tools are required to relate the distribution of microbes among larger numbers of communities, and to use increasing...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4076506/ https://www.ncbi.nlm.nih.gov/pubmed/24280061 http://dx.doi.org/10.1186/2047-217X-2-16 |
_version_ | 1782323494450626560 |
---|---|
author | Vázquez-Baeza, Yoshiki Pirrung, Meg Gonzalez, Antonio Knight, Rob |
author_facet | Vázquez-Baeza, Yoshiki Pirrung, Meg Gonzalez, Antonio Knight, Rob |
author_sort | Vázquez-Baeza, Yoshiki |
collection | PubMed |
description | BACKGROUND: As microbial ecologists take advantage of high-throughput sequencing technologies to describe microbial communities across ever-increasing numbers of samples, new analysis tools are required to relate the distribution of microbes among larger numbers of communities, and to use increasingly rich and standards-compliant metadata to understand the biological factors driving these relationships. In particular, the Earth Microbiome Project drives these needs by profiling the genomic content of tens of thousands of samples across multiple environment types. FINDINGS: Features of EMPeror include: ability to visualize gradients and categorical data, visualize different principal coordinates axes, present the data in the form of parallel coordinates, show taxa as well as environmental samples, dynamically adjust the size and transparency of the spheres representing the communities on a per-category basis, dynamically scale the axes according to the fraction of variance each explains, show, hide or recolor points according to arbitrary metadata including that compliant with the MIxS family of standards developed by the Genomic Standards Consortium, display jackknifed-resampled data to assess statistical confidence in clustering, perform coordinate comparisons (useful for procrustes analysis plots), and greatly reduce loading times and overall memory footprint compared with existing approaches. Additionally, ease of sharing, given EMPeror’s small output file size, enables agile collaboration by allowing users to embed these visualizations via emails or web pages without the need for extra plugins. CONCLUSIONS: Here we present EMPeror, an open source and web browser enabled tool with a versatile command line interface that allows researchers to perform rapid exploratory investigations of 3D visualizations of microbial community data, such as the widely used principal coordinates plots. EMPeror includes a rich set of controllers to modify features as a function of the metadata. By being specifically tailored to the requirements of microbial ecologists, EMPeror thus increases the speed with which insight can be gained from large microbiome datasets. |
format | Online Article Text |
id | pubmed-4076506 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40765062014-07-02 EMPeror: a tool for visualizing high-throughput microbial community data Vázquez-Baeza, Yoshiki Pirrung, Meg Gonzalez, Antonio Knight, Rob Gigascience Technical Note BACKGROUND: As microbial ecologists take advantage of high-throughput sequencing technologies to describe microbial communities across ever-increasing numbers of samples, new analysis tools are required to relate the distribution of microbes among larger numbers of communities, and to use increasingly rich and standards-compliant metadata to understand the biological factors driving these relationships. In particular, the Earth Microbiome Project drives these needs by profiling the genomic content of tens of thousands of samples across multiple environment types. FINDINGS: Features of EMPeror include: ability to visualize gradients and categorical data, visualize different principal coordinates axes, present the data in the form of parallel coordinates, show taxa as well as environmental samples, dynamically adjust the size and transparency of the spheres representing the communities on a per-category basis, dynamically scale the axes according to the fraction of variance each explains, show, hide or recolor points according to arbitrary metadata including that compliant with the MIxS family of standards developed by the Genomic Standards Consortium, display jackknifed-resampled data to assess statistical confidence in clustering, perform coordinate comparisons (useful for procrustes analysis plots), and greatly reduce loading times and overall memory footprint compared with existing approaches. Additionally, ease of sharing, given EMPeror’s small output file size, enables agile collaboration by allowing users to embed these visualizations via emails or web pages without the need for extra plugins. CONCLUSIONS: Here we present EMPeror, an open source and web browser enabled tool with a versatile command line interface that allows researchers to perform rapid exploratory investigations of 3D visualizations of microbial community data, such as the widely used principal coordinates plots. EMPeror includes a rich set of controllers to modify features as a function of the metadata. By being specifically tailored to the requirements of microbial ecologists, EMPeror thus increases the speed with which insight can be gained from large microbiome datasets. BioMed Central 2013-11-26 /pmc/articles/PMC4076506/ /pubmed/24280061 http://dx.doi.org/10.1186/2047-217X-2-16 Text en Copyright © 2013 Vázquez-Baeza et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Technical Note Vázquez-Baeza, Yoshiki Pirrung, Meg Gonzalez, Antonio Knight, Rob EMPeror: a tool for visualizing high-throughput microbial community data |
title | EMPeror: a tool for visualizing high-throughput microbial community data |
title_full | EMPeror: a tool for visualizing high-throughput microbial community data |
title_fullStr | EMPeror: a tool for visualizing high-throughput microbial community data |
title_full_unstemmed | EMPeror: a tool for visualizing high-throughput microbial community data |
title_short | EMPeror: a tool for visualizing high-throughput microbial community data |
title_sort | emperor: a tool for visualizing high-throughput microbial community data |
topic | Technical Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4076506/ https://www.ncbi.nlm.nih.gov/pubmed/24280061 http://dx.doi.org/10.1186/2047-217X-2-16 |
work_keys_str_mv | AT vazquezbaezayoshiki emperoratoolforvisualizinghighthroughputmicrobialcommunitydata AT pirrungmeg emperoratoolforvisualizinghighthroughputmicrobialcommunitydata AT gonzalezantonio emperoratoolforvisualizinghighthroughputmicrobialcommunitydata AT knightrob emperoratoolforvisualizinghighthroughputmicrobialcommunitydata |