Cargando…

EMPeror: a tool for visualizing high-throughput microbial community data

BACKGROUND: As microbial ecologists take advantage of high-throughput sequencing technologies to describe microbial communities across ever-increasing numbers of samples, new analysis tools are required to relate the distribution of microbes among larger numbers of communities, and to use increasing...

Descripción completa

Detalles Bibliográficos
Autores principales: Vázquez-Baeza, Yoshiki, Pirrung, Meg, Gonzalez, Antonio, Knight, Rob
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4076506/
https://www.ncbi.nlm.nih.gov/pubmed/24280061
http://dx.doi.org/10.1186/2047-217X-2-16
_version_ 1782323494450626560
author Vázquez-Baeza, Yoshiki
Pirrung, Meg
Gonzalez, Antonio
Knight, Rob
author_facet Vázquez-Baeza, Yoshiki
Pirrung, Meg
Gonzalez, Antonio
Knight, Rob
author_sort Vázquez-Baeza, Yoshiki
collection PubMed
description BACKGROUND: As microbial ecologists take advantage of high-throughput sequencing technologies to describe microbial communities across ever-increasing numbers of samples, new analysis tools are required to relate the distribution of microbes among larger numbers of communities, and to use increasingly rich and standards-compliant metadata to understand the biological factors driving these relationships. In particular, the Earth Microbiome Project drives these needs by profiling the genomic content of tens of thousands of samples across multiple environment types. FINDINGS: Features of EMPeror include: ability to visualize gradients and categorical data, visualize different principal coordinates axes, present the data in the form of parallel coordinates, show taxa as well as environmental samples, dynamically adjust the size and transparency of the spheres representing the communities on a per-category basis, dynamically scale the axes according to the fraction of variance each explains, show, hide or recolor points according to arbitrary metadata including that compliant with the MIxS family of standards developed by the Genomic Standards Consortium, display jackknifed-resampled data to assess statistical confidence in clustering, perform coordinate comparisons (useful for procrustes analysis plots), and greatly reduce loading times and overall memory footprint compared with existing approaches. Additionally, ease of sharing, given EMPeror’s small output file size, enables agile collaboration by allowing users to embed these visualizations via emails or web pages without the need for extra plugins. CONCLUSIONS: Here we present EMPeror, an open source and web browser enabled tool with a versatile command line interface that allows researchers to perform rapid exploratory investigations of 3D visualizations of microbial community data, such as the widely used principal coordinates plots. EMPeror includes a rich set of controllers to modify features as a function of the metadata. By being specifically tailored to the requirements of microbial ecologists, EMPeror thus increases the speed with which insight can be gained from large microbiome datasets.
format Online
Article
Text
id pubmed-4076506
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40765062014-07-02 EMPeror: a tool for visualizing high-throughput microbial community data Vázquez-Baeza, Yoshiki Pirrung, Meg Gonzalez, Antonio Knight, Rob Gigascience Technical Note BACKGROUND: As microbial ecologists take advantage of high-throughput sequencing technologies to describe microbial communities across ever-increasing numbers of samples, new analysis tools are required to relate the distribution of microbes among larger numbers of communities, and to use increasingly rich and standards-compliant metadata to understand the biological factors driving these relationships. In particular, the Earth Microbiome Project drives these needs by profiling the genomic content of tens of thousands of samples across multiple environment types. FINDINGS: Features of EMPeror include: ability to visualize gradients and categorical data, visualize different principal coordinates axes, present the data in the form of parallel coordinates, show taxa as well as environmental samples, dynamically adjust the size and transparency of the spheres representing the communities on a per-category basis, dynamically scale the axes according to the fraction of variance each explains, show, hide or recolor points according to arbitrary metadata including that compliant with the MIxS family of standards developed by the Genomic Standards Consortium, display jackknifed-resampled data to assess statistical confidence in clustering, perform coordinate comparisons (useful for procrustes analysis plots), and greatly reduce loading times and overall memory footprint compared with existing approaches. Additionally, ease of sharing, given EMPeror’s small output file size, enables agile collaboration by allowing users to embed these visualizations via emails or web pages without the need for extra plugins. CONCLUSIONS: Here we present EMPeror, an open source and web browser enabled tool with a versatile command line interface that allows researchers to perform rapid exploratory investigations of 3D visualizations of microbial community data, such as the widely used principal coordinates plots. EMPeror includes a rich set of controllers to modify features as a function of the metadata. By being specifically tailored to the requirements of microbial ecologists, EMPeror thus increases the speed with which insight can be gained from large microbiome datasets. BioMed Central 2013-11-26 /pmc/articles/PMC4076506/ /pubmed/24280061 http://dx.doi.org/10.1186/2047-217X-2-16 Text en Copyright © 2013 Vázquez-Baeza et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Technical Note
Vázquez-Baeza, Yoshiki
Pirrung, Meg
Gonzalez, Antonio
Knight, Rob
EMPeror: a tool for visualizing high-throughput microbial community data
title EMPeror: a tool for visualizing high-throughput microbial community data
title_full EMPeror: a tool for visualizing high-throughput microbial community data
title_fullStr EMPeror: a tool for visualizing high-throughput microbial community data
title_full_unstemmed EMPeror: a tool for visualizing high-throughput microbial community data
title_short EMPeror: a tool for visualizing high-throughput microbial community data
title_sort emperor: a tool for visualizing high-throughput microbial community data
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4076506/
https://www.ncbi.nlm.nih.gov/pubmed/24280061
http://dx.doi.org/10.1186/2047-217X-2-16
work_keys_str_mv AT vazquezbaezayoshiki emperoratoolforvisualizinghighthroughputmicrobialcommunitydata
AT pirrungmeg emperoratoolforvisualizinghighthroughputmicrobialcommunitydata
AT gonzalezantonio emperoratoolforvisualizinghighthroughputmicrobialcommunitydata
AT knightrob emperoratoolforvisualizinghighthroughputmicrobialcommunitydata