Cargando…
Bacterial rose garden for metagenomic SNP-based phylogeny visualization
BACKGROUND: One of the most challenging tasks in genomic analysis nowadays is metagenomics. Biomedical applications of metagenomics give rise to datasets containing hundreds and thousands of samples from various body sites for hundreds of patients. Inherently metagenome is by far more complex than a...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4374582/ https://www.ncbi.nlm.nih.gov/pubmed/25815061 http://dx.doi.org/10.1186/s13040-015-0045-5 |
_version_ | 1782363512366956544 |
---|---|
author | Alexeev, Dmitry Bibikova, Tanya Kovarsky, Boris Melnikov, Damir Tyakht, Alexander Govorun, Vadim |
author_facet | Alexeev, Dmitry Bibikova, Tanya Kovarsky, Boris Melnikov, Damir Tyakht, Alexander Govorun, Vadim |
author_sort | Alexeev, Dmitry |
collection | PubMed |
description | BACKGROUND: One of the most challenging tasks in genomic analysis nowadays is metagenomics. Biomedical applications of metagenomics give rise to datasets containing hundreds and thousands of samples from various body sites for hundreds of patients. Inherently metagenome is by far more complex than a single genome as it varies in time by the amount of bacteria comprising it. Other levels of data complexity include geography of the samples and phylogenetic distance between the genomes of the same operational taxonomic unit (OTU). We have developed the visualization concept for the representation of multilayer metagenomics data – the bacterial rose garden. The approach allows to display the taxonomic distance between the representatives of the same OTU in different samples and use variety of the metadata for display. RESULTS: We have developed the principle of visualization allowing for multilayer information representation. We have incorporated data on OTU diversity across metagenomes and origin of the samples. The visual representation we have called “rose” is focused on the phylogenetic distance between the representatives of the same OTU. The visual representation is realized as interactive data chart which allows user to interact with data and explore variables. It is known that classical representation of the taxonomic tree is a reduction of information from original pairwise distance matrix. The visualization presented is a way to save all the information available through projection of distance matrix into single dimensional space of one sample. It could serve as a basis for further more complex information representation. We have used the principle proposed for visualization of 101 bacterial OTUs phylogenetic distances, finally we provide open code for the web page generation. CONCLUSIONS: Bacterial rose garden is a versatile visualization principle coping with the major difficulties of metagenomic big-data visualization without loss of data. The method proposed is showing the interconnectedness of variables and is realized as user-friendly web page allowing for dynamic data exploration. The concept provided serves as one of the original approaches for metagenomic data representation and sharing. Full functional prototype could be found at http://rosegarden.datalaboratory.ru ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-015-0045-5) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4374582 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-43745822015-03-27 Bacterial rose garden for metagenomic SNP-based phylogeny visualization Alexeev, Dmitry Bibikova, Tanya Kovarsky, Boris Melnikov, Damir Tyakht, Alexander Govorun, Vadim BioData Min Methodology BACKGROUND: One of the most challenging tasks in genomic analysis nowadays is metagenomics. Biomedical applications of metagenomics give rise to datasets containing hundreds and thousands of samples from various body sites for hundreds of patients. Inherently metagenome is by far more complex than a single genome as it varies in time by the amount of bacteria comprising it. Other levels of data complexity include geography of the samples and phylogenetic distance between the genomes of the same operational taxonomic unit (OTU). We have developed the visualization concept for the representation of multilayer metagenomics data – the bacterial rose garden. The approach allows to display the taxonomic distance between the representatives of the same OTU in different samples and use variety of the metadata for display. RESULTS: We have developed the principle of visualization allowing for multilayer information representation. We have incorporated data on OTU diversity across metagenomes and origin of the samples. The visual representation we have called “rose” is focused on the phylogenetic distance between the representatives of the same OTU. The visual representation is realized as interactive data chart which allows user to interact with data and explore variables. It is known that classical representation of the taxonomic tree is a reduction of information from original pairwise distance matrix. The visualization presented is a way to save all the information available through projection of distance matrix into single dimensional space of one sample. It could serve as a basis for further more complex information representation. We have used the principle proposed for visualization of 101 bacterial OTUs phylogenetic distances, finally we provide open code for the web page generation. CONCLUSIONS: Bacterial rose garden is a versatile visualization principle coping with the major difficulties of metagenomic big-data visualization without loss of data. The method proposed is showing the interconnectedness of variables and is realized as user-friendly web page allowing for dynamic data exploration. The concept provided serves as one of the original approaches for metagenomic data representation and sharing. Full functional prototype could be found at http://rosegarden.datalaboratory.ru ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-015-0045-5) contains supplementary material, which is available to authorized users. BioMed Central 2015-03-21 /pmc/articles/PMC4374582/ /pubmed/25815061 http://dx.doi.org/10.1186/s13040-015-0045-5 Text en © Alexeev et al.; licensee BioMed Central. 2015 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Alexeev, Dmitry Bibikova, Tanya Kovarsky, Boris Melnikov, Damir Tyakht, Alexander Govorun, Vadim Bacterial rose garden for metagenomic SNP-based phylogeny visualization |
title | Bacterial rose garden for metagenomic SNP-based phylogeny visualization |
title_full | Bacterial rose garden for metagenomic SNP-based phylogeny visualization |
title_fullStr | Bacterial rose garden for metagenomic SNP-based phylogeny visualization |
title_full_unstemmed | Bacterial rose garden for metagenomic SNP-based phylogeny visualization |
title_short | Bacterial rose garden for metagenomic SNP-based phylogeny visualization |
title_sort | bacterial rose garden for metagenomic snp-based phylogeny visualization |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4374582/ https://www.ncbi.nlm.nih.gov/pubmed/25815061 http://dx.doi.org/10.1186/s13040-015-0045-5 |
work_keys_str_mv | AT alexeevdmitry bacterialrosegardenformetagenomicsnpbasedphylogenyvisualization AT bibikovatanya bacterialrosegardenformetagenomicsnpbasedphylogenyvisualization AT kovarskyboris bacterialrosegardenformetagenomicsnpbasedphylogenyvisualization AT melnikovdamir bacterialrosegardenformetagenomicsnpbasedphylogenyvisualization AT tyakhtalexander bacterialrosegardenformetagenomicsnpbasedphylogenyvisualization AT govorunvadim bacterialrosegardenformetagenomicsnpbasedphylogenyvisualization |