Cargando…

Bacterial rose garden for metagenomic SNP-based phylogeny visualization

BACKGROUND: One of the most challenging tasks in genomic analysis nowadays is metagenomics. Biomedical applications of metagenomics give rise to datasets containing hundreds and thousands of samples from various body sites for hundreds of patients. Inherently metagenome is by far more complex than a...

Descripción completa

Detalles Bibliográficos
Autores principales: Alexeev, Dmitry, Bibikova, Tanya, Kovarsky, Boris, Melnikov, Damir, Tyakht, Alexander, Govorun, Vadim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4374582/
https://www.ncbi.nlm.nih.gov/pubmed/25815061
http://dx.doi.org/10.1186/s13040-015-0045-5
_version_ 1782363512366956544
author Alexeev, Dmitry
Bibikova, Tanya
Kovarsky, Boris
Melnikov, Damir
Tyakht, Alexander
Govorun, Vadim
author_facet Alexeev, Dmitry
Bibikova, Tanya
Kovarsky, Boris
Melnikov, Damir
Tyakht, Alexander
Govorun, Vadim
author_sort Alexeev, Dmitry
collection PubMed
description BACKGROUND: One of the most challenging tasks in genomic analysis nowadays is metagenomics. Biomedical applications of metagenomics give rise to datasets containing hundreds and thousands of samples from various body sites for hundreds of patients. Inherently metagenome is by far more complex than a single genome as it varies in time by the amount of bacteria comprising it. Other levels of data complexity include geography of the samples and phylogenetic distance between the genomes of the same operational taxonomic unit (OTU). We have developed the visualization concept for the representation of multilayer metagenomics data – the bacterial rose garden. The approach allows to display the taxonomic distance between the representatives of the same OTU in different samples and use variety of the metadata for display. RESULTS: We have developed the principle of visualization allowing for multilayer information representation. We have incorporated data on OTU diversity across metagenomes and origin of the samples. The visual representation we have called “rose” is focused on the phylogenetic distance between the representatives of the same OTU. The visual representation is realized as interactive data chart which allows user to interact with data and explore variables. It is known that classical representation of the taxonomic tree is a reduction of information from original pairwise distance matrix. The visualization presented is a way to save all the information available through projection of distance matrix into single dimensional space of one sample. It could serve as a basis for further more complex information representation. We have used the principle proposed for visualization of 101 bacterial OTUs phylogenetic distances, finally we provide open code for the web page generation. CONCLUSIONS: Bacterial rose garden is a versatile visualization principle coping with the major difficulties of metagenomic big-data visualization without loss of data. The method proposed is showing the interconnectedness of variables and is realized as user-friendly web page allowing for dynamic data exploration. The concept provided serves as one of the original approaches for metagenomic data representation and sharing. Full functional prototype could be found at http://rosegarden.datalaboratory.ru ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-015-0045-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4374582
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43745822015-03-27 Bacterial rose garden for metagenomic SNP-based phylogeny visualization Alexeev, Dmitry Bibikova, Tanya Kovarsky, Boris Melnikov, Damir Tyakht, Alexander Govorun, Vadim BioData Min Methodology BACKGROUND: One of the most challenging tasks in genomic analysis nowadays is metagenomics. Biomedical applications of metagenomics give rise to datasets containing hundreds and thousands of samples from various body sites for hundreds of patients. Inherently metagenome is by far more complex than a single genome as it varies in time by the amount of bacteria comprising it. Other levels of data complexity include geography of the samples and phylogenetic distance between the genomes of the same operational taxonomic unit (OTU). We have developed the visualization concept for the representation of multilayer metagenomics data – the bacterial rose garden. The approach allows to display the taxonomic distance between the representatives of the same OTU in different samples and use variety of the metadata for display. RESULTS: We have developed the principle of visualization allowing for multilayer information representation. We have incorporated data on OTU diversity across metagenomes and origin of the samples. The visual representation we have called “rose” is focused on the phylogenetic distance between the representatives of the same OTU. The visual representation is realized as interactive data chart which allows user to interact with data and explore variables. It is known that classical representation of the taxonomic tree is a reduction of information from original pairwise distance matrix. The visualization presented is a way to save all the information available through projection of distance matrix into single dimensional space of one sample. It could serve as a basis for further more complex information representation. We have used the principle proposed for visualization of 101 bacterial OTUs phylogenetic distances, finally we provide open code for the web page generation. CONCLUSIONS: Bacterial rose garden is a versatile visualization principle coping with the major difficulties of metagenomic big-data visualization without loss of data. The method proposed is showing the interconnectedness of variables and is realized as user-friendly web page allowing for dynamic data exploration. The concept provided serves as one of the original approaches for metagenomic data representation and sharing. Full functional prototype could be found at http://rosegarden.datalaboratory.ru ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-015-0045-5) contains supplementary material, which is available to authorized users. BioMed Central 2015-03-21 /pmc/articles/PMC4374582/ /pubmed/25815061 http://dx.doi.org/10.1186/s13040-015-0045-5 Text en © Alexeev et al.; licensee BioMed Central. 2015 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Alexeev, Dmitry
Bibikova, Tanya
Kovarsky, Boris
Melnikov, Damir
Tyakht, Alexander
Govorun, Vadim
Bacterial rose garden for metagenomic SNP-based phylogeny visualization
title Bacterial rose garden for metagenomic SNP-based phylogeny visualization
title_full Bacterial rose garden for metagenomic SNP-based phylogeny visualization
title_fullStr Bacterial rose garden for metagenomic SNP-based phylogeny visualization
title_full_unstemmed Bacterial rose garden for metagenomic SNP-based phylogeny visualization
title_short Bacterial rose garden for metagenomic SNP-based phylogeny visualization
title_sort bacterial rose garden for metagenomic snp-based phylogeny visualization
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4374582/
https://www.ncbi.nlm.nih.gov/pubmed/25815061
http://dx.doi.org/10.1186/s13040-015-0045-5
work_keys_str_mv AT alexeevdmitry bacterialrosegardenformetagenomicsnpbasedphylogenyvisualization
AT bibikovatanya bacterialrosegardenformetagenomicsnpbasedphylogenyvisualization
AT kovarskyboris bacterialrosegardenformetagenomicsnpbasedphylogenyvisualization
AT melnikovdamir bacterialrosegardenformetagenomicsnpbasedphylogenyvisualization
AT tyakhtalexander bacterialrosegardenformetagenomicsnpbasedphylogenyvisualization
AT govorunvadim bacterialrosegardenformetagenomicsnpbasedphylogenyvisualization