Cargando…

Visualization of large influenza virus sequence datasets using adaptively aggregated trees with sampling-based subscale representation

BACKGROUND: With the amount of influenza genome sequence data growing rapidly, researchers need machine assistance in selecting datasets and exploring the data. Enhanced visualization tools are required to represent results of the exploratory analysis on the web in an easy-to-comprehend form and to...

Descripción completa

Detalles Bibliográficos
Autores principales: Zaslavsky, Leonid, Bao, Yiming, Tatusova, Tatiana A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2416652/
https://www.ncbi.nlm.nih.gov/pubmed/18485197
http://dx.doi.org/10.1186/1471-2105-9-237
_version_ 1782156074453827584
author Zaslavsky, Leonid
Bao, Yiming
Tatusova, Tatiana A
author_facet Zaslavsky, Leonid
Bao, Yiming
Tatusova, Tatiana A
author_sort Zaslavsky, Leonid
collection PubMed
description BACKGROUND: With the amount of influenza genome sequence data growing rapidly, researchers need machine assistance in selecting datasets and exploring the data. Enhanced visualization tools are required to represent results of the exploratory analysis on the web in an easy-to-comprehend form and to facilitate convenient information retrieval. RESULTS: We developed an approach to visualize large phylogenetic trees in an aggregated form with a special representation of subscale details. The initial aggregated tree representation is built with a level of resolution automatically selected to fit into the available screen space, with terminal groups selected based on sequence similarity. The default aggregated representation can be refined by users interactively. Structure and data variability within terminal groups are displayed using small trees that have the same vertical size as the text annotation of the group. These subscale representations are calculated using systematic sampling from the corresponding terminal group. The aggregated tree containing terminal groups can be annotated using aggregation of structured metadata, such as seasonal distribution, geographic locations, etc. AVAILABILITY: The algorithms are implemented in JavaScript within the NCBI Influenza Virus Resource [1].
format Text
id pubmed-2416652
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-24166522008-06-09 Visualization of large influenza virus sequence datasets using adaptively aggregated trees with sampling-based subscale representation Zaslavsky, Leonid Bao, Yiming Tatusova, Tatiana A BMC Bioinformatics Methodology Article BACKGROUND: With the amount of influenza genome sequence data growing rapidly, researchers need machine assistance in selecting datasets and exploring the data. Enhanced visualization tools are required to represent results of the exploratory analysis on the web in an easy-to-comprehend form and to facilitate convenient information retrieval. RESULTS: We developed an approach to visualize large phylogenetic trees in an aggregated form with a special representation of subscale details. The initial aggregated tree representation is built with a level of resolution automatically selected to fit into the available screen space, with terminal groups selected based on sequence similarity. The default aggregated representation can be refined by users interactively. Structure and data variability within terminal groups are displayed using small trees that have the same vertical size as the text annotation of the group. These subscale representations are calculated using systematic sampling from the corresponding terminal group. The aggregated tree containing terminal groups can be annotated using aggregation of structured metadata, such as seasonal distribution, geographic locations, etc. AVAILABILITY: The algorithms are implemented in JavaScript within the NCBI Influenza Virus Resource [1]. BioMed Central 2008-05-16 /pmc/articles/PMC2416652/ /pubmed/18485197 http://dx.doi.org/10.1186/1471-2105-9-237 Text en Copyright © 2008 Zaslavsky et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Zaslavsky, Leonid
Bao, Yiming
Tatusova, Tatiana A
Visualization of large influenza virus sequence datasets using adaptively aggregated trees with sampling-based subscale representation
title Visualization of large influenza virus sequence datasets using adaptively aggregated trees with sampling-based subscale representation
title_full Visualization of large influenza virus sequence datasets using adaptively aggregated trees with sampling-based subscale representation
title_fullStr Visualization of large influenza virus sequence datasets using adaptively aggregated trees with sampling-based subscale representation
title_full_unstemmed Visualization of large influenza virus sequence datasets using adaptively aggregated trees with sampling-based subscale representation
title_short Visualization of large influenza virus sequence datasets using adaptively aggregated trees with sampling-based subscale representation
title_sort visualization of large influenza virus sequence datasets using adaptively aggregated trees with sampling-based subscale representation
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2416652/
https://www.ncbi.nlm.nih.gov/pubmed/18485197
http://dx.doi.org/10.1186/1471-2105-9-237
work_keys_str_mv AT zaslavskyleonid visualizationoflargeinfluenzavirussequencedatasetsusingadaptivelyaggregatedtreeswithsamplingbasedsubscalerepresentation
AT baoyiming visualizationoflargeinfluenzavirussequencedatasetsusingadaptivelyaggregatedtreeswithsamplingbasedsubscalerepresentation
AT tatusovatatianaa visualizationoflargeinfluenzavirussequencedatasetsusingadaptivelyaggregatedtreeswithsamplingbasedsubscalerepresentation