Cargando…
Visualization of large influenza virus sequence datasets using adaptively aggregated trees with sampling-based subscale representation
BACKGROUND: With the amount of influenza genome sequence data growing rapidly, researchers need machine assistance in selecting datasets and exploring the data. Enhanced visualization tools are required to represent results of the exploratory analysis on the web in an easy-to-comprehend form and to...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2416652/ https://www.ncbi.nlm.nih.gov/pubmed/18485197 http://dx.doi.org/10.1186/1471-2105-9-237 |
_version_ | 1782156074453827584 |
---|---|
author | Zaslavsky, Leonid Bao, Yiming Tatusova, Tatiana A |
author_facet | Zaslavsky, Leonid Bao, Yiming Tatusova, Tatiana A |
author_sort | Zaslavsky, Leonid |
collection | PubMed |
description | BACKGROUND: With the amount of influenza genome sequence data growing rapidly, researchers need machine assistance in selecting datasets and exploring the data. Enhanced visualization tools are required to represent results of the exploratory analysis on the web in an easy-to-comprehend form and to facilitate convenient information retrieval. RESULTS: We developed an approach to visualize large phylogenetic trees in an aggregated form with a special representation of subscale details. The initial aggregated tree representation is built with a level of resolution automatically selected to fit into the available screen space, with terminal groups selected based on sequence similarity. The default aggregated representation can be refined by users interactively. Structure and data variability within terminal groups are displayed using small trees that have the same vertical size as the text annotation of the group. These subscale representations are calculated using systematic sampling from the corresponding terminal group. The aggregated tree containing terminal groups can be annotated using aggregation of structured metadata, such as seasonal distribution, geographic locations, etc. AVAILABILITY: The algorithms are implemented in JavaScript within the NCBI Influenza Virus Resource [1]. |
format | Text |
id | pubmed-2416652 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-24166522008-06-09 Visualization of large influenza virus sequence datasets using adaptively aggregated trees with sampling-based subscale representation Zaslavsky, Leonid Bao, Yiming Tatusova, Tatiana A BMC Bioinformatics Methodology Article BACKGROUND: With the amount of influenza genome sequence data growing rapidly, researchers need machine assistance in selecting datasets and exploring the data. Enhanced visualization tools are required to represent results of the exploratory analysis on the web in an easy-to-comprehend form and to facilitate convenient information retrieval. RESULTS: We developed an approach to visualize large phylogenetic trees in an aggregated form with a special representation of subscale details. The initial aggregated tree representation is built with a level of resolution automatically selected to fit into the available screen space, with terminal groups selected based on sequence similarity. The default aggregated representation can be refined by users interactively. Structure and data variability within terminal groups are displayed using small trees that have the same vertical size as the text annotation of the group. These subscale representations are calculated using systematic sampling from the corresponding terminal group. The aggregated tree containing terminal groups can be annotated using aggregation of structured metadata, such as seasonal distribution, geographic locations, etc. AVAILABILITY: The algorithms are implemented in JavaScript within the NCBI Influenza Virus Resource [1]. BioMed Central 2008-05-16 /pmc/articles/PMC2416652/ /pubmed/18485197 http://dx.doi.org/10.1186/1471-2105-9-237 Text en Copyright © 2008 Zaslavsky et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Zaslavsky, Leonid Bao, Yiming Tatusova, Tatiana A Visualization of large influenza virus sequence datasets using adaptively aggregated trees with sampling-based subscale representation |
title | Visualization of large influenza virus sequence datasets using adaptively aggregated trees with sampling-based subscale representation |
title_full | Visualization of large influenza virus sequence datasets using adaptively aggregated trees with sampling-based subscale representation |
title_fullStr | Visualization of large influenza virus sequence datasets using adaptively aggregated trees with sampling-based subscale representation |
title_full_unstemmed | Visualization of large influenza virus sequence datasets using adaptively aggregated trees with sampling-based subscale representation |
title_short | Visualization of large influenza virus sequence datasets using adaptively aggregated trees with sampling-based subscale representation |
title_sort | visualization of large influenza virus sequence datasets using adaptively aggregated trees with sampling-based subscale representation |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2416652/ https://www.ncbi.nlm.nih.gov/pubmed/18485197 http://dx.doi.org/10.1186/1471-2105-9-237 |
work_keys_str_mv | AT zaslavskyleonid visualizationoflargeinfluenzavirussequencedatasetsusingadaptivelyaggregatedtreeswithsamplingbasedsubscalerepresentation AT baoyiming visualizationoflargeinfluenzavirussequencedatasetsusingadaptivelyaggregatedtreeswithsamplingbasedsubscalerepresentation AT tatusovatatianaa visualizationoflargeinfluenzavirussequencedatasetsusingadaptivelyaggregatedtreeswithsamplingbasedsubscalerepresentation |