Cargando…

Genome display tool: visualizing features in complex data sets

BACKGROUND: The enormity of the information contained in large data sets makes it difficult to develop intuitive understanding. It would be useful to have software that allows visualization of possible correlations between properties that can be associated with a core data set. In the case of bacter...

Descripción completa

Detalles Bibliográficos
Autores principales: Viswanath, Lalitha, Lu, Yue, Fox, George E
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1805442/
https://www.ncbi.nlm.nih.gov/pubmed/17300731
http://dx.doi.org/10.1186/1751-0473-2-1
_version_ 1782132478013603840
author Viswanath, Lalitha
Lu, Yue
Fox, George E
author_facet Viswanath, Lalitha
Lu, Yue
Fox, George E
author_sort Viswanath, Lalitha
collection PubMed
description BACKGROUND: The enormity of the information contained in large data sets makes it difficult to develop intuitive understanding. It would be useful to have software that allows visualization of possible correlations between properties that can be associated with a core data set. In the case of bacterial genomes, existing visualization tools focus on either global properties such as variations in composition or detailed local displays of the features that comprise the annotation. It is not easy to visualize other information in the context of this core information. RESULTS: A Java based software known as the Genome Display Tool (GDT), allows the user to simultaneously view the distribution of multiple attributes pertaining to genes and intragenic regions in a single bacterial genome using different colours and shapes on a single screen. The display represents each gene by small boxes that correlate with physical position in the genome. The size of the boxes is dynamically allocated based on the number of genes and a zoom feature allows close-up inspection of regions of interest. The display is interfaced with a MS-Access relational database and can display any feature in the database that can be represented by discrete values. Data is readily added to the database from an MS-Excel spread sheet. The functionality of GDT is demonstrated by comparing the results of two predictions of recent horizontal transfer events in the genome of Synechocystis PCC-6803. The resulting display allows the user to immediately see how much agreement exists between the two methods and also visualize how genes in various categories (e.g. predicted in both methods, one method etc) are distributed in the genome. CONCLUSION: The GDT software provides the user with a powerful tool that allows development of an intuitive understanding of the relative distribution of features in a large data set. As additional features are added to the data set, the number of possible correlations that can be visualized grows rapidly. Although described here for use in bacterial genomics, the principle is general and similar software might be useful in other contexts such as patient studies.
format Text
id pubmed-1805442
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-18054422007-02-27 Genome display tool: visualizing features in complex data sets Viswanath, Lalitha Lu, Yue Fox, George E Source Code Biol Med Research BACKGROUND: The enormity of the information contained in large data sets makes it difficult to develop intuitive understanding. It would be useful to have software that allows visualization of possible correlations between properties that can be associated with a core data set. In the case of bacterial genomes, existing visualization tools focus on either global properties such as variations in composition or detailed local displays of the features that comprise the annotation. It is not easy to visualize other information in the context of this core information. RESULTS: A Java based software known as the Genome Display Tool (GDT), allows the user to simultaneously view the distribution of multiple attributes pertaining to genes and intragenic regions in a single bacterial genome using different colours and shapes on a single screen. The display represents each gene by small boxes that correlate with physical position in the genome. The size of the boxes is dynamically allocated based on the number of genes and a zoom feature allows close-up inspection of regions of interest. The display is interfaced with a MS-Access relational database and can display any feature in the database that can be represented by discrete values. Data is readily added to the database from an MS-Excel spread sheet. The functionality of GDT is demonstrated by comparing the results of two predictions of recent horizontal transfer events in the genome of Synechocystis PCC-6803. The resulting display allows the user to immediately see how much agreement exists between the two methods and also visualize how genes in various categories (e.g. predicted in both methods, one method etc) are distributed in the genome. CONCLUSION: The GDT software provides the user with a powerful tool that allows development of an intuitive understanding of the relative distribution of features in a large data set. As additional features are added to the data set, the number of possible correlations that can be visualized grows rapidly. Although described here for use in bacterial genomics, the principle is general and similar software might be useful in other contexts such as patient studies. BioMed Central 2007-02-14 /pmc/articles/PMC1805442/ /pubmed/17300731 http://dx.doi.org/10.1186/1751-0473-2-1 Text en Copyright © 2007 Viswanath et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Viswanath, Lalitha
Lu, Yue
Fox, George E
Genome display tool: visualizing features in complex data sets
title Genome display tool: visualizing features in complex data sets
title_full Genome display tool: visualizing features in complex data sets
title_fullStr Genome display tool: visualizing features in complex data sets
title_full_unstemmed Genome display tool: visualizing features in complex data sets
title_short Genome display tool: visualizing features in complex data sets
title_sort genome display tool: visualizing features in complex data sets
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1805442/
https://www.ncbi.nlm.nih.gov/pubmed/17300731
http://dx.doi.org/10.1186/1751-0473-2-1
work_keys_str_mv AT viswanathlalitha genomedisplaytoolvisualizingfeaturesincomplexdatasets
AT luyue genomedisplaytoolvisualizingfeaturesincomplexdatasets
AT foxgeorgee genomedisplaytoolvisualizingfeaturesincomplexdatasets