Cargando…

HAMSTER: visualizing microarray experiments as a set of minimum spanning trees

BACKGROUND: Visualization tools allow researchers to obtain a global view of the interrelationships between the probes or experiments of a gene expression (e.g. microarray) data set. Some existing methods include hierarchical clustering and k-means. In recent years, others have proposed applying min...

Descripción completa

Detalles Bibliográficos
Autores principales: Wan, Raymond, Kiseleva, Larisa, Harada, Hajime, Mamitsuka, Hiroshi, Horton, Paul
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2784758/
https://www.ncbi.nlm.nih.gov/pubmed/19925686
http://dx.doi.org/10.1186/1751-0473-4-8
_version_ 1782174778378944512
author Wan, Raymond
Kiseleva, Larisa
Harada, Hajime
Mamitsuka, Hiroshi
Horton, Paul
author_facet Wan, Raymond
Kiseleva, Larisa
Harada, Hajime
Mamitsuka, Hiroshi
Horton, Paul
author_sort Wan, Raymond
collection PubMed
description BACKGROUND: Visualization tools allow researchers to obtain a global view of the interrelationships between the probes or experiments of a gene expression (e.g. microarray) data set. Some existing methods include hierarchical clustering and k-means. In recent years, others have proposed applying minimum spanning trees (MST) for microarray clustering. Although MST-based clustering is formally equivalent to the dendrograms produced by hierarchical clustering under certain conditions; visually they can be quite different. METHODS: HAMSTER (Helpful Abstraction using Minimum Spanning Trees for Expression Relations) is an open source system for generating a set of MSTs from the experiments of a microarray data set. While previous works have generated a single MST from a data set for data clustering, we recursively merge experiments and repeat this process to obtain a set of MSTs for data visualization. Depending on the parameters chosen, each tree is analogous to a snapshot of one step of the hierarchical clustering process. We scored and ranked these trees using one of three proposed schemes. HAMSTER is implemented in C++ and makes use of Graphviz for laying out each MST. RESULTS: We report on the running time of HAMSTER and demonstrate using data sets from the NCBI Gene Expression Omnibus (GEO) that the images created by HAMSTER offer insights that differ from the dendrograms of hierarchical clustering. In addition to the C++ program which is available as open source, we also provided a web-based version (HAMSTER(+)) which allows users to apply our system through a web browser without any computer programming knowledge. CONCLUSION: Researchers may find it helpful to include HAMSTER in their microarray analysis workflow as it can offer insights that differ from hierarchical clustering. We believe that HAMSTER would be useful for certain types of gradient data sets (e.g time-series data) and data that indicate relationships between cells/tissues. Both the source and the web server variant of HAMSTER are available from http://hamster.cbrc.jp/.
format Text
id pubmed-2784758
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27847582009-11-28 HAMSTER: visualizing microarray experiments as a set of minimum spanning trees Wan, Raymond Kiseleva, Larisa Harada, Hajime Mamitsuka, Hiroshi Horton, Paul Source Code Biol Med Methodology BACKGROUND: Visualization tools allow researchers to obtain a global view of the interrelationships between the probes or experiments of a gene expression (e.g. microarray) data set. Some existing methods include hierarchical clustering and k-means. In recent years, others have proposed applying minimum spanning trees (MST) for microarray clustering. Although MST-based clustering is formally equivalent to the dendrograms produced by hierarchical clustering under certain conditions; visually they can be quite different. METHODS: HAMSTER (Helpful Abstraction using Minimum Spanning Trees for Expression Relations) is an open source system for generating a set of MSTs from the experiments of a microarray data set. While previous works have generated a single MST from a data set for data clustering, we recursively merge experiments and repeat this process to obtain a set of MSTs for data visualization. Depending on the parameters chosen, each tree is analogous to a snapshot of one step of the hierarchical clustering process. We scored and ranked these trees using one of three proposed schemes. HAMSTER is implemented in C++ and makes use of Graphviz for laying out each MST. RESULTS: We report on the running time of HAMSTER and demonstrate using data sets from the NCBI Gene Expression Omnibus (GEO) that the images created by HAMSTER offer insights that differ from the dendrograms of hierarchical clustering. In addition to the C++ program which is available as open source, we also provided a web-based version (HAMSTER(+)) which allows users to apply our system through a web browser without any computer programming knowledge. CONCLUSION: Researchers may find it helpful to include HAMSTER in their microarray analysis workflow as it can offer insights that differ from hierarchical clustering. We believe that HAMSTER would be useful for certain types of gradient data sets (e.g time-series data) and data that indicate relationships between cells/tissues. Both the source and the web server variant of HAMSTER are available from http://hamster.cbrc.jp/. BioMed Central 2009-11-20 /pmc/articles/PMC2784758/ /pubmed/19925686 http://dx.doi.org/10.1186/1751-0473-4-8 Text en Copyright ©2009 Wan et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology
Wan, Raymond
Kiseleva, Larisa
Harada, Hajime
Mamitsuka, Hiroshi
Horton, Paul
HAMSTER: visualizing microarray experiments as a set of minimum spanning trees
title HAMSTER: visualizing microarray experiments as a set of minimum spanning trees
title_full HAMSTER: visualizing microarray experiments as a set of minimum spanning trees
title_fullStr HAMSTER: visualizing microarray experiments as a set of minimum spanning trees
title_full_unstemmed HAMSTER: visualizing microarray experiments as a set of minimum spanning trees
title_short HAMSTER: visualizing microarray experiments as a set of minimum spanning trees
title_sort hamster: visualizing microarray experiments as a set of minimum spanning trees
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2784758/
https://www.ncbi.nlm.nih.gov/pubmed/19925686
http://dx.doi.org/10.1186/1751-0473-4-8
work_keys_str_mv AT wanraymond hamstervisualizingmicroarrayexperimentsasasetofminimumspanningtrees
AT kiselevalarisa hamstervisualizingmicroarrayexperimentsasasetofminimumspanningtrees
AT haradahajime hamstervisualizingmicroarrayexperimentsasasetofminimumspanningtrees
AT mamitsukahiroshi hamstervisualizingmicroarrayexperimentsasasetofminimumspanningtrees
AT hortonpaul hamstervisualizingmicroarrayexperimentsasasetofminimumspanningtrees