Cargando…

Unboxing cluster heatmaps

BACKGROUND: Cluster heatmaps are commonly used in biology and related fields to reveal hierarchical clusters in data matrices. This visualization technique has high data density and reveal clusters better than unordered heatmaps alone. However, cluster heatmaps have known issues making them both tim...

Descripción completa

Detalles Bibliográficos
Autores principales: Engle, Sophie, Whalen, Sean, Joshi, Alark, Pollard, Katherine S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5333167/
https://www.ncbi.nlm.nih.gov/pubmed/28251868
http://dx.doi.org/10.1186/s12859-016-1442-6
_version_ 1782511676273197056
author Engle, Sophie
Whalen, Sean
Joshi, Alark
Pollard, Katherine S.
author_facet Engle, Sophie
Whalen, Sean
Joshi, Alark
Pollard, Katherine S.
author_sort Engle, Sophie
collection PubMed
description BACKGROUND: Cluster heatmaps are commonly used in biology and related fields to reveal hierarchical clusters in data matrices. This visualization technique has high data density and reveal clusters better than unordered heatmaps alone. However, cluster heatmaps have known issues making them both time consuming to use and prone to error. We hypothesize that visualization techniques without the rigid grid constraint of cluster heatmaps will perform better at clustering-related tasks. RESULTS: We developed an approach to “unbox” the heatmap values and embed them directly in the hierarchical clustering results, allowing us to use standard hierarchical visualization techniques as alternatives to cluster heatmaps. We then tested our hypothesis by conducting a survey of 45 practitioners to determine how cluster heatmaps are used, prototyping alternatives to cluster heatmaps using pair analytics with a computational biologist, and evaluating those alternatives with hour-long interviews of 5 practitioners and an Amazon Mechanical Turk user study with approximately 200 participants. We found statistically significant performance differences for most clustering-related tasks, and in the number of perceived visual clusters. Visit git.io/vw0t3 for our results. CONCLUSIONS: The optimal technique varied by task. However, gapmaps were preferred by the interviewed practitioners and outperformed or performed as well as cluster heatmaps for clustering-related tasks. Gapmaps are similar to cluster heatmaps, but relax the heatmap grid constraints by introducing gaps between rows and/or columns that are not closely clustered. Based on these results, we recommend users adopt gapmaps as an alternative to cluster heatmaps.
format Online
Article
Text
id pubmed-5333167
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-53331672017-03-06 Unboxing cluster heatmaps Engle, Sophie Whalen, Sean Joshi, Alark Pollard, Katherine S. BMC Bioinformatics Methodology BACKGROUND: Cluster heatmaps are commonly used in biology and related fields to reveal hierarchical clusters in data matrices. This visualization technique has high data density and reveal clusters better than unordered heatmaps alone. However, cluster heatmaps have known issues making them both time consuming to use and prone to error. We hypothesize that visualization techniques without the rigid grid constraint of cluster heatmaps will perform better at clustering-related tasks. RESULTS: We developed an approach to “unbox” the heatmap values and embed them directly in the hierarchical clustering results, allowing us to use standard hierarchical visualization techniques as alternatives to cluster heatmaps. We then tested our hypothesis by conducting a survey of 45 practitioners to determine how cluster heatmaps are used, prototyping alternatives to cluster heatmaps using pair analytics with a computational biologist, and evaluating those alternatives with hour-long interviews of 5 practitioners and an Amazon Mechanical Turk user study with approximately 200 participants. We found statistically significant performance differences for most clustering-related tasks, and in the number of perceived visual clusters. Visit git.io/vw0t3 for our results. CONCLUSIONS: The optimal technique varied by task. However, gapmaps were preferred by the interviewed practitioners and outperformed or performed as well as cluster heatmaps for clustering-related tasks. Gapmaps are similar to cluster heatmaps, but relax the heatmap grid constraints by introducing gaps between rows and/or columns that are not closely clustered. Based on these results, we recommend users adopt gapmaps as an alternative to cluster heatmaps. BioMed Central 2017-02-15 /pmc/articles/PMC5333167/ /pubmed/28251868 http://dx.doi.org/10.1186/s12859-016-1442-6 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Engle, Sophie
Whalen, Sean
Joshi, Alark
Pollard, Katherine S.
Unboxing cluster heatmaps
title Unboxing cluster heatmaps
title_full Unboxing cluster heatmaps
title_fullStr Unboxing cluster heatmaps
title_full_unstemmed Unboxing cluster heatmaps
title_short Unboxing cluster heatmaps
title_sort unboxing cluster heatmaps
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5333167/
https://www.ncbi.nlm.nih.gov/pubmed/28251868
http://dx.doi.org/10.1186/s12859-016-1442-6
work_keys_str_mv AT englesophie unboxingclusterheatmaps
AT whalensean unboxingclusterheatmaps
AT joshialark unboxingclusterheatmaps
AT pollardkatherines unboxingclusterheatmaps