Cargando…
Visualizing statistical significance of disease clusters using cartograms
BACKGROUND: Health officials and epidemiological researchers often use maps of disease rates to identify potential disease clusters. Because these maps exaggerate the prominence of low-density districts and hide potential clusters in urban (high-density) areas, many researchers have used density-equ...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5433035/ https://www.ncbi.nlm.nih.gov/pubmed/28506288 http://dx.doi.org/10.1186/s12942-017-0093-9 |
_version_ | 1783236764277669888 |
---|---|
author | Kronenfeld, Barry J. Wong, David W. S. |
author_facet | Kronenfeld, Barry J. Wong, David W. S. |
author_sort | Kronenfeld, Barry J. |
collection | PubMed |
description | BACKGROUND: Health officials and epidemiological researchers often use maps of disease rates to identify potential disease clusters. Because these maps exaggerate the prominence of low-density districts and hide potential clusters in urban (high-density) areas, many researchers have used density-equalizing maps (cartograms) as a basis for epidemiological mapping. However, we do not have existing guidelines for visual assessment of statistical uncertainty. To address this shortcoming, we develop techniques for visual determination of statistical significance of clusters spanning one or more districts on a cartogram. We developed the techniques within a geovisual analytics framework that does not rely on automated significance testing, and can therefore facilitate visual analysis to detect clusters that automated techniques might miss. RESULTS: On a cartogram of the at-risk population, the statistical significance of a disease cluster is determinate from the rate, area and shape of the cluster under standard hypothesis testing scenarios. We develop formulae to determine, for a given rate, the area required for statistical significance of a priori and a posteriori designated regions under certain test assumptions. Uniquely, our approach enables dynamic inference of aggregate regions formed by combining individual districts. The method is implemented in interactive tools that provide choropleth mapping, automated legend construction and dynamic search tools to facilitate cluster detection and assessment of the validity of tested assumptions. A case study of leukemia incidence analysis in California demonstrates the ability to visually distinguish between statistically significant and insignificant regions. CONCLUSION: The proposed geovisual analytics approach enables intuitive visual assessment of statistical significance of arbitrarily defined regions on a cartogram. Our research prompts a broader discussion of the role of geovisual exploratory analyses in disease mapping and the appropriate framework for visually assessing the statistical significance of spatial clusters. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12942-017-0093-9) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5433035 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-54330352017-05-17 Visualizing statistical significance of disease clusters using cartograms Kronenfeld, Barry J. Wong, David W. S. Int J Health Geogr Methodology BACKGROUND: Health officials and epidemiological researchers often use maps of disease rates to identify potential disease clusters. Because these maps exaggerate the prominence of low-density districts and hide potential clusters in urban (high-density) areas, many researchers have used density-equalizing maps (cartograms) as a basis for epidemiological mapping. However, we do not have existing guidelines for visual assessment of statistical uncertainty. To address this shortcoming, we develop techniques for visual determination of statistical significance of clusters spanning one or more districts on a cartogram. We developed the techniques within a geovisual analytics framework that does not rely on automated significance testing, and can therefore facilitate visual analysis to detect clusters that automated techniques might miss. RESULTS: On a cartogram of the at-risk population, the statistical significance of a disease cluster is determinate from the rate, area and shape of the cluster under standard hypothesis testing scenarios. We develop formulae to determine, for a given rate, the area required for statistical significance of a priori and a posteriori designated regions under certain test assumptions. Uniquely, our approach enables dynamic inference of aggregate regions formed by combining individual districts. The method is implemented in interactive tools that provide choropleth mapping, automated legend construction and dynamic search tools to facilitate cluster detection and assessment of the validity of tested assumptions. A case study of leukemia incidence analysis in California demonstrates the ability to visually distinguish between statistically significant and insignificant regions. CONCLUSION: The proposed geovisual analytics approach enables intuitive visual assessment of statistical significance of arbitrarily defined regions on a cartogram. Our research prompts a broader discussion of the role of geovisual exploratory analyses in disease mapping and the appropriate framework for visually assessing the statistical significance of spatial clusters. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12942-017-0093-9) contains supplementary material, which is available to authorized users. BioMed Central 2017-05-15 /pmc/articles/PMC5433035/ /pubmed/28506288 http://dx.doi.org/10.1186/s12942-017-0093-9 Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Kronenfeld, Barry J. Wong, David W. S. Visualizing statistical significance of disease clusters using cartograms |
title | Visualizing statistical significance of disease clusters using cartograms |
title_full | Visualizing statistical significance of disease clusters using cartograms |
title_fullStr | Visualizing statistical significance of disease clusters using cartograms |
title_full_unstemmed | Visualizing statistical significance of disease clusters using cartograms |
title_short | Visualizing statistical significance of disease clusters using cartograms |
title_sort | visualizing statistical significance of disease clusters using cartograms |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5433035/ https://www.ncbi.nlm.nih.gov/pubmed/28506288 http://dx.doi.org/10.1186/s12942-017-0093-9 |
work_keys_str_mv | AT kronenfeldbarryj visualizingstatisticalsignificanceofdiseaseclustersusingcartograms AT wongdavidws visualizingstatisticalsignificanceofdiseaseclustersusingcartograms |