Cargando…
Evaluation of threshold selection methods for adaptive kernel density estimation in disease mapping
BACKGROUND: Maps of disease rates produced without careful consideration of the underlying population distribution may be unreliable due to the well-known small numbers problem. Smoothing methods such as Kernel Density Estimation (KDE) are employed to control the population basis of spatial support...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5938815/ https://www.ncbi.nlm.nih.gov/pubmed/29739415 http://dx.doi.org/10.1186/s12942-018-0129-9 |
_version_ | 1783320852197015552 |
---|---|
author | Ruckthongsook, Warangkana Tiwari, Chetan Oppong, Joseph R. Natesan, Prathiba |
author_facet | Ruckthongsook, Warangkana Tiwari, Chetan Oppong, Joseph R. Natesan, Prathiba |
author_sort | Ruckthongsook, Warangkana |
collection | PubMed |
description | BACKGROUND: Maps of disease rates produced without careful consideration of the underlying population distribution may be unreliable due to the well-known small numbers problem. Smoothing methods such as Kernel Density Estimation (KDE) are employed to control the population basis of spatial support used to calculate each disease rate. The degree of smoothing is controlled by a user-defined parameter (bandwidth or threshold) which influences the resolution of the disease map and the reliability of the computed rates. Methods for automatically selecting a smoothing parameter such as normal scale, plug-in, and smoothed cross validation bandwidth selectors have been proposed for use with non-spatial data, but their relative utilities remain unknown. This study assesses the relative performance of these methods in terms of resolution and reliability for disease mapping. RESULTS: Using a simulated dataset of heart disease mortality among males aged 35 years and older in Texas, we assess methods for automatically selecting a smoothing parameter. Our results show that while all parameter choices accurately estimate the overall state rates, they vary in terms of the degree of spatial resolution. Further, parameter choices resulting in desirable characteristics for one sub group of the population (e.g., a specific age-group) may not necessarily be appropriate for other groups. CONCLUSION: We show that the appropriate threshold value depends on the characteristics of the data, and that bandwidth selector algorithms can be used to guide such decisions about mapping parameters. An unguided choice may produce maps that distort the balance of resolution and statistical reliability. |
format | Online Article Text |
id | pubmed-5938815 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-59388152018-05-11 Evaluation of threshold selection methods for adaptive kernel density estimation in disease mapping Ruckthongsook, Warangkana Tiwari, Chetan Oppong, Joseph R. Natesan, Prathiba Int J Health Geogr Methodology BACKGROUND: Maps of disease rates produced without careful consideration of the underlying population distribution may be unreliable due to the well-known small numbers problem. Smoothing methods such as Kernel Density Estimation (KDE) are employed to control the population basis of spatial support used to calculate each disease rate. The degree of smoothing is controlled by a user-defined parameter (bandwidth or threshold) which influences the resolution of the disease map and the reliability of the computed rates. Methods for automatically selecting a smoothing parameter such as normal scale, plug-in, and smoothed cross validation bandwidth selectors have been proposed for use with non-spatial data, but their relative utilities remain unknown. This study assesses the relative performance of these methods in terms of resolution and reliability for disease mapping. RESULTS: Using a simulated dataset of heart disease mortality among males aged 35 years and older in Texas, we assess methods for automatically selecting a smoothing parameter. Our results show that while all parameter choices accurately estimate the overall state rates, they vary in terms of the degree of spatial resolution. Further, parameter choices resulting in desirable characteristics for one sub group of the population (e.g., a specific age-group) may not necessarily be appropriate for other groups. CONCLUSION: We show that the appropriate threshold value depends on the characteristics of the data, and that bandwidth selector algorithms can be used to guide such decisions about mapping parameters. An unguided choice may produce maps that distort the balance of resolution and statistical reliability. BioMed Central 2018-05-08 /pmc/articles/PMC5938815/ /pubmed/29739415 http://dx.doi.org/10.1186/s12942-018-0129-9 Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Ruckthongsook, Warangkana Tiwari, Chetan Oppong, Joseph R. Natesan, Prathiba Evaluation of threshold selection methods for adaptive kernel density estimation in disease mapping |
title | Evaluation of threshold selection methods for adaptive kernel density estimation in disease mapping |
title_full | Evaluation of threshold selection methods for adaptive kernel density estimation in disease mapping |
title_fullStr | Evaluation of threshold selection methods for adaptive kernel density estimation in disease mapping |
title_full_unstemmed | Evaluation of threshold selection methods for adaptive kernel density estimation in disease mapping |
title_short | Evaluation of threshold selection methods for adaptive kernel density estimation in disease mapping |
title_sort | evaluation of threshold selection methods for adaptive kernel density estimation in disease mapping |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5938815/ https://www.ncbi.nlm.nih.gov/pubmed/29739415 http://dx.doi.org/10.1186/s12942-018-0129-9 |
work_keys_str_mv | AT ruckthongsookwarangkana evaluationofthresholdselectionmethodsforadaptivekerneldensityestimationindiseasemapping AT tiwarichetan evaluationofthresholdselectionmethodsforadaptivekerneldensityestimationindiseasemapping AT oppongjosephr evaluationofthresholdselectionmethodsforadaptivekerneldensityestimationindiseasemapping AT natesanprathiba evaluationofthresholdselectionmethodsforadaptivekerneldensityestimationindiseasemapping |