Cargando…
Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic
Spatial scan statistics are widely used in various fields. The performance of these statistics is influenced by parameters, such as maximum spatial cluster size, and can be improved by parameter selection using performance measures. Current performance measures are based on the presence of clusters...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4731069/ https://www.ncbi.nlm.nih.gov/pubmed/26820646 http://dx.doi.org/10.1371/journal.pone.0147918 |
_version_ | 1782412509873963008 |
---|---|
author | Ma, Yue Yin, Fei Zhang, Tao Zhou, Xiaohua Andrew Li, Xiaosong |
author_facet | Ma, Yue Yin, Fei Zhang, Tao Zhou, Xiaohua Andrew Li, Xiaosong |
author_sort | Ma, Yue |
collection | PubMed |
description | Spatial scan statistics are widely used in various fields. The performance of these statistics is influenced by parameters, such as maximum spatial cluster size, and can be improved by parameter selection using performance measures. Current performance measures are based on the presence of clusters and are thus inapplicable to data sets without known clusters. In this work, we propose a novel overall performance measure called maximum clustering set–proportion (MCS-P), which is based on the likelihood of the union of detected clusters and the applied dataset. MCS-P was compared with existing performance measures in a simulation study to select the maximum spatial cluster size. Results of other performance measures, such as sensitivity and misclassification, suggest that the spatial scan statistic achieves accurate results in most scenarios with the maximum spatial cluster sizes selected using MCS-P. Given that previously known clusters are not required in the proposed strategy, selection of the optimal maximum cluster size with MCS-P can improve the performance of the scan statistic in applications without identified clusters. |
format | Online Article Text |
id | pubmed-4731069 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-47310692016-02-04 Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic Ma, Yue Yin, Fei Zhang, Tao Zhou, Xiaohua Andrew Li, Xiaosong PLoS One Research Article Spatial scan statistics are widely used in various fields. The performance of these statistics is influenced by parameters, such as maximum spatial cluster size, and can be improved by parameter selection using performance measures. Current performance measures are based on the presence of clusters and are thus inapplicable to data sets without known clusters. In this work, we propose a novel overall performance measure called maximum clustering set–proportion (MCS-P), which is based on the likelihood of the union of detected clusters and the applied dataset. MCS-P was compared with existing performance measures in a simulation study to select the maximum spatial cluster size. Results of other performance measures, such as sensitivity and misclassification, suggest that the spatial scan statistic achieves accurate results in most scenarios with the maximum spatial cluster sizes selected using MCS-P. Given that previously known clusters are not required in the proposed strategy, selection of the optimal maximum cluster size with MCS-P can improve the performance of the scan statistic in applications without identified clusters. Public Library of Science 2016-01-28 /pmc/articles/PMC4731069/ /pubmed/26820646 http://dx.doi.org/10.1371/journal.pone.0147918 Text en © 2016 Ma et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Ma, Yue Yin, Fei Zhang, Tao Zhou, Xiaohua Andrew Li, Xiaosong Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic |
title | Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic |
title_full | Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic |
title_fullStr | Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic |
title_full_unstemmed | Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic |
title_short | Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic |
title_sort | selection of the maximum spatial cluster size of the spatial scan statistic by using the maximum clustering set-proportion statistic |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4731069/ https://www.ncbi.nlm.nih.gov/pubmed/26820646 http://dx.doi.org/10.1371/journal.pone.0147918 |
work_keys_str_mv | AT mayue selectionofthemaximumspatialclustersizeofthespatialscanstatisticbyusingthemaximumclusteringsetproportionstatistic AT yinfei selectionofthemaximumspatialclustersizeofthespatialscanstatisticbyusingthemaximumclusteringsetproportionstatistic AT zhangtao selectionofthemaximumspatialclustersizeofthespatialscanstatisticbyusingthemaximumclusteringsetproportionstatistic AT zhouxiaohuaandrew selectionofthemaximumspatialclustersizeofthespatialscanstatisticbyusingthemaximumclusteringsetproportionstatistic AT lixiaosong selectionofthemaximumspatialclustersizeofthespatialscanstatisticbyusingthemaximumclusteringsetproportionstatistic |