Cargando…

Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic

Spatial scan statistics are widely used in various fields. The performance of these statistics is influenced by parameters, such as maximum spatial cluster size, and can be improved by parameter selection using performance measures. Current performance measures are based on the presence of clusters...

Descripción completa

Detalles Bibliográficos
Autores principales: Ma, Yue, Yin, Fei, Zhang, Tao, Zhou, Xiaohua Andrew, Li, Xiaosong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4731069/
https://www.ncbi.nlm.nih.gov/pubmed/26820646
http://dx.doi.org/10.1371/journal.pone.0147918
_version_ 1782412509873963008
author Ma, Yue
Yin, Fei
Zhang, Tao
Zhou, Xiaohua Andrew
Li, Xiaosong
author_facet Ma, Yue
Yin, Fei
Zhang, Tao
Zhou, Xiaohua Andrew
Li, Xiaosong
author_sort Ma, Yue
collection PubMed
description Spatial scan statistics are widely used in various fields. The performance of these statistics is influenced by parameters, such as maximum spatial cluster size, and can be improved by parameter selection using performance measures. Current performance measures are based on the presence of clusters and are thus inapplicable to data sets without known clusters. In this work, we propose a novel overall performance measure called maximum clustering set–proportion (MCS-P), which is based on the likelihood of the union of detected clusters and the applied dataset. MCS-P was compared with existing performance measures in a simulation study to select the maximum spatial cluster size. Results of other performance measures, such as sensitivity and misclassification, suggest that the spatial scan statistic achieves accurate results in most scenarios with the maximum spatial cluster sizes selected using MCS-P. Given that previously known clusters are not required in the proposed strategy, selection of the optimal maximum cluster size with MCS-P can improve the performance of the scan statistic in applications without identified clusters.
format Online
Article
Text
id pubmed-4731069
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-47310692016-02-04 Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic Ma, Yue Yin, Fei Zhang, Tao Zhou, Xiaohua Andrew Li, Xiaosong PLoS One Research Article Spatial scan statistics are widely used in various fields. The performance of these statistics is influenced by parameters, such as maximum spatial cluster size, and can be improved by parameter selection using performance measures. Current performance measures are based on the presence of clusters and are thus inapplicable to data sets without known clusters. In this work, we propose a novel overall performance measure called maximum clustering set–proportion (MCS-P), which is based on the likelihood of the union of detected clusters and the applied dataset. MCS-P was compared with existing performance measures in a simulation study to select the maximum spatial cluster size. Results of other performance measures, such as sensitivity and misclassification, suggest that the spatial scan statistic achieves accurate results in most scenarios with the maximum spatial cluster sizes selected using MCS-P. Given that previously known clusters are not required in the proposed strategy, selection of the optimal maximum cluster size with MCS-P can improve the performance of the scan statistic in applications without identified clusters. Public Library of Science 2016-01-28 /pmc/articles/PMC4731069/ /pubmed/26820646 http://dx.doi.org/10.1371/journal.pone.0147918 Text en © 2016 Ma et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Ma, Yue
Yin, Fei
Zhang, Tao
Zhou, Xiaohua Andrew
Li, Xiaosong
Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic
title Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic
title_full Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic
title_fullStr Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic
title_full_unstemmed Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic
title_short Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic
title_sort selection of the maximum spatial cluster size of the spatial scan statistic by using the maximum clustering set-proportion statistic
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4731069/
https://www.ncbi.nlm.nih.gov/pubmed/26820646
http://dx.doi.org/10.1371/journal.pone.0147918
work_keys_str_mv AT mayue selectionofthemaximumspatialclustersizeofthespatialscanstatisticbyusingthemaximumclusteringsetproportionstatistic
AT yinfei selectionofthemaximumspatialclustersizeofthespatialscanstatisticbyusingthemaximumclusteringsetproportionstatistic
AT zhangtao selectionofthemaximumspatialclustersizeofthespatialscanstatisticbyusingthemaximumclusteringsetproportionstatistic
AT zhouxiaohuaandrew selectionofthemaximumspatialclustersizeofthespatialscanstatisticbyusingthemaximumclusteringsetproportionstatistic
AT lixiaosong selectionofthemaximumspatialclustersizeofthespatialscanstatisticbyusingthemaximumclusteringsetproportionstatistic