Cargando…
Reclaiming independence in spatial‐clustering datasets: A series of data‐driven spatial weights matrices
Most spatial models include a spatial weights matrix (W) derived from the first law of geography to adjust the spatial dependence to fulfill the independence assumption. In various fields such as epidemiological and environmental studies, the spatial dependence often shows clustering (or geographic...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley & Sons, Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9313839/ https://www.ncbi.nlm.nih.gov/pubmed/35347729 http://dx.doi.org/10.1002/sim.9395 |
_version_ | 1784754172435890176 |
---|---|
author | Wang, Wei Xiao, Xiong Qian, Jian Chen, Shiqi Liao, Fang Yin, Fei Zhang, Tao Li, Xiaosong Ma, Yue |
author_facet | Wang, Wei Xiao, Xiong Qian, Jian Chen, Shiqi Liao, Fang Yin, Fei Zhang, Tao Li, Xiaosong Ma, Yue |
author_sort | Wang, Wei |
collection | PubMed |
description | Most spatial models include a spatial weights matrix (W) derived from the first law of geography to adjust the spatial dependence to fulfill the independence assumption. In various fields such as epidemiological and environmental studies, the spatial dependence often shows clustering (or geographic discontinuity) due to natural or social factors. In such cases, adjustment using the first‐law‐of‐geography‐based W might be inappropriate and leads to inaccuracy estimations and loss of statistical power. In this work, we propose a series of data‐driven Ws (DDWs) built following the spatial pattern identified by the scan statistic, which can be easily carried out using existing tools such as SaTScan software. The DDWs take both the clustering (or discontinuous) and the intuitive first‐law‐of‐geographic‐based spatial dependence into consideration. Aiming at two common purposes in epidemiology studies (ie, estimating the effect value of explanatory variable X and estimating the risk of each spatial unit in disease mapping), the common spatial autoregressive models and the Leroux‐prior‐based conditional autoregressive (CAR) models were selected to evaluate performance of DDWs, respectively. Both simulation and case studies show that our DDWs achieve considerably better performance than the classic W in datasets with clustering (or discontinuous) spatial dependence. Furthermore, the latest published density‐based spatial clustering models, aiming at dealing with such clustering (or discontinuity) spatial dependence in disease mapping, were also compared as references. The DDWs, incorporated into the CAR models, still show considerable advantage, especially in the datasets for common diseases. |
format | Online Article Text |
id | pubmed-9313839 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | John Wiley & Sons, Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-93138392022-07-30 Reclaiming independence in spatial‐clustering datasets: A series of data‐driven spatial weights matrices Wang, Wei Xiao, Xiong Qian, Jian Chen, Shiqi Liao, Fang Yin, Fei Zhang, Tao Li, Xiaosong Ma, Yue Stat Med Research Articles Most spatial models include a spatial weights matrix (W) derived from the first law of geography to adjust the spatial dependence to fulfill the independence assumption. In various fields such as epidemiological and environmental studies, the spatial dependence often shows clustering (or geographic discontinuity) due to natural or social factors. In such cases, adjustment using the first‐law‐of‐geography‐based W might be inappropriate and leads to inaccuracy estimations and loss of statistical power. In this work, we propose a series of data‐driven Ws (DDWs) built following the spatial pattern identified by the scan statistic, which can be easily carried out using existing tools such as SaTScan software. The DDWs take both the clustering (or discontinuous) and the intuitive first‐law‐of‐geographic‐based spatial dependence into consideration. Aiming at two common purposes in epidemiology studies (ie, estimating the effect value of explanatory variable X and estimating the risk of each spatial unit in disease mapping), the common spatial autoregressive models and the Leroux‐prior‐based conditional autoregressive (CAR) models were selected to evaluate performance of DDWs, respectively. Both simulation and case studies show that our DDWs achieve considerably better performance than the classic W in datasets with clustering (or discontinuous) spatial dependence. Furthermore, the latest published density‐based spatial clustering models, aiming at dealing with such clustering (or discontinuity) spatial dependence in disease mapping, were also compared as references. The DDWs, incorporated into the CAR models, still show considerable advantage, especially in the datasets for common diseases. John Wiley & Sons, Inc. 2022-03-28 2022-07-10 /pmc/articles/PMC9313839/ /pubmed/35347729 http://dx.doi.org/10.1002/sim.9395 Text en © 2022 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made. |
spellingShingle | Research Articles Wang, Wei Xiao, Xiong Qian, Jian Chen, Shiqi Liao, Fang Yin, Fei Zhang, Tao Li, Xiaosong Ma, Yue Reclaiming independence in spatial‐clustering datasets: A series of data‐driven spatial weights matrices |
title | Reclaiming independence in spatial‐clustering datasets: A series of data‐driven spatial weights matrices |
title_full | Reclaiming independence in spatial‐clustering datasets: A series of data‐driven spatial weights matrices |
title_fullStr | Reclaiming independence in spatial‐clustering datasets: A series of data‐driven spatial weights matrices |
title_full_unstemmed | Reclaiming independence in spatial‐clustering datasets: A series of data‐driven spatial weights matrices |
title_short | Reclaiming independence in spatial‐clustering datasets: A series of data‐driven spatial weights matrices |
title_sort | reclaiming independence in spatial‐clustering datasets: a series of data‐driven spatial weights matrices |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9313839/ https://www.ncbi.nlm.nih.gov/pubmed/35347729 http://dx.doi.org/10.1002/sim.9395 |
work_keys_str_mv | AT wangwei reclaimingindependenceinspatialclusteringdatasetsaseriesofdatadrivenspatialweightsmatrices AT xiaoxiong reclaimingindependenceinspatialclusteringdatasetsaseriesofdatadrivenspatialweightsmatrices AT qianjian reclaimingindependenceinspatialclusteringdatasetsaseriesofdatadrivenspatialweightsmatrices AT chenshiqi reclaimingindependenceinspatialclusteringdatasetsaseriesofdatadrivenspatialweightsmatrices AT liaofang reclaimingindependenceinspatialclusteringdatasetsaseriesofdatadrivenspatialweightsmatrices AT yinfei reclaimingindependenceinspatialclusteringdatasetsaseriesofdatadrivenspatialweightsmatrices AT zhangtao reclaimingindependenceinspatialclusteringdatasetsaseriesofdatadrivenspatialweightsmatrices AT lixiaosong reclaimingindependenceinspatialclusteringdatasetsaseriesofdatadrivenspatialweightsmatrices AT mayue reclaimingindependenceinspatialclusteringdatasetsaseriesofdatadrivenspatialweightsmatrices |