Cargando…

Reclaiming independence in spatial‐clustering datasets: A series of data‐driven spatial weights matrices

Most spatial models include a spatial weights matrix (W) derived from the first law of geography to adjust the spatial dependence to fulfill the independence assumption. In various fields such as epidemiological and environmental studies, the spatial dependence often shows clustering (or geographic...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Wei, Xiao, Xiong, Qian, Jian, Chen, Shiqi, Liao, Fang, Yin, Fei, Zhang, Tao, Li, Xiaosong, Ma, Yue
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley & Sons, Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9313839/
https://www.ncbi.nlm.nih.gov/pubmed/35347729
http://dx.doi.org/10.1002/sim.9395
_version_ 1784754172435890176
author Wang, Wei
Xiao, Xiong
Qian, Jian
Chen, Shiqi
Liao, Fang
Yin, Fei
Zhang, Tao
Li, Xiaosong
Ma, Yue
author_facet Wang, Wei
Xiao, Xiong
Qian, Jian
Chen, Shiqi
Liao, Fang
Yin, Fei
Zhang, Tao
Li, Xiaosong
Ma, Yue
author_sort Wang, Wei
collection PubMed
description Most spatial models include a spatial weights matrix (W) derived from the first law of geography to adjust the spatial dependence to fulfill the independence assumption. In various fields such as epidemiological and environmental studies, the spatial dependence often shows clustering (or geographic discontinuity) due to natural or social factors. In such cases, adjustment using the first‐law‐of‐geography‐based W might be inappropriate and leads to inaccuracy estimations and loss of statistical power. In this work, we propose a series of data‐driven Ws (DDWs) built following the spatial pattern identified by the scan statistic, which can be easily carried out using existing tools such as SaTScan software. The DDWs take both the clustering (or discontinuous) and the intuitive first‐law‐of‐geographic‐based spatial dependence into consideration. Aiming at two common purposes in epidemiology studies (ie, estimating the effect value of explanatory variable X and estimating the risk of each spatial unit in disease mapping), the common spatial autoregressive models and the Leroux‐prior‐based conditional autoregressive (CAR) models were selected to evaluate performance of DDWs, respectively. Both simulation and case studies show that our DDWs achieve considerably better performance than the classic W in datasets with clustering (or discontinuous) spatial dependence. Furthermore, the latest published density‐based spatial clustering models, aiming at dealing with such clustering (or discontinuity) spatial dependence in disease mapping, were also compared as references. The DDWs, incorporated into the CAR models, still show considerable advantage, especially in the datasets for common diseases.
format Online
Article
Text
id pubmed-9313839
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley & Sons, Inc.
record_format MEDLINE/PubMed
spelling pubmed-93138392022-07-30 Reclaiming independence in spatial‐clustering datasets: A series of data‐driven spatial weights matrices Wang, Wei Xiao, Xiong Qian, Jian Chen, Shiqi Liao, Fang Yin, Fei Zhang, Tao Li, Xiaosong Ma, Yue Stat Med Research Articles Most spatial models include a spatial weights matrix (W) derived from the first law of geography to adjust the spatial dependence to fulfill the independence assumption. In various fields such as epidemiological and environmental studies, the spatial dependence often shows clustering (or geographic discontinuity) due to natural or social factors. In such cases, adjustment using the first‐law‐of‐geography‐based W might be inappropriate and leads to inaccuracy estimations and loss of statistical power. In this work, we propose a series of data‐driven Ws (DDWs) built following the spatial pattern identified by the scan statistic, which can be easily carried out using existing tools such as SaTScan software. The DDWs take both the clustering (or discontinuous) and the intuitive first‐law‐of‐geographic‐based spatial dependence into consideration. Aiming at two common purposes in epidemiology studies (ie, estimating the effect value of explanatory variable X and estimating the risk of each spatial unit in disease mapping), the common spatial autoregressive models and the Leroux‐prior‐based conditional autoregressive (CAR) models were selected to evaluate performance of DDWs, respectively. Both simulation and case studies show that our DDWs achieve considerably better performance than the classic W in datasets with clustering (or discontinuous) spatial dependence. Furthermore, the latest published density‐based spatial clustering models, aiming at dealing with such clustering (or discontinuity) spatial dependence in disease mapping, were also compared as references. The DDWs, incorporated into the CAR models, still show considerable advantage, especially in the datasets for common diseases. John Wiley & Sons, Inc. 2022-03-28 2022-07-10 /pmc/articles/PMC9313839/ /pubmed/35347729 http://dx.doi.org/10.1002/sim.9395 Text en © 2022 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made.
spellingShingle Research Articles
Wang, Wei
Xiao, Xiong
Qian, Jian
Chen, Shiqi
Liao, Fang
Yin, Fei
Zhang, Tao
Li, Xiaosong
Ma, Yue
Reclaiming independence in spatial‐clustering datasets: A series of data‐driven spatial weights matrices
title Reclaiming independence in spatial‐clustering datasets: A series of data‐driven spatial weights matrices
title_full Reclaiming independence in spatial‐clustering datasets: A series of data‐driven spatial weights matrices
title_fullStr Reclaiming independence in spatial‐clustering datasets: A series of data‐driven spatial weights matrices
title_full_unstemmed Reclaiming independence in spatial‐clustering datasets: A series of data‐driven spatial weights matrices
title_short Reclaiming independence in spatial‐clustering datasets: A series of data‐driven spatial weights matrices
title_sort reclaiming independence in spatial‐clustering datasets: a series of data‐driven spatial weights matrices
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9313839/
https://www.ncbi.nlm.nih.gov/pubmed/35347729
http://dx.doi.org/10.1002/sim.9395
work_keys_str_mv AT wangwei reclaimingindependenceinspatialclusteringdatasetsaseriesofdatadrivenspatialweightsmatrices
AT xiaoxiong reclaimingindependenceinspatialclusteringdatasetsaseriesofdatadrivenspatialweightsmatrices
AT qianjian reclaimingindependenceinspatialclusteringdatasetsaseriesofdatadrivenspatialweightsmatrices
AT chenshiqi reclaimingindependenceinspatialclusteringdatasetsaseriesofdatadrivenspatialweightsmatrices
AT liaofang reclaimingindependenceinspatialclusteringdatasetsaseriesofdatadrivenspatialweightsmatrices
AT yinfei reclaimingindependenceinspatialclusteringdatasetsaseriesofdatadrivenspatialweightsmatrices
AT zhangtao reclaimingindependenceinspatialclusteringdatasetsaseriesofdatadrivenspatialweightsmatrices
AT lixiaosong reclaimingindependenceinspatialclusteringdatasetsaseriesofdatadrivenspatialweightsmatrices
AT mayue reclaimingindependenceinspatialclusteringdatasetsaseriesofdatadrivenspatialweightsmatrices