Cargando…

Managing distance and covariate information with point-based clustering

BACKGROUND: Geographic perspectives of disease and the human condition often involve point-based observations and questions of clustering or dispersion within a spatial context. These problems involve a finite set of point observations and are constrained by a larger, but finite, set of locations wh...

Descripción completa

Detalles Bibliográficos
Autores principales: Whigham, Peter A., de Graaf, Brandon, Srivastava, Rashmi, Glue, Paul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5009712/
https://www.ncbi.nlm.nih.gov/pubmed/27586862
http://dx.doi.org/10.1186/s12874-016-0218-z
_version_ 1782451566881538048
author Whigham, Peter A.
de Graaf, Brandon
Srivastava, Rashmi
Glue, Paul
author_facet Whigham, Peter A.
de Graaf, Brandon
Srivastava, Rashmi
Glue, Paul
author_sort Whigham, Peter A.
collection PubMed
description BACKGROUND: Geographic perspectives of disease and the human condition often involve point-based observations and questions of clustering or dispersion within a spatial context. These problems involve a finite set of point observations and are constrained by a larger, but finite, set of locations where the observations could occur. Developing a rigorous method for pattern analysis in this context requires handling spatial covariates, a method for constrained finite spatial clustering, and addressing bias in geographic distance measures. An approach, based on Ripley’s K and applied to the problem of clustering with deliberate self-harm (DSH), is presented. METHODS: Point-based Monte-Carlo simulation of Ripley’s K, accounting for socio-economic deprivation and sources of distance measurement bias, was developed to estimate clustering of DSH at a range of spatial scales. A rotated Minkowski L(1) distance metric allowed variation in physical distance and clustering to be assessed. Self-harm data was derived from an audit of 2 years’ emergency hospital presentations (n = 136) in a New Zealand town (population ~50,000). Study area was defined by residential (housing) land parcels representing a finite set of possible point addresses. RESULTS: Area-based deprivation was spatially correlated. Accounting for deprivation and distance bias showed evidence for clustering of DSH for spatial scales up to 500 m with a one-sided 95 % CI, suggesting that social contagion may be present for this urban cohort. CONCLUSIONS: Many problems involve finite locations in geographic space that require estimates of distance-based clustering at many scales. A Monte-Carlo approach to Ripley’s K, incorporating covariates and models for distance bias, are crucial when assessing health-related clustering. The case study showed that social network structure defined at the neighbourhood level may account for aspects of neighbourhood clustering of DSH. Accounting for covariate measures that exhibit spatial clustering, such as deprivation, are crucial when assessing point-based clustering.
format Online
Article
Text
id pubmed-5009712
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50097122016-09-03 Managing distance and covariate information with point-based clustering Whigham, Peter A. de Graaf, Brandon Srivastava, Rashmi Glue, Paul BMC Med Res Methodol Research Article BACKGROUND: Geographic perspectives of disease and the human condition often involve point-based observations and questions of clustering or dispersion within a spatial context. These problems involve a finite set of point observations and are constrained by a larger, but finite, set of locations where the observations could occur. Developing a rigorous method for pattern analysis in this context requires handling spatial covariates, a method for constrained finite spatial clustering, and addressing bias in geographic distance measures. An approach, based on Ripley’s K and applied to the problem of clustering with deliberate self-harm (DSH), is presented. METHODS: Point-based Monte-Carlo simulation of Ripley’s K, accounting for socio-economic deprivation and sources of distance measurement bias, was developed to estimate clustering of DSH at a range of spatial scales. A rotated Minkowski L(1) distance metric allowed variation in physical distance and clustering to be assessed. Self-harm data was derived from an audit of 2 years’ emergency hospital presentations (n = 136) in a New Zealand town (population ~50,000). Study area was defined by residential (housing) land parcels representing a finite set of possible point addresses. RESULTS: Area-based deprivation was spatially correlated. Accounting for deprivation and distance bias showed evidence for clustering of DSH for spatial scales up to 500 m with a one-sided 95 % CI, suggesting that social contagion may be present for this urban cohort. CONCLUSIONS: Many problems involve finite locations in geographic space that require estimates of distance-based clustering at many scales. A Monte-Carlo approach to Ripley’s K, incorporating covariates and models for distance bias, are crucial when assessing health-related clustering. The case study showed that social network structure defined at the neighbourhood level may account for aspects of neighbourhood clustering of DSH. Accounting for covariate measures that exhibit spatial clustering, such as deprivation, are crucial when assessing point-based clustering. BioMed Central 2016-09-01 /pmc/articles/PMC5009712/ /pubmed/27586862 http://dx.doi.org/10.1186/s12874-016-0218-z Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Whigham, Peter A.
de Graaf, Brandon
Srivastava, Rashmi
Glue, Paul
Managing distance and covariate information with point-based clustering
title Managing distance and covariate information with point-based clustering
title_full Managing distance and covariate information with point-based clustering
title_fullStr Managing distance and covariate information with point-based clustering
title_full_unstemmed Managing distance and covariate information with point-based clustering
title_short Managing distance and covariate information with point-based clustering
title_sort managing distance and covariate information with point-based clustering
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5009712/
https://www.ncbi.nlm.nih.gov/pubmed/27586862
http://dx.doi.org/10.1186/s12874-016-0218-z
work_keys_str_mv AT whighampetera managingdistanceandcovariateinformationwithpointbasedclustering
AT degraafbrandon managingdistanceandcovariateinformationwithpointbasedclustering
AT srivastavarashmi managingdistanceandcovariateinformationwithpointbasedclustering
AT gluepaul managingdistanceandcovariateinformationwithpointbasedclustering