Cargando…

Quantifying the scale effect in geospatial big data using semi-variograms

The scale effect is an important research topic in the field of geography. When aggregating individual-level data into areal units, encountering the scale problem is inevitable. This problem is more substantial when mining collective patterns from big geo-data due to the characteristics of extensive...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Lei, Gao, Yong, Zhu, Di, Yuan, Yihong, Liu, Yu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6855636/
https://www.ncbi.nlm.nih.gov/pubmed/31725781
http://dx.doi.org/10.1371/journal.pone.0225139
_version_ 1783470442504257536
author Chen, Lei
Gao, Yong
Zhu, Di
Yuan, Yihong
Liu, Yu
author_facet Chen, Lei
Gao, Yong
Zhu, Di
Yuan, Yihong
Liu, Yu
author_sort Chen, Lei
collection PubMed
description The scale effect is an important research topic in the field of geography. When aggregating individual-level data into areal units, encountering the scale problem is inevitable. This problem is more substantial when mining collective patterns from big geo-data due to the characteristics of extensive spatial data. Although multi-scale models were constructed to mitigate this issue, most studies still arbitrarily choose a single scale to extract spatial patterns. In this research, we introduce the nugget-sill ratio (NSR) derived from semi-variograms as an indicator to extract the optimal scale. We conducted two simulated experiments to demonstrate the feasibility of this method. Our results showed that the optimal scale is negatively correlated with spatial point density, but positively correlated with the degree of dispersion in a point pattern. We also applied the proposed method to a case study using Weibo check-in data from Beijing, Shanghai, Chengdu, and Wuhan. Our study provides a new perspective to measure the spatial heterogeneity of big geo-data and selects an optimal spatial scale for big data analytics.
format Online
Article
Text
id pubmed-6855636
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-68556362019-12-07 Quantifying the scale effect in geospatial big data using semi-variograms Chen, Lei Gao, Yong Zhu, Di Yuan, Yihong Liu, Yu PLoS One Research Article The scale effect is an important research topic in the field of geography. When aggregating individual-level data into areal units, encountering the scale problem is inevitable. This problem is more substantial when mining collective patterns from big geo-data due to the characteristics of extensive spatial data. Although multi-scale models were constructed to mitigate this issue, most studies still arbitrarily choose a single scale to extract spatial patterns. In this research, we introduce the nugget-sill ratio (NSR) derived from semi-variograms as an indicator to extract the optimal scale. We conducted two simulated experiments to demonstrate the feasibility of this method. Our results showed that the optimal scale is negatively correlated with spatial point density, but positively correlated with the degree of dispersion in a point pattern. We also applied the proposed method to a case study using Weibo check-in data from Beijing, Shanghai, Chengdu, and Wuhan. Our study provides a new perspective to measure the spatial heterogeneity of big geo-data and selects an optimal spatial scale for big data analytics. Public Library of Science 2019-11-14 /pmc/articles/PMC6855636/ /pubmed/31725781 http://dx.doi.org/10.1371/journal.pone.0225139 Text en © 2019 Chen et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Chen, Lei
Gao, Yong
Zhu, Di
Yuan, Yihong
Liu, Yu
Quantifying the scale effect in geospatial big data using semi-variograms
title Quantifying the scale effect in geospatial big data using semi-variograms
title_full Quantifying the scale effect in geospatial big data using semi-variograms
title_fullStr Quantifying the scale effect in geospatial big data using semi-variograms
title_full_unstemmed Quantifying the scale effect in geospatial big data using semi-variograms
title_short Quantifying the scale effect in geospatial big data using semi-variograms
title_sort quantifying the scale effect in geospatial big data using semi-variograms
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6855636/
https://www.ncbi.nlm.nih.gov/pubmed/31725781
http://dx.doi.org/10.1371/journal.pone.0225139
work_keys_str_mv AT chenlei quantifyingthescaleeffectingeospatialbigdatausingsemivariograms
AT gaoyong quantifyingthescaleeffectingeospatialbigdatausingsemivariograms
AT zhudi quantifyingthescaleeffectingeospatialbigdatausingsemivariograms
AT yuanyihong quantifyingthescaleeffectingeospatialbigdatausingsemivariograms
AT liuyu quantifyingthescaleeffectingeospatialbigdatausingsemivariograms