Cargando…

Abundant Topological Outliers in Social Media Data and Their Effect on Spatial Analysis

Twitter and related social media feeds have become valuable data sources to many fields of research. Numerous researchers have thereby used social media posts for spatial analysis, since many of them contain explicit geographic locations. However, despite its widespread use within applied research,...

Descripción completa

Detalles Bibliográficos
Autores principales: Westerholt, Rene, Steiger, Enrico, Resch, Bernd, Zipf, Alexander
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5017681/
https://www.ncbi.nlm.nih.gov/pubmed/27611199
http://dx.doi.org/10.1371/journal.pone.0162360
_version_ 1782452795982479360
author Westerholt, Rene
Steiger, Enrico
Resch, Bernd
Zipf, Alexander
author_facet Westerholt, Rene
Steiger, Enrico
Resch, Bernd
Zipf, Alexander
author_sort Westerholt, Rene
collection PubMed
description Twitter and related social media feeds have become valuable data sources to many fields of research. Numerous researchers have thereby used social media posts for spatial analysis, since many of them contain explicit geographic locations. However, despite its widespread use within applied research, a thorough understanding of the underlying spatial characteristics of these data is still lacking. In this paper, we investigate how topological outliers influence the outcomes of spatial analyses of social media data. These outliers appear when different users contribute heterogeneous information about different phenomena simultaneously from similar locations. As a consequence, various messages representing different spatial phenomena are captured closely to each other, and are at risk to be falsely related in a spatial analysis. Our results reveal indications for corresponding spurious effects when analyzing Twitter data. Further, we show how the outliers distort the range of outcomes of spatial analysis methods. This has significant influence on the power of spatial inferential techniques, and, more generally, on the validity and interpretability of spatial analysis results. We further investigate how the issues caused by topological outliers are composed in detail. We unveil that multiple disturbing effects are acting simultaneously and that these are related to the geographic scales of the involved overlapping patterns. Our results show that at some scale configurations, the disturbances added through overlap are more severe than at others. Further, their behavior turns into a volatile and almost chaotic fluctuation when the scales of the involved patterns become too different. Overall, our results highlight the critical importance of thoroughly considering the specific characteristics of social media data when analyzing them spatially.
format Online
Article
Text
id pubmed-5017681
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-50176812016-09-27 Abundant Topological Outliers in Social Media Data and Their Effect on Spatial Analysis Westerholt, Rene Steiger, Enrico Resch, Bernd Zipf, Alexander PLoS One Research Article Twitter and related social media feeds have become valuable data sources to many fields of research. Numerous researchers have thereby used social media posts for spatial analysis, since many of them contain explicit geographic locations. However, despite its widespread use within applied research, a thorough understanding of the underlying spatial characteristics of these data is still lacking. In this paper, we investigate how topological outliers influence the outcomes of spatial analyses of social media data. These outliers appear when different users contribute heterogeneous information about different phenomena simultaneously from similar locations. As a consequence, various messages representing different spatial phenomena are captured closely to each other, and are at risk to be falsely related in a spatial analysis. Our results reveal indications for corresponding spurious effects when analyzing Twitter data. Further, we show how the outliers distort the range of outcomes of spatial analysis methods. This has significant influence on the power of spatial inferential techniques, and, more generally, on the validity and interpretability of spatial analysis results. We further investigate how the issues caused by topological outliers are composed in detail. We unveil that multiple disturbing effects are acting simultaneously and that these are related to the geographic scales of the involved overlapping patterns. Our results show that at some scale configurations, the disturbances added through overlap are more severe than at others. Further, their behavior turns into a volatile and almost chaotic fluctuation when the scales of the involved patterns become too different. Overall, our results highlight the critical importance of thoroughly considering the specific characteristics of social media data when analyzing them spatially. Public Library of Science 2016-09-09 /pmc/articles/PMC5017681/ /pubmed/27611199 http://dx.doi.org/10.1371/journal.pone.0162360 Text en © 2016 Westerholt et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Westerholt, Rene
Steiger, Enrico
Resch, Bernd
Zipf, Alexander
Abundant Topological Outliers in Social Media Data and Their Effect on Spatial Analysis
title Abundant Topological Outliers in Social Media Data and Their Effect on Spatial Analysis
title_full Abundant Topological Outliers in Social Media Data and Their Effect on Spatial Analysis
title_fullStr Abundant Topological Outliers in Social Media Data and Their Effect on Spatial Analysis
title_full_unstemmed Abundant Topological Outliers in Social Media Data and Their Effect on Spatial Analysis
title_short Abundant Topological Outliers in Social Media Data and Their Effect on Spatial Analysis
title_sort abundant topological outliers in social media data and their effect on spatial analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5017681/
https://www.ncbi.nlm.nih.gov/pubmed/27611199
http://dx.doi.org/10.1371/journal.pone.0162360
work_keys_str_mv AT westerholtrene abundanttopologicaloutliersinsocialmediadataandtheireffectonspatialanalysis
AT steigerenrico abundanttopologicaloutliersinsocialmediadataandtheireffectonspatialanalysis
AT reschbernd abundanttopologicaloutliersinsocialmediadataandtheireffectonspatialanalysis
AT zipfalexander abundanttopologicaloutliersinsocialmediadataandtheireffectonspatialanalysis