Cargando…

Scaling laws in geo-located Twitter data

Twitter has become an important platform for geo-spatial analyses, providing high-volume spatial data on a wide variety of social processes. Understanding the relationship between population density and Twitter activity is therefore of key importance. This study reports a systematic relationship bet...

Descripción completa

Detalles Bibliográficos
Autores principales: Arthur, Rudy, Williams, Hywel T. P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6655604/
https://www.ncbi.nlm.nih.gov/pubmed/31339901
http://dx.doi.org/10.1371/journal.pone.0218454
_version_ 1783438617853558784
author Arthur, Rudy
Williams, Hywel T. P.
author_facet Arthur, Rudy
Williams, Hywel T. P.
author_sort Arthur, Rudy
collection PubMed
description Twitter has become an important platform for geo-spatial analyses, providing high-volume spatial data on a wide variety of social processes. Understanding the relationship between population density and Twitter activity is therefore of key importance. This study reports a systematic relationship between population density and Twitter use. Number of tweets, number of users and population per unit area are related by power law functions with exponents greater than one. These relations are consistent with each other and hold across a range of spatial scales. This implies that population density can accurately predict Twitter activity, but importantly, it also implies that correct predictions are not given by a naive linear scaling analysis. The observed super-linearity has implications for any spatial analyses performed with Twitter data and is important for understanding the relationship between Twitter use and demographics. For example, the robustness of this relationship means that we can identify ‘anomalous’ geographic areas that deviate from the observed trend, identifying several towns with high/low usage relative to expectation; using the scaling relationship we are able to show that these anomalies are not caused by age structure, as has been previously proposed. Proper consideration of this scaling relationship will improve robustness in future geo-spatial studies using Twitter.
format Online
Article
Text
id pubmed-6655604
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-66556042019-08-07 Scaling laws in geo-located Twitter data Arthur, Rudy Williams, Hywel T. P. PLoS One Research Article Twitter has become an important platform for geo-spatial analyses, providing high-volume spatial data on a wide variety of social processes. Understanding the relationship between population density and Twitter activity is therefore of key importance. This study reports a systematic relationship between population density and Twitter use. Number of tweets, number of users and population per unit area are related by power law functions with exponents greater than one. These relations are consistent with each other and hold across a range of spatial scales. This implies that population density can accurately predict Twitter activity, but importantly, it also implies that correct predictions are not given by a naive linear scaling analysis. The observed super-linearity has implications for any spatial analyses performed with Twitter data and is important for understanding the relationship between Twitter use and demographics. For example, the robustness of this relationship means that we can identify ‘anomalous’ geographic areas that deviate from the observed trend, identifying several towns with high/low usage relative to expectation; using the scaling relationship we are able to show that these anomalies are not caused by age structure, as has been previously proposed. Proper consideration of this scaling relationship will improve robustness in future geo-spatial studies using Twitter. Public Library of Science 2019-07-24 /pmc/articles/PMC6655604/ /pubmed/31339901 http://dx.doi.org/10.1371/journal.pone.0218454 Text en © 2019 Arthur, Williams http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Arthur, Rudy
Williams, Hywel T. P.
Scaling laws in geo-located Twitter data
title Scaling laws in geo-located Twitter data
title_full Scaling laws in geo-located Twitter data
title_fullStr Scaling laws in geo-located Twitter data
title_full_unstemmed Scaling laws in geo-located Twitter data
title_short Scaling laws in geo-located Twitter data
title_sort scaling laws in geo-located twitter data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6655604/
https://www.ncbi.nlm.nih.gov/pubmed/31339901
http://dx.doi.org/10.1371/journal.pone.0218454
work_keys_str_mv AT arthurrudy scalinglawsingeolocatedtwitterdata
AT williamshyweltp scalinglawsingeolocatedtwitterdata