Cargando…
Scaling laws in geo-located Twitter data
Twitter has become an important platform for geo-spatial analyses, providing high-volume spatial data on a wide variety of social processes. Understanding the relationship between population density and Twitter activity is therefore of key importance. This study reports a systematic relationship bet...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6655604/ https://www.ncbi.nlm.nih.gov/pubmed/31339901 http://dx.doi.org/10.1371/journal.pone.0218454 |
_version_ | 1783438617853558784 |
---|---|
author | Arthur, Rudy Williams, Hywel T. P. |
author_facet | Arthur, Rudy Williams, Hywel T. P. |
author_sort | Arthur, Rudy |
collection | PubMed |
description | Twitter has become an important platform for geo-spatial analyses, providing high-volume spatial data on a wide variety of social processes. Understanding the relationship between population density and Twitter activity is therefore of key importance. This study reports a systematic relationship between population density and Twitter use. Number of tweets, number of users and population per unit area are related by power law functions with exponents greater than one. These relations are consistent with each other and hold across a range of spatial scales. This implies that population density can accurately predict Twitter activity, but importantly, it also implies that correct predictions are not given by a naive linear scaling analysis. The observed super-linearity has implications for any spatial analyses performed with Twitter data and is important for understanding the relationship between Twitter use and demographics. For example, the robustness of this relationship means that we can identify ‘anomalous’ geographic areas that deviate from the observed trend, identifying several towns with high/low usage relative to expectation; using the scaling relationship we are able to show that these anomalies are not caused by age structure, as has been previously proposed. Proper consideration of this scaling relationship will improve robustness in future geo-spatial studies using Twitter. |
format | Online Article Text |
id | pubmed-6655604 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-66556042019-08-07 Scaling laws in geo-located Twitter data Arthur, Rudy Williams, Hywel T. P. PLoS One Research Article Twitter has become an important platform for geo-spatial analyses, providing high-volume spatial data on a wide variety of social processes. Understanding the relationship between population density and Twitter activity is therefore of key importance. This study reports a systematic relationship between population density and Twitter use. Number of tweets, number of users and population per unit area are related by power law functions with exponents greater than one. These relations are consistent with each other and hold across a range of spatial scales. This implies that population density can accurately predict Twitter activity, but importantly, it also implies that correct predictions are not given by a naive linear scaling analysis. The observed super-linearity has implications for any spatial analyses performed with Twitter data and is important for understanding the relationship between Twitter use and demographics. For example, the robustness of this relationship means that we can identify ‘anomalous’ geographic areas that deviate from the observed trend, identifying several towns with high/low usage relative to expectation; using the scaling relationship we are able to show that these anomalies are not caused by age structure, as has been previously proposed. Proper consideration of this scaling relationship will improve robustness in future geo-spatial studies using Twitter. Public Library of Science 2019-07-24 /pmc/articles/PMC6655604/ /pubmed/31339901 http://dx.doi.org/10.1371/journal.pone.0218454 Text en © 2019 Arthur, Williams http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Arthur, Rudy Williams, Hywel T. P. Scaling laws in geo-located Twitter data |
title | Scaling laws in geo-located Twitter data |
title_full | Scaling laws in geo-located Twitter data |
title_fullStr | Scaling laws in geo-located Twitter data |
title_full_unstemmed | Scaling laws in geo-located Twitter data |
title_short | Scaling laws in geo-located Twitter data |
title_sort | scaling laws in geo-located twitter data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6655604/ https://www.ncbi.nlm.nih.gov/pubmed/31339901 http://dx.doi.org/10.1371/journal.pone.0218454 |
work_keys_str_mv | AT arthurrudy scalinglawsingeolocatedtwitterdata AT williamshyweltp scalinglawsingeolocatedtwitterdata |