"Right Time, Right Place" Health Communication on Twitter: Value and Accuracy of Location Information

BACKGROUND: Twitter provides various types of location data, including exact Global Positioning System (GPS) coordinates, which could be used for infoveillance and infodemiology (ie, the study and monitoring of online health information), health communication, and interventions. Despite its potentia...

Bibliographic Details
Main authors: Burton, Scott H; Tanner, Kesler W; Giraud-Carrier, Christophe G; West, Joshua H; Barnes, Michael D
Format: Online Article Text
Language: English
Published: Gunther Eysenbach 2012
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3510712/
https://www.ncbi.nlm.nih.gov/pubmed/23154246
http://dx.doi.org/10.2196/jmir.2121
author Burton, Scott H
Tanner, Kesler W
Giraud-Carrier, Christophe G
West, Joshua H
Barnes, Michael D
author_sort Burton, Scott H
collection PubMed
description BACKGROUND: Twitter provides various types of location data, including exact Global Positioning System (GPS) coordinates, which could be used for infoveillance and infodemiology (ie, the study and monitoring of online health information), health communication, and interventions. Despite its potential, Twitter location information is not well understood or well documented, limiting its public health utility.
OBJECTIVE: The objective of this study was to document and describe the various types of location information that are available in Twitter and can be ascertained from Twitter users. This information is key to informing future research on the availability, usability, and limitations of such location data.
METHODS: Location data were gathered directly from Twitter using its application programming interface (API). The maximum number of tweets allowed by Twitter (1% of the total tweets) was gathered over 2 separate weeks in October and November 2011. The final dataset consisted of 23.8 million tweets from 9.5 million unique users. Frequencies for each of the location options were calculated to determine the prevalence of the various location data options by region of the world, time zone, and state within the United States. Data from the US Census Bureau were also compiled to determine population proportions in each state, and Pearson correlation coefficients were used to compare each state’s population with the number of Twitter users who enable the GPS location option.
RESULTS: The GPS location data could be ascertained for 2.02% of tweets and 2.70% of unique users. Using a simple text-matching approach, the user’s city and state could be determined for 17.13% of user profiles in the 4 continental US time zones. Agreement between GPS data and data from the text-matching approach was high (87.69%). Furthermore, there was a significant correlation between the number of Twitter users per state and the 2010 US Census state populations (r ≥ 0.97, P < .001).
CONCLUSIONS: Health researchers exploring ways to use Twitter data for disease surveillance should be aware that the majority of tweets are not currently associated with an identifiable geographic location. Location can be identified for approximately 4 times the number of tweets using a straightforward text-matching process compared to using the GPS location information available in Twitter. Given the strong correlation between the two data-gathering methods, future research may consider using more qualitative approaches with higher yields, such as text mining, to acquire information about Twitter users’ geographical location.
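The abstract names three computations without giving their details: text matching of profile locations to a city and state, agreement between text-matched and GPS-derived locations, and a Pearson correlation between per-state user counts and census populations. The Python sketch below illustrates one way each could look. It is not the authors' implementation: the lookup table `STATE_CODES`, the "City, State" pattern, and the function names `match_city_state`, `agreement_rate`, and `pearson_r` are all assumptions made for illustration, and the sample values in the usage note are invented.

```python
import math
import re

# Hypothetical lookup table (assumption): the record does not say which
# gazetteer or state list the authors matched against.
STATE_CODES = {
    "utah": "UT", "ut": "UT",
    "new york": "NY", "ny": "NY",
    "texas": "TX", "tx": "TX",
}

# Assumed pattern: a free-text profile location shaped like "City, State".
CITY_STATE = re.compile(r"\s*([A-Za-z .'-]+?)\s*,\s*([A-Za-z .]+?)\s*")


def match_city_state(profile_location):
    """Return (city, state_code) for a 'City, State' string, else None."""
    m = CITY_STATE.fullmatch(profile_location)
    if m is None:
        return None
    state = STATE_CODES.get(m.group(2).strip().lower())
    if state is None:
        return None
    return (m.group(1).strip().title(), state)


def agreement_rate(pairs):
    """Fraction of (text_match, gps_location) pairs that agree.

    Pairs whose text match is None are skipped, since there is nothing
    to compare for them.
    """
    comparable = [(t, g) for t, g in pairs if t is not None]
    if not comparable:
        return 0.0
    return sum(1 for t, g in comparable if t == g) / len(comparable)


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)
```

As a usage illustration with invented inputs, `match_city_state("provo , utah")` yields a city and state pair, while a string with no comma, or with a region missing from the lookup table, yields `None`. A production pipeline would more likely rely on a full gazetteer and a tested statistics library than on hand-rolled helpers like these.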
format Online
Article
Text
id pubmed-3510712
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Gunther Eysenbach
record_format MEDLINE/PubMed
spelling pubmed-3510712 2012-12-07 "Right Time, Right Place" Health Communication on Twitter: Value and Accuracy of Location Information Burton, Scott H Tanner, Kesler W Giraud-Carrier, Christophe G West, Joshua H Barnes, Michael D J Med Internet Res Original Paper Gunther Eysenbach 2012-11-15 /pmc/articles/PMC3510712/ /pubmed/23154246 http://dx.doi.org/10.2196/jmir.2121 Text en ©Scott H. Burton, Kesler W. Tanner, Christophe G. Giraud-Carrier, Joshua H. West, Michael D. Barnes. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 15.11.2012. http://creativecommons.org/licenses/by/2.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
title "Right Time, Right Place" Health Communication on Twitter: Value and Accuracy of Location Information
topic Original Paper