Cargando…

Can Twitter be used to predict county excessive alcohol consumption rates?

OBJECTIVES: The current study analyzes a large set of Twitter data from 1,384 US counties to determine whether excessive alcohol consumption rates can be predicted by the words being posted from each county. METHODS: Data from over 138 million county-level tweets were analyzed using predictive model...

Descripción completa

Detalles Bibliográficos
Autores principales: Curtis, Brenda, Giorgi, Salvatore, Buffone, Anneke E. K., Ungar, Lyle H., Ashford, Robert D., Hemmons, Jessie, Summers, Dan, Hamilton, Casey, Schwartz, H. Andrew
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5884504/
https://www.ncbi.nlm.nih.gov/pubmed/29617408
http://dx.doi.org/10.1371/journal.pone.0194290
_version_ 1783311835551760384
author Curtis, Brenda
Giorgi, Salvatore
Buffone, Anneke E. K.
Ungar, Lyle H.
Ashford, Robert D.
Hemmons, Jessie
Summers, Dan
Hamilton, Casey
Schwartz, H. Andrew
author_facet Curtis, Brenda
Giorgi, Salvatore
Buffone, Anneke E. K.
Ungar, Lyle H.
Ashford, Robert D.
Hemmons, Jessie
Summers, Dan
Hamilton, Casey
Schwartz, H. Andrew
author_sort Curtis, Brenda
collection PubMed
description OBJECTIVES: The current study analyzes a large set of Twitter data from 1,384 US counties to determine whether excessive alcohol consumption rates can be predicted by the words being posted from each county. METHODS: Data from over 138 million county-level tweets were analyzed using predictive modeling, differential language analysis, and mediating language analysis. RESULTS: Twitter language data captures cross-sectional patterns of excessive alcohol consumption beyond that of sociodemographic factors (e.g. age, gender, race, income, education), and can be used to accurately predict rates of excessive alcohol consumption. Additionally, mediation analysis found that Twitter topics (e.g. ‘ready gettin leave’) can explain much of the variance associated between socioeconomics and excessive alcohol consumption. CONCLUSIONS: Twitter data can be used to predict public health concerns such as excessive drinking. Using mediation analysis in conjunction with predictive modeling allows for a high portion of the variance associated with socioeconomic status to be explained.
format Online
Article
Text
id pubmed-5884504
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-58845042018-04-13 Can Twitter be used to predict county excessive alcohol consumption rates? Curtis, Brenda Giorgi, Salvatore Buffone, Anneke E. K. Ungar, Lyle H. Ashford, Robert D. Hemmons, Jessie Summers, Dan Hamilton, Casey Schwartz, H. Andrew PLoS One Research Article OBJECTIVES: The current study analyzes a large set of Twitter data from 1,384 US counties to determine whether excessive alcohol consumption rates can be predicted by the words being posted from each county. METHODS: Data from over 138 million county-level tweets were analyzed using predictive modeling, differential language analysis, and mediating language analysis. RESULTS: Twitter language data captures cross-sectional patterns of excessive alcohol consumption beyond that of sociodemographic factors (e.g. age, gender, race, income, education), and can be used to accurately predict rates of excessive alcohol consumption. Additionally, mediation analysis found that Twitter topics (e.g. ‘ready gettin leave’) can explain much of the variance associated between socioeconomics and excessive alcohol consumption. CONCLUSIONS: Twitter data can be used to predict public health concerns such as excessive drinking. Using mediation analysis in conjunction with predictive modeling allows for a high portion of the variance associated with socioeconomic status to be explained. Public Library of Science 2018-04-04 /pmc/articles/PMC5884504/ /pubmed/29617408 http://dx.doi.org/10.1371/journal.pone.0194290 Text en © 2018 Curtis et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Curtis, Brenda
Giorgi, Salvatore
Buffone, Anneke E. K.
Ungar, Lyle H.
Ashford, Robert D.
Hemmons, Jessie
Summers, Dan
Hamilton, Casey
Schwartz, H. Andrew
Can Twitter be used to predict county excessive alcohol consumption rates?
title Can Twitter be used to predict county excessive alcohol consumption rates?
title_full Can Twitter be used to predict county excessive alcohol consumption rates?
title_fullStr Can Twitter be used to predict county excessive alcohol consumption rates?
title_full_unstemmed Can Twitter be used to predict county excessive alcohol consumption rates?
title_short Can Twitter be used to predict county excessive alcohol consumption rates?
title_sort can twitter be used to predict county excessive alcohol consumption rates?
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5884504/
https://www.ncbi.nlm.nih.gov/pubmed/29617408
http://dx.doi.org/10.1371/journal.pone.0194290
work_keys_str_mv AT curtisbrenda cantwitterbeusedtopredictcountyexcessivealcoholconsumptionrates
AT giorgisalvatore cantwitterbeusedtopredictcountyexcessivealcoholconsumptionrates
AT buffoneannekeek cantwitterbeusedtopredictcountyexcessivealcoholconsumptionrates
AT ungarlyleh cantwitterbeusedtopredictcountyexcessivealcoholconsumptionrates
AT ashfordrobertd cantwitterbeusedtopredictcountyexcessivealcoholconsumptionrates
AT hemmonsjessie cantwitterbeusedtopredictcountyexcessivealcoholconsumptionrates
AT summersdan cantwitterbeusedtopredictcountyexcessivealcoholconsumptionrates
AT hamiltoncasey cantwitterbeusedtopredictcountyexcessivealcoholconsumptionrates
AT schwartzhandrew cantwitterbeusedtopredictcountyexcessivealcoholconsumptionrates