Cargando…
Online and Social Media Data As an Imperfect Continuous Panel Survey
There is a large body of research on utilizing online activity as a survey of political opinion to predict real world election outcomes. There is considerably less work, however, on using this data to understand topic-specific interest and opinion amongst the general population and specific demograp...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4711590/ https://www.ncbi.nlm.nih.gov/pubmed/26730933 http://dx.doi.org/10.1371/journal.pone.0145406 |
_version_ | 1782409954182823936 |
---|---|
author | Diaz, Fernando Gamon, Michael Hofman, Jake M. Kıcıman, Emre Rothschild, David |
author_facet | Diaz, Fernando Gamon, Michael Hofman, Jake M. Kıcıman, Emre Rothschild, David |
author_sort | Diaz, Fernando |
collection | PubMed |
description | There is a large body of research on utilizing online activity as a survey of political opinion to predict real world election outcomes. There is considerably less work, however, on using this data to understand topic-specific interest and opinion amongst the general population and specific demographic subgroups, as currently measured by relatively expensive surveys. Here we investigate this possibility by studying a full census of all Twitter activity during the 2012 election cycle along with the comprehensive search history of a large panel of Internet users during the same period, highlighting the challenges in interpreting online and social media activity as the results of a survey. As noted in existing work, the online population is a non-representative sample of the offline world (e.g., the U.S. voting population). We extend this work to show how demographic skew and user participation is non-stationary and difficult to predict over time. In addition, the nature of user contributions varies substantially around important events. Furthermore, we note subtle problems in mapping what people are sharing or consuming online to specific sentiment or opinion measures around a particular topic. We provide a framework, built around considering this data as an imperfect continuous panel survey, for addressing these issues so that meaningful insight about public interest and opinion can be reliably extracted from online and social media data. |
format | Online Article Text |
id | pubmed-4711590 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-47115902016-01-26 Online and Social Media Data As an Imperfect Continuous Panel Survey Diaz, Fernando Gamon, Michael Hofman, Jake M. Kıcıman, Emre Rothschild, David PLoS One Research Article There is a large body of research on utilizing online activity as a survey of political opinion to predict real world election outcomes. There is considerably less work, however, on using this data to understand topic-specific interest and opinion amongst the general population and specific demographic subgroups, as currently measured by relatively expensive surveys. Here we investigate this possibility by studying a full census of all Twitter activity during the 2012 election cycle along with the comprehensive search history of a large panel of Internet users during the same period, highlighting the challenges in interpreting online and social media activity as the results of a survey. As noted in existing work, the online population is a non-representative sample of the offline world (e.g., the U.S. voting population). We extend this work to show how demographic skew and user participation is non-stationary and difficult to predict over time. In addition, the nature of user contributions varies substantially around important events. Furthermore, we note subtle problems in mapping what people are sharing or consuming online to specific sentiment or opinion measures around a particular topic. We provide a framework, built around considering this data as an imperfect continuous panel survey, for addressing these issues so that meaningful insight about public interest and opinion can be reliably extracted from online and social media data. Public Library of Science 2016-01-05 /pmc/articles/PMC4711590/ /pubmed/26730933 http://dx.doi.org/10.1371/journal.pone.0145406 Text en © 2016 Diaz et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited |
spellingShingle | Research Article Diaz, Fernando Gamon, Michael Hofman, Jake M. Kıcıman, Emre Rothschild, David Online and Social Media Data As an Imperfect Continuous Panel Survey |
title | Online and Social Media Data As an Imperfect Continuous Panel Survey |
title_full | Online and Social Media Data As an Imperfect Continuous Panel Survey |
title_fullStr | Online and Social Media Data As an Imperfect Continuous Panel Survey |
title_full_unstemmed | Online and Social Media Data As an Imperfect Continuous Panel Survey |
title_short | Online and Social Media Data As an Imperfect Continuous Panel Survey |
title_sort | online and social media data as an imperfect continuous panel survey |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4711590/ https://www.ncbi.nlm.nih.gov/pubmed/26730933 http://dx.doi.org/10.1371/journal.pone.0145406 |
work_keys_str_mv | AT diazfernando onlineandsocialmediadataasanimperfectcontinuouspanelsurvey AT gamonmichael onlineandsocialmediadataasanimperfectcontinuouspanelsurvey AT hofmanjakem onlineandsocialmediadataasanimperfectcontinuouspanelsurvey AT kıcımanemre onlineandsocialmediadataasanimperfectcontinuouspanelsurvey AT rothschilddavid onlineandsocialmediadataasanimperfectcontinuouspanelsurvey |