Cargando…

Social Media as a Sensor of Air Quality and Public Response in China

BACKGROUND: Recent studies have demonstrated the utility of social media data sources for a wide range of public health goals, including disease surveillance, mental health trends, and health perceptions and sentiment. Most such research has focused on English-language social media for the task of d...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Shiliang, Paul, Michael J, Dredze, Mark
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications Inc. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4400579/
https://www.ncbi.nlm.nih.gov/pubmed/25831020
http://dx.doi.org/10.2196/jmir.3875
_version_ 1782367050335780864
author Wang, Shiliang
Paul, Michael J
Dredze, Mark
author_facet Wang, Shiliang
Paul, Michael J
Dredze, Mark
author_sort Wang, Shiliang
collection PubMed
description BACKGROUND: Recent studies have demonstrated the utility of social media data sources for a wide range of public health goals, including disease surveillance, mental health trends, and health perceptions and sentiment. Most such research has focused on English-language social media for the task of disease surveillance. OBJECTIVE: We investigated the value of Chinese social media for monitoring air quality trends and related public perceptions and response. The goal was to determine if this data is suitable for learning actionable information about pollution levels and public response. METHODS: We mined a collection of 93 million messages from Sina Weibo, China’s largest microblogging service. We experimented with different filters to identify messages relevant to air quality, based on keyword matching and topic modeling. We evaluated the reliability of the data filters by comparing message volume per city to air particle pollution rates obtained from the Chinese government for 74 cities. Additionally, we performed a qualitative study of the content of pollution-related messages by coding a sample of 170 messages for relevance to air quality, and whether the message included details such as a reactive behavior or a health concern. RESULTS: The volume of pollution-related messages is highly correlated with particle pollution levels, with Pearson correlation values up to .718 (n=74, P<.001). Our qualitative results found that 67.1% (114/170) of messages were relevant to air quality and of those, 78.9% (90/114) were a firsthand report. Of firsthand reports, 28% (32/90) indicated a reactive behavior and 19% (17/90) expressed a health concern. Additionally, 3 messages of 170 requested that action be taken to improve quality. CONCLUSIONS: We have found quantitatively that message volume in Sina Weibo is indicative of true particle pollution levels, and we have found qualitatively that messages contain rich details including perceptions, behaviors, and self-reported health effects. Social media data can augment existing air pollution surveillance data, especially perception and health-related data that traditionally requires expensive surveys or interviews.
format Online
Article
Text
id pubmed-4400579
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher JMIR Publications Inc.
record_format MEDLINE/PubMed
spelling pubmed-44005792015-04-28 Social Media as a Sensor of Air Quality and Public Response in China Wang, Shiliang Paul, Michael J Dredze, Mark J Med Internet Res Original Paper BACKGROUND: Recent studies have demonstrated the utility of social media data sources for a wide range of public health goals, including disease surveillance, mental health trends, and health perceptions and sentiment. Most such research has focused on English-language social media for the task of disease surveillance. OBJECTIVE: We investigated the value of Chinese social media for monitoring air quality trends and related public perceptions and response. The goal was to determine if this data is suitable for learning actionable information about pollution levels and public response. METHODS: We mined a collection of 93 million messages from Sina Weibo, China’s largest microblogging service. We experimented with different filters to identify messages relevant to air quality, based on keyword matching and topic modeling. We evaluated the reliability of the data filters by comparing message volume per city to air particle pollution rates obtained from the Chinese government for 74 cities. Additionally, we performed a qualitative study of the content of pollution-related messages by coding a sample of 170 messages for relevance to air quality, and whether the message included details such as a reactive behavior or a health concern. RESULTS: The volume of pollution-related messages is highly correlated with particle pollution levels, with Pearson correlation values up to .718 (n=74, P<.001). Our qualitative results found that 67.1% (114/170) of messages were relevant to air quality and of those, 78.9% (90/114) were a firsthand report. Of firsthand reports, 28% (32/90) indicated a reactive behavior and 19% (17/90) expressed a health concern. Additionally, 3 messages of 170 requested that action be taken to improve quality. CONCLUSIONS: We have found quantitatively that message volume in Sina Weibo is indicative of true particle pollution levels, and we have found qualitatively that messages contain rich details including perceptions, behaviors, and self-reported health effects. Social media data can augment existing air pollution surveillance data, especially perception and health-related data that traditionally requires expensive surveys or interviews. JMIR Publications Inc. 2015-03-26 /pmc/articles/PMC4400579/ /pubmed/25831020 http://dx.doi.org/10.2196/jmir.3875 Text en ©Shiliang Wang, Michael J Paul, Mark Dredze. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 26.03.2015. http://creativecommons.org/licenses/by/2.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Wang, Shiliang
Paul, Michael J
Dredze, Mark
Social Media as a Sensor of Air Quality and Public Response in China
title Social Media as a Sensor of Air Quality and Public Response in China
title_full Social Media as a Sensor of Air Quality and Public Response in China
title_fullStr Social Media as a Sensor of Air Quality and Public Response in China
title_full_unstemmed Social Media as a Sensor of Air Quality and Public Response in China
title_short Social Media as a Sensor of Air Quality and Public Response in China
title_sort social media as a sensor of air quality and public response in china
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4400579/
https://www.ncbi.nlm.nih.gov/pubmed/25831020
http://dx.doi.org/10.2196/jmir.3875
work_keys_str_mv AT wangshiliang socialmediaasasensorofairqualityandpublicresponseinchina
AT paulmichaelj socialmediaasasensorofairqualityandpublicresponseinchina
AT dredzemark socialmediaasasensorofairqualityandpublicresponseinchina