Cargando…

Mining and Validating Social Media Data for COVID-19–Related Human Behaviors Between January and July 2020: Infodemiology Study

BACKGROUND: Health authorities can minimize the impact of an emergent infectious disease outbreak through effective and timely risk communication, which can build trust and adherence to subsequent behavioral messaging. Monitoring the psychological impacts of an outbreak, as well as public adherence...

Descripción completa

Detalles Bibliográficos
Autores principales: Daughton, Ashlynn R, Shelley, Courtney D, Barnard, Martha, Gerts, Dax, Watson Ross, Chrysm, Crooker, Isabel, Nadiga, Gopal, Mukundan, Nilesh, Vaquera Chavez, Nidia Yadira, Parikh, Nidhi, Pitts, Travis, Fairchild, Geoffrey
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8153035/
https://www.ncbi.nlm.nih.gov/pubmed/33882015
http://dx.doi.org/10.2196/27059
_version_ 1783698714955612160
author Daughton, Ashlynn R
Shelley, Courtney D
Barnard, Martha
Gerts, Dax
Watson Ross, Chrysm
Crooker, Isabel
Nadiga, Gopal
Mukundan, Nilesh
Vaquera Chavez, Nidia Yadira
Parikh, Nidhi
Pitts, Travis
Fairchild, Geoffrey
author_facet Daughton, Ashlynn R
Shelley, Courtney D
Barnard, Martha
Gerts, Dax
Watson Ross, Chrysm
Crooker, Isabel
Nadiga, Gopal
Mukundan, Nilesh
Vaquera Chavez, Nidia Yadira
Parikh, Nidhi
Pitts, Travis
Fairchild, Geoffrey
author_sort Daughton, Ashlynn R
collection PubMed
description BACKGROUND: Health authorities can minimize the impact of an emergent infectious disease outbreak through effective and timely risk communication, which can build trust and adherence to subsequent behavioral messaging. Monitoring the psychological impacts of an outbreak, as well as public adherence to such messaging, is also important for minimizing long-term effects of an outbreak. OBJECTIVE: We used social media data from Twitter to identify human behaviors relevant to COVID-19 transmission, as well as the perceived impacts of COVID-19 on individuals, as a first step toward real-time monitoring of public perceptions to inform public health communications. METHODS: We developed a coding schema for 6 categories and 11 subcategories, which included both a wide number of behaviors as well codes focused on the impacts of the pandemic (eg, economic and mental health impacts). We used this to develop training data and develop supervised learning classifiers for classes with sufficient labels. Classifiers that performed adequately were applied to our remaining corpus, and temporal and geospatial trends were assessed. We compared the classified patterns to ground truth mobility data and actual COVID-19 confirmed cases to assess the signal achieved here. RESULTS: We applied our labeling schema to approximately 7200 tweets. The worst-performing classifiers had F1 scores of only 0.18 to 0.28 when trying to identify tweets about monitoring symptoms and testing. Classifiers about social distancing, however, were much stronger, with F1 scores of 0.64 to 0.66. We applied the social distancing classifiers to over 228 million tweets. We showed temporal patterns consistent with real-world events, and we showed correlations of up to –0.5 between social distancing signals on Twitter and ground truth mobility throughout the United States. CONCLUSIONS: Behaviors discussed on Twitter are exceptionally varied. Twitter can provide useful information for parameterizing models that incorporate human behavior, as well as for informing public health communication strategies by describing awareness of and compliance with suggested behaviors.
format Online
Article
Text
id pubmed-8153035
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-81530352021-06-11 Mining and Validating Social Media Data for COVID-19–Related Human Behaviors Between January and July 2020: Infodemiology Study Daughton, Ashlynn R Shelley, Courtney D Barnard, Martha Gerts, Dax Watson Ross, Chrysm Crooker, Isabel Nadiga, Gopal Mukundan, Nilesh Vaquera Chavez, Nidia Yadira Parikh, Nidhi Pitts, Travis Fairchild, Geoffrey J Med Internet Res Original Paper BACKGROUND: Health authorities can minimize the impact of an emergent infectious disease outbreak through effective and timely risk communication, which can build trust and adherence to subsequent behavioral messaging. Monitoring the psychological impacts of an outbreak, as well as public adherence to such messaging, is also important for minimizing long-term effects of an outbreak. OBJECTIVE: We used social media data from Twitter to identify human behaviors relevant to COVID-19 transmission, as well as the perceived impacts of COVID-19 on individuals, as a first step toward real-time monitoring of public perceptions to inform public health communications. METHODS: We developed a coding schema for 6 categories and 11 subcategories, which included both a wide number of behaviors as well codes focused on the impacts of the pandemic (eg, economic and mental health impacts). We used this to develop training data and develop supervised learning classifiers for classes with sufficient labels. Classifiers that performed adequately were applied to our remaining corpus, and temporal and geospatial trends were assessed. We compared the classified patterns to ground truth mobility data and actual COVID-19 confirmed cases to assess the signal achieved here. RESULTS: We applied our labeling schema to approximately 7200 tweets. The worst-performing classifiers had F1 scores of only 0.18 to 0.28 when trying to identify tweets about monitoring symptoms and testing. Classifiers about social distancing, however, were much stronger, with F1 scores of 0.64 to 0.66. We applied the social distancing classifiers to over 228 million tweets. We showed temporal patterns consistent with real-world events, and we showed correlations of up to –0.5 between social distancing signals on Twitter and ground truth mobility throughout the United States. CONCLUSIONS: Behaviors discussed on Twitter are exceptionally varied. Twitter can provide useful information for parameterizing models that incorporate human behavior, as well as for informing public health communication strategies by describing awareness of and compliance with suggested behaviors. JMIR Publications 2021-05-25 /pmc/articles/PMC8153035/ /pubmed/33882015 http://dx.doi.org/10.2196/27059 Text en ©Ashlynn R Daughton, Courtney D Shelley, Martha Barnard, Dax Gerts, Chrysm Watson Ross, Isabel Crooker, Gopal Nadiga, Nilesh Mukundan, Nidia Yadira Vaquera Chavez, Nidhi Parikh, Travis Pitts, Geoffrey Fairchild. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 25.05.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Daughton, Ashlynn R
Shelley, Courtney D
Barnard, Martha
Gerts, Dax
Watson Ross, Chrysm
Crooker, Isabel
Nadiga, Gopal
Mukundan, Nilesh
Vaquera Chavez, Nidia Yadira
Parikh, Nidhi
Pitts, Travis
Fairchild, Geoffrey
Mining and Validating Social Media Data for COVID-19–Related Human Behaviors Between January and July 2020: Infodemiology Study
title Mining and Validating Social Media Data for COVID-19–Related Human Behaviors Between January and July 2020: Infodemiology Study
title_full Mining and Validating Social Media Data for COVID-19–Related Human Behaviors Between January and July 2020: Infodemiology Study
title_fullStr Mining and Validating Social Media Data for COVID-19–Related Human Behaviors Between January and July 2020: Infodemiology Study
title_full_unstemmed Mining and Validating Social Media Data for COVID-19–Related Human Behaviors Between January and July 2020: Infodemiology Study
title_short Mining and Validating Social Media Data for COVID-19–Related Human Behaviors Between January and July 2020: Infodemiology Study
title_sort mining and validating social media data for covid-19–related human behaviors between january and july 2020: infodemiology study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8153035/
https://www.ncbi.nlm.nih.gov/pubmed/33882015
http://dx.doi.org/10.2196/27059
work_keys_str_mv AT daughtonashlynnr miningandvalidatingsocialmediadataforcovid19relatedhumanbehaviorsbetweenjanuaryandjuly2020infodemiologystudy
AT shelleycourtneyd miningandvalidatingsocialmediadataforcovid19relatedhumanbehaviorsbetweenjanuaryandjuly2020infodemiologystudy
AT barnardmartha miningandvalidatingsocialmediadataforcovid19relatedhumanbehaviorsbetweenjanuaryandjuly2020infodemiologystudy
AT gertsdax miningandvalidatingsocialmediadataforcovid19relatedhumanbehaviorsbetweenjanuaryandjuly2020infodemiologystudy
AT watsonrosschrysm miningandvalidatingsocialmediadataforcovid19relatedhumanbehaviorsbetweenjanuaryandjuly2020infodemiologystudy
AT crookerisabel miningandvalidatingsocialmediadataforcovid19relatedhumanbehaviorsbetweenjanuaryandjuly2020infodemiologystudy
AT nadigagopal miningandvalidatingsocialmediadataforcovid19relatedhumanbehaviorsbetweenjanuaryandjuly2020infodemiologystudy
AT mukundannilesh miningandvalidatingsocialmediadataforcovid19relatedhumanbehaviorsbetweenjanuaryandjuly2020infodemiologystudy
AT vaquerachaveznidiayadira miningandvalidatingsocialmediadataforcovid19relatedhumanbehaviorsbetweenjanuaryandjuly2020infodemiologystudy
AT parikhnidhi miningandvalidatingsocialmediadataforcovid19relatedhumanbehaviorsbetweenjanuaryandjuly2020infodemiologystudy
AT pittstravis miningandvalidatingsocialmediadataforcovid19relatedhumanbehaviorsbetweenjanuaryandjuly2020infodemiologystudy
AT fairchildgeoffrey miningandvalidatingsocialmediadataforcovid19relatedhumanbehaviorsbetweenjanuaryandjuly2020infodemiologystudy