Cargando…

Methods for Coding Tobacco-Related Twitter Data: A Systematic Review

BACKGROUND: As Twitter has grown in popularity to 313 million monthly active users, researchers have increasingly been using it as a data source for tobacco-related research. OBJECTIVE: The objective of this systematic review was to assess the methodological approaches of categorically coded tobacco...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lienemann, Brianna A, Unger, Jennifer B, Cruz, Tess Boley, Chu, Kar-Hai
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	JMIR Publications 2017
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5392207/ https://www.ncbi.nlm.nih.gov/pubmed/28363883 http://dx.doi.org/10.2196/jmir.7022

_version_	1783229401043828736
author	Lienemann, Brianna A Unger, Jennifer B Cruz, Tess Boley Chu, Kar-Hai
author_facet	Lienemann, Brianna A Unger, Jennifer B Cruz, Tess Boley Chu, Kar-Hai
author_sort	Lienemann, Brianna A
collection	PubMed
description	BACKGROUND: As Twitter has grown in popularity to 313 million monthly active users, researchers have increasingly been using it as a data source for tobacco-related research. OBJECTIVE: The objective of this systematic review was to assess the methodological approaches of categorically coded tobacco Twitter data and make recommendations for future studies. METHODS: Data sources included PsycINFO, Web of Science, PubMed, ABI/INFORM, Communication Source, and Tobacco Regulatory Science. Searches were limited to peer-reviewed journals and conference proceedings in English from January 2006 to July 2016. The initial search identified 274 articles using a Twitter keyword and a tobacco keyword. One coder reviewed all abstracts and identified 27 articles that met the following inclusion criteria: (1) original research, (2) focused on tobacco or a tobacco product, (3) analyzed Twitter data, and (4) coded Twitter data categorically. One coder extracted data collection and coding methods. RESULTS: E-cigarettes were the most common type of Twitter data analyzed, followed by specific tobacco campaigns. The most prevalent data sources were Gnip and Twitter’s Streaming application programming interface (API). The primary methods of coding were hand-coding and machine learning. The studies predominantly coded for relevance, sentiment, theme, user or account, and location of user. CONCLUSIONS: Standards for data collection and coding should be developed to be able to more easily compare and replicate tobacco-related Twitter results. Additional recommendations include the following: sample Twitter’s databases multiple times, make a distinction between message attitude and emotional tone for sentiment, code images and URLs, and analyze user profiles. Being relatively novel and widely used among adolescents and black and Hispanic individuals, Twitter could provide a rich source of tobacco surveillance data among vulnerable populations.
format	Online Article Text
id	pubmed-5392207
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-53922072017-05-04 Methods for Coding Tobacco-Related Twitter Data: A Systematic Review Lienemann, Brianna A Unger, Jennifer B Cruz, Tess Boley Chu, Kar-Hai J Med Internet Res Original Paper BACKGROUND: As Twitter has grown in popularity to 313 million monthly active users, researchers have increasingly been using it as a data source for tobacco-related research. OBJECTIVE: The objective of this systematic review was to assess the methodological approaches of categorically coded tobacco Twitter data and make recommendations for future studies. METHODS: Data sources included PsycINFO, Web of Science, PubMed, ABI/INFORM, Communication Source, and Tobacco Regulatory Science. Searches were limited to peer-reviewed journals and conference proceedings in English from January 2006 to July 2016. The initial search identified 274 articles using a Twitter keyword and a tobacco keyword. One coder reviewed all abstracts and identified 27 articles that met the following inclusion criteria: (1) original research, (2) focused on tobacco or a tobacco product, (3) analyzed Twitter data, and (4) coded Twitter data categorically. One coder extracted data collection and coding methods. RESULTS: E-cigarettes were the most common type of Twitter data analyzed, followed by specific tobacco campaigns. The most prevalent data sources were Gnip and Twitter’s Streaming application programming interface (API). The primary methods of coding were hand-coding and machine learning. The studies predominantly coded for relevance, sentiment, theme, user or account, and location of user. CONCLUSIONS: Standards for data collection and coding should be developed to be able to more easily compare and replicate tobacco-related Twitter results. Additional recommendations include the following: sample Twitter’s databases multiple times, make a distinction between message attitude and emotional tone for sentiment, code images and URLs, and analyze user profiles. Being relatively novel and widely used among adolescents and black and Hispanic individuals, Twitter could provide a rich source of tobacco surveillance data among vulnerable populations. JMIR Publications 2017-03-31 /pmc/articles/PMC5392207/ /pubmed/28363883 http://dx.doi.org/10.2196/jmir.7022 Text en ©Brianna A Lienemann, Jennifer B Unger, Tess Boley Cruz, Kar-Hai Chu. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 31.03.2017. https://creativecommons.org/licenses/by/2.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/ (https://creativecommons.org/licenses/by/2.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle	Original Paper Lienemann, Brianna A Unger, Jennifer B Cruz, Tess Boley Chu, Kar-Hai Methods for Coding Tobacco-Related Twitter Data: A Systematic Review
title	Methods for Coding Tobacco-Related Twitter Data: A Systematic Review
title_full	Methods for Coding Tobacco-Related Twitter Data: A Systematic Review
title_fullStr	Methods for Coding Tobacco-Related Twitter Data: A Systematic Review
title_full_unstemmed	Methods for Coding Tobacco-Related Twitter Data: A Systematic Review
title_short	Methods for Coding Tobacco-Related Twitter Data: A Systematic Review
title_sort	methods for coding tobacco-related twitter data: a systematic review
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5392207/ https://www.ncbi.nlm.nih.gov/pubmed/28363883 http://dx.doi.org/10.2196/jmir.7022
work_keys_str_mv	AT lienemannbriannaa methodsforcodingtobaccorelatedtwitterdataasystematicreview AT ungerjenniferb methodsforcodingtobaccorelatedtwitterdataasystematicreview AT cruztessboley methodsforcodingtobaccorelatedtwitterdataasystematicreview AT chukarhai methodsforcodingtobaccorelatedtwitterdataasystematicreview

Methods for Coding Tobacco-Related Twitter Data: A Systematic Review

Ejemplares similares