Cargando…

Estimating the Health-Related Quality of Life of Twitter Users Using Semantic Processing

Social media presents a rich opportunity to gather health information with limited intervention through the analysis of completely unstructured and unlabeled microposts. We sought to estimate the health-related quality of life (HRQOL) of Twitter users using automated semantic processing methods. We...

Descripción completa

Detalles Bibliográficos
Autores principales: Sarma, Karthik V., Spiegel, Brennan M. R., Reid, Mark W., Chen, Shawn, Merchant, Raina M., Seltzer, Emily, Arnold, Corey W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8081585/
https://www.ncbi.nlm.nih.gov/pubmed/31438088
http://dx.doi.org/10.3233/SHTI190388
_version_ 1783685672686583808
author Sarma, Karthik V.
Spiegel, Brennan M. R.
Reid, Mark W.
Chen, Shawn
Merchant, Raina M.
Seltzer, Emily
Arnold, Corey W.
author_facet Sarma, Karthik V.
Spiegel, Brennan M. R.
Reid, Mark W.
Chen, Shawn
Merchant, Raina M.
Seltzer, Emily
Arnold, Corey W.
author_sort Sarma, Karthik V.
collection PubMed
description Social media presents a rich opportunity to gather health information with limited intervention through the analysis of completely unstructured and unlabeled microposts. We sought to estimate the health-related quality of life (HRQOL) of Twitter users using automated semantic processing methods. We collected tweets from 878 Twitter users recruited through online solicitation and in-person contact with patients. All participants completed the four-item Centers for Disease Control Healthy Days Questionnaire at the time of enrollment and 30 days later to measure “ground truth” HRQOL. We used a combination of document frequency analysis, sentiment analysis, topic analysis, and concept mapping to extract features from tweets, which we then used to estimate dichotomized HRQOL (“high” vs. “low”) using logistic regression. Binary HRQOL status was estimated with moderate performance (AUC=0.64). This result indicates that free-range social media data only offers a window into HRQOL, but does not afford direct access to current health status.
format Online
Article
Text
id pubmed-8081585
institution National Center for Biotechnology Information
language English
publishDate 2019
record_format MEDLINE/PubMed
spelling pubmed-80815852021-04-28 Estimating the Health-Related Quality of Life of Twitter Users Using Semantic Processing Sarma, Karthik V. Spiegel, Brennan M. R. Reid, Mark W. Chen, Shawn Merchant, Raina M. Seltzer, Emily Arnold, Corey W. Stud Health Technol Inform Article Social media presents a rich opportunity to gather health information with limited intervention through the analysis of completely unstructured and unlabeled microposts. We sought to estimate the health-related quality of life (HRQOL) of Twitter users using automated semantic processing methods. We collected tweets from 878 Twitter users recruited through online solicitation and in-person contact with patients. All participants completed the four-item Centers for Disease Control Healthy Days Questionnaire at the time of enrollment and 30 days later to measure “ground truth” HRQOL. We used a combination of document frequency analysis, sentiment analysis, topic analysis, and concept mapping to extract features from tweets, which we then used to estimate dichotomized HRQOL (“high” vs. “low”) using logistic regression. Binary HRQOL status was estimated with moderate performance (AUC=0.64). This result indicates that free-range social media data only offers a window into HRQOL, but does not afford direct access to current health status. 2019-08-21 /pmc/articles/PMC8081585/ /pubmed/31438088 http://dx.doi.org/10.3233/SHTI190388 Text en https://creativecommons.org/licenses/by/4.0/This article is published online with Open Access by IOS Press and distributed under the terms of the Creative Commons Attribution Non-Commercial License 4.0 (CC BY-NC 4.0).
spellingShingle Article
Sarma, Karthik V.
Spiegel, Brennan M. R.
Reid, Mark W.
Chen, Shawn
Merchant, Raina M.
Seltzer, Emily
Arnold, Corey W.
Estimating the Health-Related Quality of Life of Twitter Users Using Semantic Processing
title Estimating the Health-Related Quality of Life of Twitter Users Using Semantic Processing
title_full Estimating the Health-Related Quality of Life of Twitter Users Using Semantic Processing
title_fullStr Estimating the Health-Related Quality of Life of Twitter Users Using Semantic Processing
title_full_unstemmed Estimating the Health-Related Quality of Life of Twitter Users Using Semantic Processing
title_short Estimating the Health-Related Quality of Life of Twitter Users Using Semantic Processing
title_sort estimating the health-related quality of life of twitter users using semantic processing
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8081585/
https://www.ncbi.nlm.nih.gov/pubmed/31438088
http://dx.doi.org/10.3233/SHTI190388
work_keys_str_mv AT sarmakarthikv estimatingthehealthrelatedqualityoflifeoftwitterusersusingsemanticprocessing
AT spiegelbrennanmr estimatingthehealthrelatedqualityoflifeoftwitterusersusingsemanticprocessing
AT reidmarkw estimatingthehealthrelatedqualityoflifeoftwitterusersusingsemanticprocessing
AT chenshawn estimatingthehealthrelatedqualityoflifeoftwitterusersusingsemanticprocessing
AT merchantrainam estimatingthehealthrelatedqualityoflifeoftwitterusersusingsemanticprocessing
AT seltzeremily estimatingthehealthrelatedqualityoflifeoftwitterusersusingsemanticprocessing
AT arnoldcoreyw estimatingthehealthrelatedqualityoflifeoftwitterusersusingsemanticprocessing