Cargando…

Pulse of the Pandemic: Iterative Topic Filtering for Clinical Information Extraction from Social Media

The rapid evolution of the COVID-19 pandemic has underscored the need to quickly disseminate the latest clinical knowledge during a public-health emergency. One surprisingly effective platform for healthcare professionals (HCPs) to share knowledge and experiences from the front lines has been social...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Julia, Sivaraman, Venkatesh, Kumar, Dheekshita, Banda, Juan M., Sontag, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cornell University 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7885911/
https://www.ncbi.nlm.nih.gov/pubmed/33594339
_version_ 1783651690657873920
author Wu, Julia
Sivaraman, Venkatesh
Kumar, Dheekshita
Banda, Juan M.
Sontag, David
author_facet Wu, Julia
Sivaraman, Venkatesh
Kumar, Dheekshita
Banda, Juan M.
Sontag, David
author_sort Wu, Julia
collection PubMed
description The rapid evolution of the COVID-19 pandemic has underscored the need to quickly disseminate the latest clinical knowledge during a public-health emergency. One surprisingly effective platform for healthcare professionals (HCPs) to share knowledge and experiences from the front lines has been social media (for example, the “#medtwitter” community on Twitter). However, identifying clinically-relevant content in social media without manual labeling is a challenge because of the sheer volume of irrelevant data. We present an unsupervised, iterative approach to mine clinically relevant information from social media data, which begins by heuristically filtering for HCP-authored texts and incorporates topic modeling and concept extraction with MetaMap. This approach identifies granular topics and tweets with high clinical relevance from a set of about 52 million COVID-19-related tweets from January to mid-June 2020. We also show that because the technique does not require manual labeling, it can be used to identify emerging topics on a week-to-week basis. Our method can aid in future public-health emergencies by facilitating knowledge transfer among healthcare workers in a rapidly-changing information environment, and by providing an efficient and unsupervised way of highlighting potential areas for clinical research.
format Online
Article
Text
id pubmed-7885911
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Cornell University
record_format MEDLINE/PubMed
spelling pubmed-78859112021-02-17 Pulse of the Pandemic: Iterative Topic Filtering for Clinical Information Extraction from Social Media Wu, Julia Sivaraman, Venkatesh Kumar, Dheekshita Banda, Juan M. Sontag, David ArXiv Article The rapid evolution of the COVID-19 pandemic has underscored the need to quickly disseminate the latest clinical knowledge during a public-health emergency. One surprisingly effective platform for healthcare professionals (HCPs) to share knowledge and experiences from the front lines has been social media (for example, the “#medtwitter” community on Twitter). However, identifying clinically-relevant content in social media without manual labeling is a challenge because of the sheer volume of irrelevant data. We present an unsupervised, iterative approach to mine clinically relevant information from social media data, which begins by heuristically filtering for HCP-authored texts and incorporates topic modeling and concept extraction with MetaMap. This approach identifies granular topics and tweets with high clinical relevance from a set of about 52 million COVID-19-related tweets from January to mid-June 2020. We also show that because the technique does not require manual labeling, it can be used to identify emerging topics on a week-to-week basis. Our method can aid in future public-health emergencies by facilitating knowledge transfer among healthcare workers in a rapidly-changing information environment, and by providing an efficient and unsupervised way of highlighting potential areas for clinical research. Cornell University 2021-02-13 /pmc/articles/PMC7885911/ /pubmed/33594339 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Wu, Julia
Sivaraman, Venkatesh
Kumar, Dheekshita
Banda, Juan M.
Sontag, David
Pulse of the Pandemic: Iterative Topic Filtering for Clinical Information Extraction from Social Media
title Pulse of the Pandemic: Iterative Topic Filtering for Clinical Information Extraction from Social Media
title_full Pulse of the Pandemic: Iterative Topic Filtering for Clinical Information Extraction from Social Media
title_fullStr Pulse of the Pandemic: Iterative Topic Filtering for Clinical Information Extraction from Social Media
title_full_unstemmed Pulse of the Pandemic: Iterative Topic Filtering for Clinical Information Extraction from Social Media
title_short Pulse of the Pandemic: Iterative Topic Filtering for Clinical Information Extraction from Social Media
title_sort pulse of the pandemic: iterative topic filtering for clinical information extraction from social media
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7885911/
https://www.ncbi.nlm.nih.gov/pubmed/33594339
work_keys_str_mv AT wujulia pulseofthepandemiciterativetopicfilteringforclinicalinformationextractionfromsocialmedia
AT sivaramanvenkatesh pulseofthepandemiciterativetopicfilteringforclinicalinformationextractionfromsocialmedia
AT kumardheekshita pulseofthepandemiciterativetopicfilteringforclinicalinformationextractionfromsocialmedia
AT bandajuanm pulseofthepandemiciterativetopicfilteringforclinicalinformationextractionfromsocialmedia
AT sontagdavid pulseofthepandemiciterativetopicfilteringforclinicalinformationextractionfromsocialmedia