Cargando…

Overview of the 8(th) Social Media Mining for Health Applications (#SMM4H) Shared Tasks at the AMIA 2023 Annual Symposium

The aim of the Social Media Mining for Health Applications (#SMM4H) shared tasks is to take a community-driven approach to address the natural language processing and machine learning challenges inherent to utilizing social media data for health informatics. The eighth iteration of the #SMM4H shared...

Descripción completa

Detalles Bibliográficos
Autores principales: Klein, Ari Z., Banda, Juan M., Guo, Yuting, Schmidt, Ana Lucia, Xu, Dongfang, Amaro, Jesus Ivan Flores, Rodriguez-Esteban, Raul, Sarker, Abeed, Gonzalez-Hernandez, Graciela
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10659479/
https://www.ncbi.nlm.nih.gov/pubmed/37986776
http://dx.doi.org/10.1101/2023.11.06.23298168
_version_ 1785148326065209344
author Klein, Ari Z.
Banda, Juan M.
Guo, Yuting
Schmidt, Ana Lucia
Xu, Dongfang
Amaro, Jesus Ivan Flores
Rodriguez-Esteban, Raul
Sarker, Abeed
Gonzalez-Hernandez, Graciela
author_facet Klein, Ari Z.
Banda, Juan M.
Guo, Yuting
Schmidt, Ana Lucia
Xu, Dongfang
Amaro, Jesus Ivan Flores
Rodriguez-Esteban, Raul
Sarker, Abeed
Gonzalez-Hernandez, Graciela
author_sort Klein, Ari Z.
collection PubMed
description The aim of the Social Media Mining for Health Applications (#SMM4H) shared tasks is to take a community-driven approach to address the natural language processing and machine learning challenges inherent to utilizing social media data for health informatics. The eighth iteration of the #SMM4H shared tasks was hosted at the AMIA 2023 Annual Symposium and consisted of five tasks that represented various social media platforms (Twitter and Reddit), languages (English and Spanish), methods (binary classification, multi-class classification, extraction, and normalization), and topics (COVID-19, therapies, social anxiety disorder, and adverse drug events). In total, 29 teams registered, representing 18 countries. In this paper, we present the annotated corpora, a technical summary of the systems, and the performance results. In general, the top-performing systems used deep neural network architectures based on pre-trained transformer models. In particular, the top-performing systems for the classification tasks were based on single models that were pre-trained on social media corpora. To facilitate future work, the datasets—a total of 61,353 posts—will remain available by request, and the CodaLab sites will remain active for a post-evaluation phase.
format Online
Article
Text
id pubmed-10659479
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-106594792023-11-20 Overview of the 8(th) Social Media Mining for Health Applications (#SMM4H) Shared Tasks at the AMIA 2023 Annual Symposium Klein, Ari Z. Banda, Juan M. Guo, Yuting Schmidt, Ana Lucia Xu, Dongfang Amaro, Jesus Ivan Flores Rodriguez-Esteban, Raul Sarker, Abeed Gonzalez-Hernandez, Graciela medRxiv Article The aim of the Social Media Mining for Health Applications (#SMM4H) shared tasks is to take a community-driven approach to address the natural language processing and machine learning challenges inherent to utilizing social media data for health informatics. The eighth iteration of the #SMM4H shared tasks was hosted at the AMIA 2023 Annual Symposium and consisted of five tasks that represented various social media platforms (Twitter and Reddit), languages (English and Spanish), methods (binary classification, multi-class classification, extraction, and normalization), and topics (COVID-19, therapies, social anxiety disorder, and adverse drug events). In total, 29 teams registered, representing 18 countries. In this paper, we present the annotated corpora, a technical summary of the systems, and the performance results. In general, the top-performing systems used deep neural network architectures based on pre-trained transformer models. In particular, the top-performing systems for the classification tasks were based on single models that were pre-trained on social media corpora. To facilitate future work, the datasets—a total of 61,353 posts—will remain available by request, and the CodaLab sites will remain active for a post-evaluation phase. Cold Spring Harbor Laboratory 2023-11-08 /pmc/articles/PMC10659479/ /pubmed/37986776 http://dx.doi.org/10.1101/2023.11.06.23298168 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Klein, Ari Z.
Banda, Juan M.
Guo, Yuting
Schmidt, Ana Lucia
Xu, Dongfang
Amaro, Jesus Ivan Flores
Rodriguez-Esteban, Raul
Sarker, Abeed
Gonzalez-Hernandez, Graciela
Overview of the 8(th) Social Media Mining for Health Applications (#SMM4H) Shared Tasks at the AMIA 2023 Annual Symposium
title Overview of the 8(th) Social Media Mining for Health Applications (#SMM4H) Shared Tasks at the AMIA 2023 Annual Symposium
title_full Overview of the 8(th) Social Media Mining for Health Applications (#SMM4H) Shared Tasks at the AMIA 2023 Annual Symposium
title_fullStr Overview of the 8(th) Social Media Mining for Health Applications (#SMM4H) Shared Tasks at the AMIA 2023 Annual Symposium
title_full_unstemmed Overview of the 8(th) Social Media Mining for Health Applications (#SMM4H) Shared Tasks at the AMIA 2023 Annual Symposium
title_short Overview of the 8(th) Social Media Mining for Health Applications (#SMM4H) Shared Tasks at the AMIA 2023 Annual Symposium
title_sort overview of the 8(th) social media mining for health applications (#smm4h) shared tasks at the amia 2023 annual symposium
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10659479/
https://www.ncbi.nlm.nih.gov/pubmed/37986776
http://dx.doi.org/10.1101/2023.11.06.23298168
work_keys_str_mv AT kleinariz overviewofthe8thsocialmediaminingforhealthapplicationssmm4hsharedtasksattheamia2023annualsymposium
AT bandajuanm overviewofthe8thsocialmediaminingforhealthapplicationssmm4hsharedtasksattheamia2023annualsymposium
AT guoyuting overviewofthe8thsocialmediaminingforhealthapplicationssmm4hsharedtasksattheamia2023annualsymposium
AT schmidtanalucia overviewofthe8thsocialmediaminingforhealthapplicationssmm4hsharedtasksattheamia2023annualsymposium
AT xudongfang overviewofthe8thsocialmediaminingforhealthapplicationssmm4hsharedtasksattheamia2023annualsymposium
AT amarojesusivanflores overviewofthe8thsocialmediaminingforhealthapplicationssmm4hsharedtasksattheamia2023annualsymposium
AT rodriguezestebanraul overviewofthe8thsocialmediaminingforhealthapplicationssmm4hsharedtasksattheamia2023annualsymposium
AT sarkerabeed overviewofthe8thsocialmediaminingforhealthapplicationssmm4hsharedtasksattheamia2023annualsymposium
AT gonzalezhernandezgraciela overviewofthe8thsocialmediaminingforhealthapplicationssmm4hsharedtasksattheamia2023annualsymposium