Cargando…

Automatic Extraction of Medication Mentions from Tweets—Overview of the BioCreative VII Shared Task 3 Competition

This study presents the outcomes of the shared task competition BioCreative VII (Task 3) focusing on the extraction of medication names from a Twitter user’s publicly available tweets (the user’s ‘timeline’). In general, detecting health-related tweets is notoriously challenging for natural language...

Descripción completa

Detalles Bibliográficos
Autores principales: Weissenbacher, Davy, O’Connor, Karen, Rawal, Siddharth, Zhang, Yu, Tsai, Richard Tzong-Han, Miller, Timothy, Xu, Dongfang, Anderson, Carol, Liu, Bo, Han, Qing, Zhang, Jinfeng, Kulev, Igor, Köprü, Berkay, Rodriguez-Esteban, Raul, Ozkirimli, Elif, Ayach, Ammer, Roller, Roland, Piccolo, Stephen, Han, Peijin, Vydiswaran, V G Vinod, Tekumalla, Ramya, Banda, Juan M, Bagherzadeh, Parsa, Bergler, Sabine, Silva, João F, Almeida, Tiago, Martinez, Paloma, Rivera-Zavala, Renzo, Wang, Chen-Kai, Dai, Hong-Jie, Alberto Robles Hernandez, Luis, Gonzalez-Hernandez, Graciela
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9896308/
https://www.ncbi.nlm.nih.gov/pubmed/36734300
http://dx.doi.org/10.1093/database/baac108
_version_ 1784882034230951936
author Weissenbacher, Davy
O’Connor, Karen
Rawal, Siddharth
Zhang, Yu
Tsai, Richard Tzong-Han
Miller, Timothy
Xu, Dongfang
Anderson, Carol
Liu, Bo
Han, Qing
Zhang, Jinfeng
Kulev, Igor
Köprü, Berkay
Rodriguez-Esteban, Raul
Ozkirimli, Elif
Ayach, Ammer
Roller, Roland
Piccolo, Stephen
Han, Peijin
Vydiswaran, V G Vinod
Tekumalla, Ramya
Banda, Juan M
Bagherzadeh, Parsa
Bergler, Sabine
Silva, João F
Almeida, Tiago
Martinez, Paloma
Rivera-Zavala, Renzo
Wang, Chen-Kai
Dai, Hong-Jie
Alberto Robles Hernandez, Luis
Gonzalez-Hernandez, Graciela
author_facet Weissenbacher, Davy
O’Connor, Karen
Rawal, Siddharth
Zhang, Yu
Tsai, Richard Tzong-Han
Miller, Timothy
Xu, Dongfang
Anderson, Carol
Liu, Bo
Han, Qing
Zhang, Jinfeng
Kulev, Igor
Köprü, Berkay
Rodriguez-Esteban, Raul
Ozkirimli, Elif
Ayach, Ammer
Roller, Roland
Piccolo, Stephen
Han, Peijin
Vydiswaran, V G Vinod
Tekumalla, Ramya
Banda, Juan M
Bagherzadeh, Parsa
Bergler, Sabine
Silva, João F
Almeida, Tiago
Martinez, Paloma
Rivera-Zavala, Renzo
Wang, Chen-Kai
Dai, Hong-Jie
Alberto Robles Hernandez, Luis
Gonzalez-Hernandez, Graciela
author_sort Weissenbacher, Davy
collection PubMed
description This study presents the outcomes of the shared task competition BioCreative VII (Task 3) focusing on the extraction of medication names from a Twitter user’s publicly available tweets (the user’s ‘timeline’). In general, detecting health-related tweets is notoriously challenging for natural language processing tools. The main challenge, aside from the informality of the language used, is that people tweet about any and all topics, and most of their tweets are not related to health. Thus, finding those tweets in a user’s timeline that mention specific health-related concepts such as medications requires addressing extreme imbalance. Task 3 called for detecting tweets in a user’s timeline that mentions a medication name and, for each detected mention, extracting its span. The organizers made available a corpus consisting of 182 049 tweets publicly posted by 212 Twitter users with all medication mentions manually annotated. The corpus exhibits the natural distribution of positive tweets, with only 442 tweets (0.2%) mentioning a medication. This task was an opportunity for participants to evaluate methods that are robust to class imbalance beyond the simple lexical match. A total of 65 teams registered, and 16 teams submitted a system run. This study summarizes the corpus created by the organizers and the approaches taken by the participating teams for this challenge. The corpus is freely available at https://biocreative.bioinformatics.udel.edu/tasks/biocreative-vii/track-3/. The methods and the results of the competing systems are analyzed with a focus on the approaches taken for learning from class-imbalanced data.
format Online
Article
Text
id pubmed-9896308
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98963082023-02-06 Automatic Extraction of Medication Mentions from Tweets—Overview of the BioCreative VII Shared Task 3 Competition Weissenbacher, Davy O’Connor, Karen Rawal, Siddharth Zhang, Yu Tsai, Richard Tzong-Han Miller, Timothy Xu, Dongfang Anderson, Carol Liu, Bo Han, Qing Zhang, Jinfeng Kulev, Igor Köprü, Berkay Rodriguez-Esteban, Raul Ozkirimli, Elif Ayach, Ammer Roller, Roland Piccolo, Stephen Han, Peijin Vydiswaran, V G Vinod Tekumalla, Ramya Banda, Juan M Bagherzadeh, Parsa Bergler, Sabine Silva, João F Almeida, Tiago Martinez, Paloma Rivera-Zavala, Renzo Wang, Chen-Kai Dai, Hong-Jie Alberto Robles Hernandez, Luis Gonzalez-Hernandez, Graciela Database (Oxford) Original Article This study presents the outcomes of the shared task competition BioCreative VII (Task 3) focusing on the extraction of medication names from a Twitter user’s publicly available tweets (the user’s ‘timeline’). In general, detecting health-related tweets is notoriously challenging for natural language processing tools. The main challenge, aside from the informality of the language used, is that people tweet about any and all topics, and most of their tweets are not related to health. Thus, finding those tweets in a user’s timeline that mention specific health-related concepts such as medications requires addressing extreme imbalance. Task 3 called for detecting tweets in a user’s timeline that mentions a medication name and, for each detected mention, extracting its span. The organizers made available a corpus consisting of 182 049 tweets publicly posted by 212 Twitter users with all medication mentions manually annotated. The corpus exhibits the natural distribution of positive tweets, with only 442 tweets (0.2%) mentioning a medication. This task was an opportunity for participants to evaluate methods that are robust to class imbalance beyond the simple lexical match. A total of 65 teams registered, and 16 teams submitted a system run. This study summarizes the corpus created by the organizers and the approaches taken by the participating teams for this challenge. The corpus is freely available at https://biocreative.bioinformatics.udel.edu/tasks/biocreative-vii/track-3/. The methods and the results of the competing systems are analyzed with a focus on the approaches taken for learning from class-imbalanced data. Oxford University Press 2023-02-03 /pmc/articles/PMC9896308/ /pubmed/36734300 http://dx.doi.org/10.1093/database/baac108 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Weissenbacher, Davy
O’Connor, Karen
Rawal, Siddharth
Zhang, Yu
Tsai, Richard Tzong-Han
Miller, Timothy
Xu, Dongfang
Anderson, Carol
Liu, Bo
Han, Qing
Zhang, Jinfeng
Kulev, Igor
Köprü, Berkay
Rodriguez-Esteban, Raul
Ozkirimli, Elif
Ayach, Ammer
Roller, Roland
Piccolo, Stephen
Han, Peijin
Vydiswaran, V G Vinod
Tekumalla, Ramya
Banda, Juan M
Bagherzadeh, Parsa
Bergler, Sabine
Silva, João F
Almeida, Tiago
Martinez, Paloma
Rivera-Zavala, Renzo
Wang, Chen-Kai
Dai, Hong-Jie
Alberto Robles Hernandez, Luis
Gonzalez-Hernandez, Graciela
Automatic Extraction of Medication Mentions from Tweets—Overview of the BioCreative VII Shared Task 3 Competition
title Automatic Extraction of Medication Mentions from Tweets—Overview of the BioCreative VII Shared Task 3 Competition
title_full Automatic Extraction of Medication Mentions from Tweets—Overview of the BioCreative VII Shared Task 3 Competition
title_fullStr Automatic Extraction of Medication Mentions from Tweets—Overview of the BioCreative VII Shared Task 3 Competition
title_full_unstemmed Automatic Extraction of Medication Mentions from Tweets—Overview of the BioCreative VII Shared Task 3 Competition
title_short Automatic Extraction of Medication Mentions from Tweets—Overview of the BioCreative VII Shared Task 3 Competition
title_sort automatic extraction of medication mentions from tweets—overview of the biocreative vii shared task 3 competition
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9896308/
https://www.ncbi.nlm.nih.gov/pubmed/36734300
http://dx.doi.org/10.1093/database/baac108
work_keys_str_mv AT weissenbacherdavy automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT oconnorkaren automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT rawalsiddharth automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT zhangyu automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT tsairichardtzonghan automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT millertimothy automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT xudongfang automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT andersoncarol automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT liubo automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT hanqing automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT zhangjinfeng automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT kulevigor automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT kopruberkay automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT rodriguezestebanraul automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT ozkirimlielif automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT ayachammer automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT rollerroland automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT piccolostephen automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT hanpeijin automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT vydiswaranvgvinod automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT tekumallaramya automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT bandajuanm automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT bagherzadehparsa automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT berglersabine automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT silvajoaof automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT almeidatiago automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT martinezpaloma automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT riverazavalarenzo automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT wangchenkai automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT daihongjie automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT albertorobleshernandezluis automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition
AT gonzalezhernandezgraciela automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition