Cargando…
Automatic Extraction of Medication Mentions from Tweets—Overview of the BioCreative VII Shared Task 3 Competition
This study presents the outcomes of the shared task competition BioCreative VII (Task 3) focusing on the extraction of medication names from a Twitter user’s publicly available tweets (the user’s ‘timeline’). In general, detecting health-related tweets is notoriously challenging for natural language...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9896308/ https://www.ncbi.nlm.nih.gov/pubmed/36734300 http://dx.doi.org/10.1093/database/baac108 |
_version_ | 1784882034230951936 |
---|---|
author | Weissenbacher, Davy O’Connor, Karen Rawal, Siddharth Zhang, Yu Tsai, Richard Tzong-Han Miller, Timothy Xu, Dongfang Anderson, Carol Liu, Bo Han, Qing Zhang, Jinfeng Kulev, Igor Köprü, Berkay Rodriguez-Esteban, Raul Ozkirimli, Elif Ayach, Ammer Roller, Roland Piccolo, Stephen Han, Peijin Vydiswaran, V G Vinod Tekumalla, Ramya Banda, Juan M Bagherzadeh, Parsa Bergler, Sabine Silva, João F Almeida, Tiago Martinez, Paloma Rivera-Zavala, Renzo Wang, Chen-Kai Dai, Hong-Jie Alberto Robles Hernandez, Luis Gonzalez-Hernandez, Graciela |
author_facet | Weissenbacher, Davy O’Connor, Karen Rawal, Siddharth Zhang, Yu Tsai, Richard Tzong-Han Miller, Timothy Xu, Dongfang Anderson, Carol Liu, Bo Han, Qing Zhang, Jinfeng Kulev, Igor Köprü, Berkay Rodriguez-Esteban, Raul Ozkirimli, Elif Ayach, Ammer Roller, Roland Piccolo, Stephen Han, Peijin Vydiswaran, V G Vinod Tekumalla, Ramya Banda, Juan M Bagherzadeh, Parsa Bergler, Sabine Silva, João F Almeida, Tiago Martinez, Paloma Rivera-Zavala, Renzo Wang, Chen-Kai Dai, Hong-Jie Alberto Robles Hernandez, Luis Gonzalez-Hernandez, Graciela |
author_sort | Weissenbacher, Davy |
collection | PubMed |
description | This study presents the outcomes of the shared task competition BioCreative VII (Task 3) focusing on the extraction of medication names from a Twitter user’s publicly available tweets (the user’s ‘timeline’). In general, detecting health-related tweets is notoriously challenging for natural language processing tools. The main challenge, aside from the informality of the language used, is that people tweet about any and all topics, and most of their tweets are not related to health. Thus, finding those tweets in a user’s timeline that mention specific health-related concepts such as medications requires addressing extreme imbalance. Task 3 called for detecting tweets in a user’s timeline that mentions a medication name and, for each detected mention, extracting its span. The organizers made available a corpus consisting of 182 049 tweets publicly posted by 212 Twitter users with all medication mentions manually annotated. The corpus exhibits the natural distribution of positive tweets, with only 442 tweets (0.2%) mentioning a medication. This task was an opportunity for participants to evaluate methods that are robust to class imbalance beyond the simple lexical match. A total of 65 teams registered, and 16 teams submitted a system run. This study summarizes the corpus created by the organizers and the approaches taken by the participating teams for this challenge. The corpus is freely available at https://biocreative.bioinformatics.udel.edu/tasks/biocreative-vii/track-3/. The methods and the results of the competing systems are analyzed with a focus on the approaches taken for learning from class-imbalanced data. |
format | Online Article Text |
id | pubmed-9896308 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-98963082023-02-06 Automatic Extraction of Medication Mentions from Tweets—Overview of the BioCreative VII Shared Task 3 Competition Weissenbacher, Davy O’Connor, Karen Rawal, Siddharth Zhang, Yu Tsai, Richard Tzong-Han Miller, Timothy Xu, Dongfang Anderson, Carol Liu, Bo Han, Qing Zhang, Jinfeng Kulev, Igor Köprü, Berkay Rodriguez-Esteban, Raul Ozkirimli, Elif Ayach, Ammer Roller, Roland Piccolo, Stephen Han, Peijin Vydiswaran, V G Vinod Tekumalla, Ramya Banda, Juan M Bagherzadeh, Parsa Bergler, Sabine Silva, João F Almeida, Tiago Martinez, Paloma Rivera-Zavala, Renzo Wang, Chen-Kai Dai, Hong-Jie Alberto Robles Hernandez, Luis Gonzalez-Hernandez, Graciela Database (Oxford) Original Article This study presents the outcomes of the shared task competition BioCreative VII (Task 3) focusing on the extraction of medication names from a Twitter user’s publicly available tweets (the user’s ‘timeline’). In general, detecting health-related tweets is notoriously challenging for natural language processing tools. The main challenge, aside from the informality of the language used, is that people tweet about any and all topics, and most of their tweets are not related to health. Thus, finding those tweets in a user’s timeline that mention specific health-related concepts such as medications requires addressing extreme imbalance. Task 3 called for detecting tweets in a user’s timeline that mentions a medication name and, for each detected mention, extracting its span. The organizers made available a corpus consisting of 182 049 tweets publicly posted by 212 Twitter users with all medication mentions manually annotated. The corpus exhibits the natural distribution of positive tweets, with only 442 tweets (0.2%) mentioning a medication. This task was an opportunity for participants to evaluate methods that are robust to class imbalance beyond the simple lexical match. A total of 65 teams registered, and 16 teams submitted a system run. This study summarizes the corpus created by the organizers and the approaches taken by the participating teams for this challenge. The corpus is freely available at https://biocreative.bioinformatics.udel.edu/tasks/biocreative-vii/track-3/. The methods and the results of the competing systems are analyzed with a focus on the approaches taken for learning from class-imbalanced data. Oxford University Press 2023-02-03 /pmc/articles/PMC9896308/ /pubmed/36734300 http://dx.doi.org/10.1093/database/baac108 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Weissenbacher, Davy O’Connor, Karen Rawal, Siddharth Zhang, Yu Tsai, Richard Tzong-Han Miller, Timothy Xu, Dongfang Anderson, Carol Liu, Bo Han, Qing Zhang, Jinfeng Kulev, Igor Köprü, Berkay Rodriguez-Esteban, Raul Ozkirimli, Elif Ayach, Ammer Roller, Roland Piccolo, Stephen Han, Peijin Vydiswaran, V G Vinod Tekumalla, Ramya Banda, Juan M Bagherzadeh, Parsa Bergler, Sabine Silva, João F Almeida, Tiago Martinez, Paloma Rivera-Zavala, Renzo Wang, Chen-Kai Dai, Hong-Jie Alberto Robles Hernandez, Luis Gonzalez-Hernandez, Graciela Automatic Extraction of Medication Mentions from Tweets—Overview of the BioCreative VII Shared Task 3 Competition |
title | Automatic Extraction of Medication Mentions from Tweets—Overview of the BioCreative VII Shared Task 3 Competition |
title_full | Automatic Extraction of Medication Mentions from Tweets—Overview of the BioCreative VII Shared Task 3 Competition |
title_fullStr | Automatic Extraction of Medication Mentions from Tweets—Overview of the BioCreative VII Shared Task 3 Competition |
title_full_unstemmed | Automatic Extraction of Medication Mentions from Tweets—Overview of the BioCreative VII Shared Task 3 Competition |
title_short | Automatic Extraction of Medication Mentions from Tweets—Overview of the BioCreative VII Shared Task 3 Competition |
title_sort | automatic extraction of medication mentions from tweets—overview of the biocreative vii shared task 3 competition |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9896308/ https://www.ncbi.nlm.nih.gov/pubmed/36734300 http://dx.doi.org/10.1093/database/baac108 |
work_keys_str_mv | AT weissenbacherdavy automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT oconnorkaren automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT rawalsiddharth automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT zhangyu automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT tsairichardtzonghan automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT millertimothy automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT xudongfang automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT andersoncarol automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT liubo automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT hanqing automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT zhangjinfeng automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT kulevigor automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT kopruberkay automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT rodriguezestebanraul automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT ozkirimlielif automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT ayachammer automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT rollerroland automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT piccolostephen automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT hanpeijin automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT vydiswaranvgvinod automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT tekumallaramya automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT bandajuanm automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT bagherzadehparsa automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT berglersabine automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT silvajoaof automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT almeidatiago automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT martinezpaloma automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT riverazavalarenzo automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT wangchenkai automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT daihongjie automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT albertorobleshernandezluis automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition AT gonzalezhernandezgraciela automaticextractionofmedicationmentionsfromtweetsoverviewofthebiocreativeviisharedtask3competition |