Cargando…

University of Turku in the BioNLP'11 Shared Task

BACKGROUND: We present a system for extracting biomedical events (detailed descriptions of biomolecular interactions) from research articles, developed for the BioNLP'11 Shared Task. Our goal is to develop a system easily adaptable to different event schemes, following the theme of the BioNLP&#...

Descripción completa

Detalles Bibliográficos
Autores principales: Björne, Jari, Ginter, Filip, Salakoski, Tapio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3384251/
https://www.ncbi.nlm.nih.gov/pubmed/22759458
http://dx.doi.org/10.1186/1471-2105-13-S11-S4
_version_ 1782236683355291648
author Björne, Jari
Ginter, Filip
Salakoski, Tapio
author_facet Björne, Jari
Ginter, Filip
Salakoski, Tapio
author_sort Björne, Jari
collection PubMed
description BACKGROUND: We present a system for extracting biomedical events (detailed descriptions of biomolecular interactions) from research articles, developed for the BioNLP'11 Shared Task. Our goal is to develop a system easily adaptable to different event schemes, following the theme of the BioNLP'11 Shared Task: generalization, the extension of event extraction to varied biomedical domains. Our system extends our BioNLP'09 Shared Task winning Turku Event Extraction System, which uses support vector machines to first detect event-defining words, followed by detection of their relationships. RESULTS: Our current system successfully predicts events for every domain case introduced in the BioNLP'11 Shared Task, being the only system to participate in all eight tasks and all of their subtasks, with best performance in four tasks. Following the Shared Task, we improve the system on the Infectious Diseases task from 42.57% to 53.87% F-score, bringing performance into line with the similar GENIA Event Extraction and Epigenetics and Post-translational Modifications tasks. We evaluate the machine learning performance of the system by calculating learning curves for all tasks, detecting areas where additional annotated data could be used to improve performance. Finally, we evaluate the use of system output on external articles as additional training data in a form of self-training. CONCLUSIONS: We show that the updated Turku Event Extraction System can easily be adapted to all presently available event extraction targets, with competitive performance in most tasks. The scope of the performance gains between the 2009 and 2011 BioNLP Shared Tasks indicates event extraction is still a new field requiring more work. We provide several analyses of event extraction methods and performance, highlighting potential future directions for continued development.
format Online
Article
Text
id pubmed-3384251
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33842512012-06-28 University of Turku in the BioNLP'11 Shared Task Björne, Jari Ginter, Filip Salakoski, Tapio BMC Bioinformatics Proceedings BACKGROUND: We present a system for extracting biomedical events (detailed descriptions of biomolecular interactions) from research articles, developed for the BioNLP'11 Shared Task. Our goal is to develop a system easily adaptable to different event schemes, following the theme of the BioNLP'11 Shared Task: generalization, the extension of event extraction to varied biomedical domains. Our system extends our BioNLP'09 Shared Task winning Turku Event Extraction System, which uses support vector machines to first detect event-defining words, followed by detection of their relationships. RESULTS: Our current system successfully predicts events for every domain case introduced in the BioNLP'11 Shared Task, being the only system to participate in all eight tasks and all of their subtasks, with best performance in four tasks. Following the Shared Task, we improve the system on the Infectious Diseases task from 42.57% to 53.87% F-score, bringing performance into line with the similar GENIA Event Extraction and Epigenetics and Post-translational Modifications tasks. We evaluate the machine learning performance of the system by calculating learning curves for all tasks, detecting areas where additional annotated data could be used to improve performance. Finally, we evaluate the use of system output on external articles as additional training data in a form of self-training. CONCLUSIONS: We show that the updated Turku Event Extraction System can easily be adapted to all presently available event extraction targets, with competitive performance in most tasks. The scope of the performance gains between the 2009 and 2011 BioNLP Shared Tasks indicates event extraction is still a new field requiring more work. We provide several analyses of event extraction methods and performance, highlighting potential future directions for continued development. BioMed Central 2012-06-26 /pmc/articles/PMC3384251/ /pubmed/22759458 http://dx.doi.org/10.1186/1471-2105-13-S11-S4 Text en Copyright ©2012 Björne et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Björne, Jari
Ginter, Filip
Salakoski, Tapio
University of Turku in the BioNLP'11 Shared Task
title University of Turku in the BioNLP'11 Shared Task
title_full University of Turku in the BioNLP'11 Shared Task
title_fullStr University of Turku in the BioNLP'11 Shared Task
title_full_unstemmed University of Turku in the BioNLP'11 Shared Task
title_short University of Turku in the BioNLP'11 Shared Task
title_sort university of turku in the bionlp'11 shared task
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3384251/
https://www.ncbi.nlm.nih.gov/pubmed/22759458
http://dx.doi.org/10.1186/1471-2105-13-S11-S4
work_keys_str_mv AT bjornejari universityofturkuinthebionlp11sharedtask
AT ginterfilip universityofturkuinthebionlp11sharedtask
AT salakoskitapio universityofturkuinthebionlp11sharedtask