Cargando…
Application of the EVEX resource to event extraction and network construction: Shared Task entry and result analysis
BACKGROUND: Modern methods for mining biomolecular interactions from literature typically make predictions based solely on the immediate textual context, in effect a single sentence. No prior work has been published on extending this context to the information automatically gathered from the whole b...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4642107/ https://www.ncbi.nlm.nih.gov/pubmed/26551766 http://dx.doi.org/10.1186/1471-2105-16-S16-S3 |
_version_ | 1782400307379044352 |
---|---|
author | Hakala, Kai Van Landeghem, Sofie Salakoski, Tapio Van de Peer, Yves Ginter, Filip |
author_facet | Hakala, Kai Van Landeghem, Sofie Salakoski, Tapio Van de Peer, Yves Ginter, Filip |
author_sort | Hakala, Kai |
collection | PubMed |
description | BACKGROUND: Modern methods for mining biomolecular interactions from literature typically make predictions based solely on the immediate textual context, in effect a single sentence. No prior work has been published on extending this context to the information automatically gathered from the whole biomedical literature. Thus, our motivation for this study is to explore whether mutually supporting evidence, aggregated across several documents can be utilized to improve the performance of the state-of-the-art event extraction systems. In this paper, we describe our participation in the latest BioNLP Shared Task using the large-scale text mining resource EVEX. We participated in the Genia Event Extraction (GE) and Gene Regulation Network (GRN) tasks with two separate systems. In the GE task, we implemented a re-ranking approach to improve the precision of an existing event extraction system, incorporating features from the EVEX resource. In the GRN task, our system relied solely on the EVEX resource and utilized a rule-based conversion algorithm between the EVEX and GRN formats. RESULTS: In the GE task, our re-ranking approach led to a modest performance increase and resulted in the first rank of the official Shared Task results with 50.97% F-score. Additionally, in this paper we explore and evaluate the usage of distributed vector representations for this challenge. In the GRN task, we ranked fifth in the official results with a strict/relaxed SER score of 0.92/0.81 respectively. To try and improve upon these results, we have implemented a novel machine learning based conversion system and benchmarked its performance against the original rule-based system. CONCLUSIONS: For the GRN task, we were able to produce a gene regulatory network from the EVEX data, warranting the use of such generic large-scale text mining data in network biology settings. A detailed performance and error analysis provides more insight into the relatively low recall rates. In the GE task we demonstrate that both the re-ranking approach and the word vectors can provide slight performance improvement. A manual evaluation of the re-ranking results pinpoints some of the challenges faced in applying large-scale text mining knowledge to event extraction. |
format | Online Article Text |
id | pubmed-4642107 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-46421072015-11-19 Application of the EVEX resource to event extraction and network construction: Shared Task entry and result analysis Hakala, Kai Van Landeghem, Sofie Salakoski, Tapio Van de Peer, Yves Ginter, Filip BMC Bioinformatics Research BACKGROUND: Modern methods for mining biomolecular interactions from literature typically make predictions based solely on the immediate textual context, in effect a single sentence. No prior work has been published on extending this context to the information automatically gathered from the whole biomedical literature. Thus, our motivation for this study is to explore whether mutually supporting evidence, aggregated across several documents can be utilized to improve the performance of the state-of-the-art event extraction systems. In this paper, we describe our participation in the latest BioNLP Shared Task using the large-scale text mining resource EVEX. We participated in the Genia Event Extraction (GE) and Gene Regulation Network (GRN) tasks with two separate systems. In the GE task, we implemented a re-ranking approach to improve the precision of an existing event extraction system, incorporating features from the EVEX resource. In the GRN task, our system relied solely on the EVEX resource and utilized a rule-based conversion algorithm between the EVEX and GRN formats. RESULTS: In the GE task, our re-ranking approach led to a modest performance increase and resulted in the first rank of the official Shared Task results with 50.97% F-score. Additionally, in this paper we explore and evaluate the usage of distributed vector representations for this challenge. In the GRN task, we ranked fifth in the official results with a strict/relaxed SER score of 0.92/0.81 respectively. To try and improve upon these results, we have implemented a novel machine learning based conversion system and benchmarked its performance against the original rule-based system. CONCLUSIONS: For the GRN task, we were able to produce a gene regulatory network from the EVEX data, warranting the use of such generic large-scale text mining data in network biology settings. A detailed performance and error analysis provides more insight into the relatively low recall rates. In the GE task we demonstrate that both the re-ranking approach and the word vectors can provide slight performance improvement. A manual evaluation of the re-ranking results pinpoints some of the challenges faced in applying large-scale text mining knowledge to event extraction. BioMed Central 2015-10-30 /pmc/articles/PMC4642107/ /pubmed/26551766 http://dx.doi.org/10.1186/1471-2105-16-S16-S3 Text en Copyright © 2015 Hakala et al. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Hakala, Kai Van Landeghem, Sofie Salakoski, Tapio Van de Peer, Yves Ginter, Filip Application of the EVEX resource to event extraction and network construction: Shared Task entry and result analysis |
title | Application of the EVEX resource to event extraction and network construction: Shared Task entry and result analysis |
title_full | Application of the EVEX resource to event extraction and network construction: Shared Task entry and result analysis |
title_fullStr | Application of the EVEX resource to event extraction and network construction: Shared Task entry and result analysis |
title_full_unstemmed | Application of the EVEX resource to event extraction and network construction: Shared Task entry and result analysis |
title_short | Application of the EVEX resource to event extraction and network construction: Shared Task entry and result analysis |
title_sort | application of the evex resource to event extraction and network construction: shared task entry and result analysis |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4642107/ https://www.ncbi.nlm.nih.gov/pubmed/26551766 http://dx.doi.org/10.1186/1471-2105-16-S16-S3 |
work_keys_str_mv | AT hakalakai applicationoftheevexresourcetoeventextractionandnetworkconstructionsharedtaskentryandresultanalysis AT vanlandeghemsofie applicationoftheevexresourcetoeventextractionandnetworkconstructionsharedtaskentryandresultanalysis AT salakoskitapio applicationoftheevexresourcetoeventextractionandnetworkconstructionsharedtaskentryandresultanalysis AT vandepeeryves applicationoftheevexresourcetoeventextractionandnetworkconstructionsharedtaskentryandresultanalysis AT ginterfilip applicationoftheevexresourcetoeventextractionandnetworkconstructionsharedtaskentryandresultanalysis |