Cargando…

Overview of the gene regulation network and the bacteria biotope tasks in BioNLP'13 shared task

BACKGROUND: We present the two Bacteria Track tasks of BioNLP 2013 Shared Task (ST): Gene Regulation Network (GRN) and Bacteria Biotope (BB). These tasks were previously introduced in the 2011 BioNLP-ST Bacteria Track as Bacteria Gene Interaction (BI) and Bacteria Biotope (BB). The Bacteria Track wa...

Descripción completa

Detalles Bibliográficos
Autores principales: Bossy, Robert, Golik, Wiktoria, Ratkovic, Zorana, Valsamou, Dialekti, Bessières, Philippe, Nédellec, Claire
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4511173/
https://www.ncbi.nlm.nih.gov/pubmed/26202448
http://dx.doi.org/10.1186/1471-2105-16-S10-S1
_version_ 1782382288526376960
author Bossy, Robert
Golik, Wiktoria
Ratkovic, Zorana
Valsamou, Dialekti
Bessières, Philippe
Nédellec, Claire
author_facet Bossy, Robert
Golik, Wiktoria
Ratkovic, Zorana
Valsamou, Dialekti
Bessières, Philippe
Nédellec, Claire
author_sort Bossy, Robert
collection PubMed
description BACKGROUND: We present the two Bacteria Track tasks of BioNLP 2013 Shared Task (ST): Gene Regulation Network (GRN) and Bacteria Biotope (BB). These tasks were previously introduced in the 2011 BioNLP-ST Bacteria Track as Bacteria Gene Interaction (BI) and Bacteria Biotope (BB). The Bacteria Track was motivated by a need to develop specific BioNLP tools for fine-grained event extraction in bacteria biology. The 2013 tasks expand on the 2011 version by better addressing the biological knowledge modeling needs. New evaluation metrics were designed for the new goals. Moving beyond a list of gene interactions, the goal of the GRN task is to build a gene regulation network from the extracted gene interactions. BB'13 is dedicated to the extraction of bacteria biotopes, i.e. bacterial environmental information, as was BB'11. BB'13 extends the typology of BB'11 to a large diversity of biotopes, as defined by the OntoBiotope ontology. The detection of entities and events is tackled by distinct subtasks in order to measure the progress achieved by the participant systems since 2011. RESULTS: This paper details the corpus preparations and the evaluation metrics, as well as summarizing and discussing the participant results. Five groups participated in each of the two tasks. The high diversity of the participant methods reflects the dynamism of the BioNLP research community. The highest scores for the GRN and BB'13 tasks are similar to those obtained by the participants in 2011, despite of the increase in difficulty. The high density of events in short text segments (multi-event extraction) was a difficult issue for the participating systems for both tasks. The analysis of the BB'13 results also shows that co-reference resolution and entity boundary detection remain major hindrances. CONCLUSION: The evaluation results suggest new research directions for the improvement and development of Information Extraction for molecular and environmental biology. The Bacteria Track tasks remain publicly open; the BioNLP-ST website provides an online evaluation service, the reference corpora and the evaluation tools.
format Online
Article
Text
id pubmed-4511173
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45111732015-07-28 Overview of the gene regulation network and the bacteria biotope tasks in BioNLP'13 shared task Bossy, Robert Golik, Wiktoria Ratkovic, Zorana Valsamou, Dialekti Bessières, Philippe Nédellec, Claire BMC Bioinformatics Research BACKGROUND: We present the two Bacteria Track tasks of BioNLP 2013 Shared Task (ST): Gene Regulation Network (GRN) and Bacteria Biotope (BB). These tasks were previously introduced in the 2011 BioNLP-ST Bacteria Track as Bacteria Gene Interaction (BI) and Bacteria Biotope (BB). The Bacteria Track was motivated by a need to develop specific BioNLP tools for fine-grained event extraction in bacteria biology. The 2013 tasks expand on the 2011 version by better addressing the biological knowledge modeling needs. New evaluation metrics were designed for the new goals. Moving beyond a list of gene interactions, the goal of the GRN task is to build a gene regulation network from the extracted gene interactions. BB'13 is dedicated to the extraction of bacteria biotopes, i.e. bacterial environmental information, as was BB'11. BB'13 extends the typology of BB'11 to a large diversity of biotopes, as defined by the OntoBiotope ontology. The detection of entities and events is tackled by distinct subtasks in order to measure the progress achieved by the participant systems since 2011. RESULTS: This paper details the corpus preparations and the evaluation metrics, as well as summarizing and discussing the participant results. Five groups participated in each of the two tasks. The high diversity of the participant methods reflects the dynamism of the BioNLP research community. The highest scores for the GRN and BB'13 tasks are similar to those obtained by the participants in 2011, despite of the increase in difficulty. The high density of events in short text segments (multi-event extraction) was a difficult issue for the participating systems for both tasks. The analysis of the BB'13 results also shows that co-reference resolution and entity boundary detection remain major hindrances. CONCLUSION: The evaluation results suggest new research directions for the improvement and development of Information Extraction for molecular and environmental biology. The Bacteria Track tasks remain publicly open; the BioNLP-ST website provides an online evaluation service, the reference corpora and the evaluation tools. BioMed Central 2015-07-13 /pmc/articles/PMC4511173/ /pubmed/26202448 http://dx.doi.org/10.1186/1471-2105-16-S10-S1 Text en Copyright © 2015 Bossy et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Bossy, Robert
Golik, Wiktoria
Ratkovic, Zorana
Valsamou, Dialekti
Bessières, Philippe
Nédellec, Claire
Overview of the gene regulation network and the bacteria biotope tasks in BioNLP'13 shared task
title Overview of the gene regulation network and the bacteria biotope tasks in BioNLP'13 shared task
title_full Overview of the gene regulation network and the bacteria biotope tasks in BioNLP'13 shared task
title_fullStr Overview of the gene regulation network and the bacteria biotope tasks in BioNLP'13 shared task
title_full_unstemmed Overview of the gene regulation network and the bacteria biotope tasks in BioNLP'13 shared task
title_short Overview of the gene regulation network and the bacteria biotope tasks in BioNLP'13 shared task
title_sort overview of the gene regulation network and the bacteria biotope tasks in bionlp'13 shared task
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4511173/
https://www.ncbi.nlm.nih.gov/pubmed/26202448
http://dx.doi.org/10.1186/1471-2105-16-S10-S1
work_keys_str_mv AT bossyrobert overviewofthegeneregulationnetworkandthebacteriabiotopetasksinbionlp13sharedtask
AT golikwiktoria overviewofthegeneregulationnetworkandthebacteriabiotopetasksinbionlp13sharedtask
AT ratkoviczorana overviewofthegeneregulationnetworkandthebacteriabiotopetasksinbionlp13sharedtask
AT valsamoudialekti overviewofthegeneregulationnetworkandthebacteriabiotopetasksinbionlp13sharedtask
AT bessieresphilippe overviewofthegeneregulationnetworkandthebacteriabiotopetasksinbionlp13sharedtask
AT nedellecclaire overviewofthegeneregulationnetworkandthebacteriabiotopetasksinbionlp13sharedtask