Cargando…
Improving the extraction of complex regulatory events from scientific text by using ontology-based inference
BACKGROUND: The extraction of complex events from biomedical text is a challenging task and requires in-depth semantic analysis. Previous approaches associate lexical and syntactic resources with ontologies for the semantic analysis, but fall short in testing the benefits from the use of domain know...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3239303/ https://www.ncbi.nlm.nih.gov/pubmed/22166672 http://dx.doi.org/10.1186/2041-1480-2-S5-S3 |
_version_ | 1782219163505262592 |
---|---|
author | Kim, Jung-jae Rebholz-Schuhmann, Dietrich |
author_facet | Kim, Jung-jae Rebholz-Schuhmann, Dietrich |
author_sort | Kim, Jung-jae |
collection | PubMed |
description | BACKGROUND: The extraction of complex events from biomedical text is a challenging task and requires in-depth semantic analysis. Previous approaches associate lexical and syntactic resources with ontologies for the semantic analysis, but fall short in testing the benefits from the use of domain knowledge. RESULTS: We developed a system that deduces implicit events from explicitly expressed events by using inference rules that encode domain knowledge. We evaluated the system with the inference module on three tasks: First, when tested against a corpus with manually annotated events, the inference module of our system contributes 53.2% of correct extractions, but does not cause any incorrect results. Second, the system overall reproduces 33.1% of the transcription regulatory events contained in RegulonDB (up to 85.0% precision) and the inference module is required for 93.8% of the reproduced events. Third, we applied the system with minimum adaptations to the identification of cell activity regulation events, confirming that the inference improves the performance of the system also on this task. CONCLUSIONS: Our research shows that the inference based on domain knowledge plays a significant role in extracting complex events from text. This approach has great potential in recognizing the complex concepts of such biomedical ontologies as Gene Ontology in the literature. |
format | Online Article Text |
id | pubmed-3239303 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32393032011-12-16 Improving the extraction of complex regulatory events from scientific text by using ontology-based inference Kim, Jung-jae Rebholz-Schuhmann, Dietrich J Biomed Semantics Research BACKGROUND: The extraction of complex events from biomedical text is a challenging task and requires in-depth semantic analysis. Previous approaches associate lexical and syntactic resources with ontologies for the semantic analysis, but fall short in testing the benefits from the use of domain knowledge. RESULTS: We developed a system that deduces implicit events from explicitly expressed events by using inference rules that encode domain knowledge. We evaluated the system with the inference module on three tasks: First, when tested against a corpus with manually annotated events, the inference module of our system contributes 53.2% of correct extractions, but does not cause any incorrect results. Second, the system overall reproduces 33.1% of the transcription regulatory events contained in RegulonDB (up to 85.0% precision) and the inference module is required for 93.8% of the reproduced events. Third, we applied the system with minimum adaptations to the identification of cell activity regulation events, confirming that the inference improves the performance of the system also on this task. CONCLUSIONS: Our research shows that the inference based on domain knowledge plays a significant role in extracting complex events from text. This approach has great potential in recognizing the complex concepts of such biomedical ontologies as Gene Ontology in the literature. BioMed Central 2011-10-06 /pmc/articles/PMC3239303/ /pubmed/22166672 http://dx.doi.org/10.1186/2041-1480-2-S5-S3 Text en Copyright ©2011 Kim and Rebholz-Schuhmann; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Kim, Jung-jae Rebholz-Schuhmann, Dietrich Improving the extraction of complex regulatory events from scientific text by using ontology-based inference |
title | Improving the extraction of complex regulatory events from scientific text by using ontology-based inference |
title_full | Improving the extraction of complex regulatory events from scientific text by using ontology-based inference |
title_fullStr | Improving the extraction of complex regulatory events from scientific text by using ontology-based inference |
title_full_unstemmed | Improving the extraction of complex regulatory events from scientific text by using ontology-based inference |
title_short | Improving the extraction of complex regulatory events from scientific text by using ontology-based inference |
title_sort | improving the extraction of complex regulatory events from scientific text by using ontology-based inference |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3239303/ https://www.ncbi.nlm.nih.gov/pubmed/22166672 http://dx.doi.org/10.1186/2041-1480-2-S5-S3 |
work_keys_str_mv | AT kimjungjae improvingtheextractionofcomplexregulatoryeventsfromscientifictextbyusingontologybasedinference AT rebholzschuhmanndietrich improvingtheextractionofcomplexregulatoryeventsfromscientifictextbyusingontologybasedinference |