Cargando…

Improving the extraction of complex regulatory events from scientific text by using ontology-based inference

BACKGROUND: The extraction of complex events from biomedical text is a challenging task and requires in-depth semantic analysis. Previous approaches associate lexical and syntactic resources with ontologies for the semantic analysis, but fall short in testing the benefits from the use of domain know...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Jung-jae, Rebholz-Schuhmann, Dietrich
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3239303/
https://www.ncbi.nlm.nih.gov/pubmed/22166672
http://dx.doi.org/10.1186/2041-1480-2-S5-S3
_version_ 1782219163505262592
author Kim, Jung-jae
Rebholz-Schuhmann, Dietrich
author_facet Kim, Jung-jae
Rebholz-Schuhmann, Dietrich
author_sort Kim, Jung-jae
collection PubMed
description BACKGROUND: The extraction of complex events from biomedical text is a challenging task and requires in-depth semantic analysis. Previous approaches associate lexical and syntactic resources with ontologies for the semantic analysis, but fall short in testing the benefits from the use of domain knowledge. RESULTS: We developed a system that deduces implicit events from explicitly expressed events by using inference rules that encode domain knowledge. We evaluated the system with the inference module on three tasks: First, when tested against a corpus with manually annotated events, the inference module of our system contributes 53.2% of correct extractions, but does not cause any incorrect results. Second, the system overall reproduces 33.1% of the transcription regulatory events contained in RegulonDB (up to 85.0% precision) and the inference module is required for 93.8% of the reproduced events. Third, we applied the system with minimum adaptations to the identification of cell activity regulation events, confirming that the inference improves the performance of the system also on this task. CONCLUSIONS: Our research shows that the inference based on domain knowledge plays a significant role in extracting complex events from text. This approach has great potential in recognizing the complex concepts of such biomedical ontologies as Gene Ontology in the literature.
format Online
Article
Text
id pubmed-3239303
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32393032011-12-16 Improving the extraction of complex regulatory events from scientific text by using ontology-based inference Kim, Jung-jae Rebholz-Schuhmann, Dietrich J Biomed Semantics Research BACKGROUND: The extraction of complex events from biomedical text is a challenging task and requires in-depth semantic analysis. Previous approaches associate lexical and syntactic resources with ontologies for the semantic analysis, but fall short in testing the benefits from the use of domain knowledge. RESULTS: We developed a system that deduces implicit events from explicitly expressed events by using inference rules that encode domain knowledge. We evaluated the system with the inference module on three tasks: First, when tested against a corpus with manually annotated events, the inference module of our system contributes 53.2% of correct extractions, but does not cause any incorrect results. Second, the system overall reproduces 33.1% of the transcription regulatory events contained in RegulonDB (up to 85.0% precision) and the inference module is required for 93.8% of the reproduced events. Third, we applied the system with minimum adaptations to the identification of cell activity regulation events, confirming that the inference improves the performance of the system also on this task. CONCLUSIONS: Our research shows that the inference based on domain knowledge plays a significant role in extracting complex events from text. This approach has great potential in recognizing the complex concepts of such biomedical ontologies as Gene Ontology in the literature. BioMed Central 2011-10-06 /pmc/articles/PMC3239303/ /pubmed/22166672 http://dx.doi.org/10.1186/2041-1480-2-S5-S3 Text en Copyright ©2011 Kim and Rebholz-Schuhmann; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Kim, Jung-jae
Rebholz-Schuhmann, Dietrich
Improving the extraction of complex regulatory events from scientific text by using ontology-based inference
title Improving the extraction of complex regulatory events from scientific text by using ontology-based inference
title_full Improving the extraction of complex regulatory events from scientific text by using ontology-based inference
title_fullStr Improving the extraction of complex regulatory events from scientific text by using ontology-based inference
title_full_unstemmed Improving the extraction of complex regulatory events from scientific text by using ontology-based inference
title_short Improving the extraction of complex regulatory events from scientific text by using ontology-based inference
title_sort improving the extraction of complex regulatory events from scientific text by using ontology-based inference
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3239303/
https://www.ncbi.nlm.nih.gov/pubmed/22166672
http://dx.doi.org/10.1186/2041-1480-2-S5-S3
work_keys_str_mv AT kimjungjae improvingtheextractionofcomplexregulatoryeventsfromscientifictextbyusingontologybasedinference
AT rebholzschuhmanndietrich improvingtheextractionofcomplexregulatoryeventsfromscientifictextbyusingontologybasedinference