Cargando…
Extracting semantically enriched events from biomedical literature
BACKGROUND: Research into event-based text mining from the biomedical literature has been growing in popularity to facilitate the development of advanced biomedical text mining systems. Such technology permits advanced search, which goes beyond document or sentence-based retrieval. However, existing...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3464657/ https://www.ncbi.nlm.nih.gov/pubmed/22621266 http://dx.doi.org/10.1186/1471-2105-13-108 |
_version_ | 1782245446604816384 |
---|---|
author | Miwa, Makoto Thompson, Paul McNaught, John Kell, Douglas B Ananiadou, Sophia |
author_facet | Miwa, Makoto Thompson, Paul McNaught, John Kell, Douglas B Ananiadou, Sophia |
author_sort | Miwa, Makoto |
collection | PubMed |
description | BACKGROUND: Research into event-based text mining from the biomedical literature has been growing in popularity to facilitate the development of advanced biomedical text mining systems. Such technology permits advanced search, which goes beyond document or sentence-based retrieval. However, existing event-based systems typically ignore additional information within the textual context of events that can determine, amongst other things, whether an event represents a fact, hypothesis, experimental result or analysis of results, whether it describes new or previously reported knowledge, and whether it is speculated or negated. We refer to such contextual information as meta-knowledge. The automatic recognition of such information can permit the training of systems allowing finer-grained searching of events according to the meta-knowledge that is associated with them. RESULTS: Based on a corpus of 1,000 MEDLINE abstracts, fully manually annotated with both events and associated meta-knowledge, we have constructed a machine learning-based system that automatically assigns meta-knowledge information to events. This system has been integrated into EventMine, a state-of-the-art event extraction system, in order to create a more advanced system (EventMine-MK) that not only extracts events from text automatically, but also assigns five different types of meta-knowledge to these events. The meta-knowledge assignment module of EventMine-MK performs with macro-averaged F-scores in the range of 57-87% on the BioNLP’09 Shared Task corpus. EventMine-MK has been evaluated on the BioNLP’09 Shared Task subtask of detecting negated and speculated events. Our results show that EventMine-MK can outperform other state-of-the-art systems that participated in this task. CONCLUSIONS: We have constructed the first practical system that extracts both events and associated, detailed meta-knowledge information from biomedical literature. The automatically assigned meta-knowledge information can be used to refine search systems, in order to provide an extra search layer beyond entities and assertions, dealing with phenomena such as rhetorical intent, speculations, contradictions and negations. This finer grained search functionality can assist in several important tasks, e.g., database curation (by locating new experimental knowledge) and pathway enrichment (by providing information for inference). To allow easy integration into text mining systems, EventMine-MK is provided as a UIMA component that can be used in the interoperable text mining infrastructure, U-Compare. |
format | Online Article Text |
id | pubmed-3464657 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-34646572012-10-05 Extracting semantically enriched events from biomedical literature Miwa, Makoto Thompson, Paul McNaught, John Kell, Douglas B Ananiadou, Sophia BMC Bioinformatics Research Article BACKGROUND: Research into event-based text mining from the biomedical literature has been growing in popularity to facilitate the development of advanced biomedical text mining systems. Such technology permits advanced search, which goes beyond document or sentence-based retrieval. However, existing event-based systems typically ignore additional information within the textual context of events that can determine, amongst other things, whether an event represents a fact, hypothesis, experimental result or analysis of results, whether it describes new or previously reported knowledge, and whether it is speculated or negated. We refer to such contextual information as meta-knowledge. The automatic recognition of such information can permit the training of systems allowing finer-grained searching of events according to the meta-knowledge that is associated with them. RESULTS: Based on a corpus of 1,000 MEDLINE abstracts, fully manually annotated with both events and associated meta-knowledge, we have constructed a machine learning-based system that automatically assigns meta-knowledge information to events. This system has been integrated into EventMine, a state-of-the-art event extraction system, in order to create a more advanced system (EventMine-MK) that not only extracts events from text automatically, but also assigns five different types of meta-knowledge to these events. The meta-knowledge assignment module of EventMine-MK performs with macro-averaged F-scores in the range of 57-87% on the BioNLP’09 Shared Task corpus. EventMine-MK has been evaluated on the BioNLP’09 Shared Task subtask of detecting negated and speculated events. Our results show that EventMine-MK can outperform other state-of-the-art systems that participated in this task. CONCLUSIONS: We have constructed the first practical system that extracts both events and associated, detailed meta-knowledge information from biomedical literature. The automatically assigned meta-knowledge information can be used to refine search systems, in order to provide an extra search layer beyond entities and assertions, dealing with phenomena such as rhetorical intent, speculations, contradictions and negations. This finer grained search functionality can assist in several important tasks, e.g., database curation (by locating new experimental knowledge) and pathway enrichment (by providing information for inference). To allow easy integration into text mining systems, EventMine-MK is provided as a UIMA component that can be used in the interoperable text mining infrastructure, U-Compare. BioMed Central 2012-05-23 /pmc/articles/PMC3464657/ /pubmed/22621266 http://dx.doi.org/10.1186/1471-2105-13-108 Text en Copyright ©2012 Miwa et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Miwa, Makoto Thompson, Paul McNaught, John Kell, Douglas B Ananiadou, Sophia Extracting semantically enriched events from biomedical literature |
title | Extracting semantically enriched events from biomedical literature |
title_full | Extracting semantically enriched events from biomedical literature |
title_fullStr | Extracting semantically enriched events from biomedical literature |
title_full_unstemmed | Extracting semantically enriched events from biomedical literature |
title_short | Extracting semantically enriched events from biomedical literature |
title_sort | extracting semantically enriched events from biomedical literature |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3464657/ https://www.ncbi.nlm.nih.gov/pubmed/22621266 http://dx.doi.org/10.1186/1471-2105-13-108 |
work_keys_str_mv | AT miwamakoto extractingsemanticallyenrichedeventsfrombiomedicalliterature AT thompsonpaul extractingsemanticallyenrichedeventsfrombiomedicalliterature AT mcnaughtjohn extractingsemanticallyenrichedeventsfrombiomedicalliterature AT kelldouglasb extractingsemanticallyenrichedeventsfrombiomedicalliterature AT ananiadousophia extractingsemanticallyenrichedeventsfrombiomedicalliterature |