Cargando…
Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents
BACKGROUND: Drug-drug interactions are frequently reported in the increasing amount of biomedical literature. Information Extraction (IE) techniques have been devised as a useful instrument to manage this knowledge. Nevertheless, IE at the sentence level has a limited effect because of the frequent...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3288782/ https://www.ncbi.nlm.nih.gov/pubmed/20406499 http://dx.doi.org/10.1186/1471-2105-11-S2-S1 |
_version_ | 1782224827097022464 |
---|---|
author | Segura-Bedmar, Isabel Crespo, Mario de Pablo-Sánchez, César Martínez, Paloma |
author_facet | Segura-Bedmar, Isabel Crespo, Mario de Pablo-Sánchez, César Martínez, Paloma |
author_sort | Segura-Bedmar, Isabel |
collection | PubMed |
description | BACKGROUND: Drug-drug interactions are frequently reported in the increasing amount of biomedical literature. Information Extraction (IE) techniques have been devised as a useful instrument to manage this knowledge. Nevertheless, IE at the sentence level has a limited effect because of the frequent references to previous entities in the discourse, a phenomenon known as 'anaphora'. DrugNerAR, a drug anaphora resolution system is presented to address the problem of co-referring expressions in pharmacological literature. This development is part of a larger and innovative study about automatic drug-drug interaction extraction. METHODS: The system uses a set of linguistic rules drawn by Centering Theory over the analysis provided by a biomedical syntactic parser. Semantic information provided by the Unified Medical Language System (UMLS) is also integrated in order to improve the recognition and the resolution of nominal drug anaphors. Besides, a corpus has been developed in order to analyze the phenomena and evaluate the current approach. Each possible case of anaphoric expression was looked into to determine the most effective way of resolution. RESULTS: An F-score of 0.76 in anaphora resolution was achieved, outperforming significantly the baseline by almost 73%. This ad-hoc reference line was developed to check the results as there is no previous work on anaphora resolution in pharmalogical documents. The obtained results resemble those found in related-semantic domains. CONCLUSIONS: The present approach shows very promising results in the challenge of accounting for anaphoric expressions in pharmacological texts. DrugNerAr obtains similar results to other approaches dealing with anaphora resolution in the biomedical domain, but, unlike these approaches, it focuses on documents reflecting drug interactions. The Centering Theory has proved being effective at the selection of antecedents in anaphora resolution. A key component in the success of this framework is the analysis provided by the MMTx program and the DrugNer system that allows to deal with the complexity of the pharmacological language. It is expected that the positive results of the resolver increases performance of our future drug-drug interaction extraction system. |
format | Online Article Text |
id | pubmed-3288782 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32887822012-02-29 Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents Segura-Bedmar, Isabel Crespo, Mario de Pablo-Sánchez, César Martínez, Paloma BMC Bioinformatics Proceedings BACKGROUND: Drug-drug interactions are frequently reported in the increasing amount of biomedical literature. Information Extraction (IE) techniques have been devised as a useful instrument to manage this knowledge. Nevertheless, IE at the sentence level has a limited effect because of the frequent references to previous entities in the discourse, a phenomenon known as 'anaphora'. DrugNerAR, a drug anaphora resolution system is presented to address the problem of co-referring expressions in pharmacological literature. This development is part of a larger and innovative study about automatic drug-drug interaction extraction. METHODS: The system uses a set of linguistic rules drawn by Centering Theory over the analysis provided by a biomedical syntactic parser. Semantic information provided by the Unified Medical Language System (UMLS) is also integrated in order to improve the recognition and the resolution of nominal drug anaphors. Besides, a corpus has been developed in order to analyze the phenomena and evaluate the current approach. Each possible case of anaphoric expression was looked into to determine the most effective way of resolution. RESULTS: An F-score of 0.76 in anaphora resolution was achieved, outperforming significantly the baseline by almost 73%. This ad-hoc reference line was developed to check the results as there is no previous work on anaphora resolution in pharmalogical documents. The obtained results resemble those found in related-semantic domains. CONCLUSIONS: The present approach shows very promising results in the challenge of accounting for anaphoric expressions in pharmacological texts. DrugNerAr obtains similar results to other approaches dealing with anaphora resolution in the biomedical domain, but, unlike these approaches, it focuses on documents reflecting drug interactions. The Centering Theory has proved being effective at the selection of antecedents in anaphora resolution. A key component in the success of this framework is the analysis provided by the MMTx program and the DrugNer system that allows to deal with the complexity of the pharmacological language. It is expected that the positive results of the resolver increases performance of our future drug-drug interaction extraction system. BioMed Central 2010-04-16 /pmc/articles/PMC3288782/ /pubmed/20406499 http://dx.doi.org/10.1186/1471-2105-11-S2-S1 Text en Copyright ©2010 Segura-Bedmar et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Segura-Bedmar, Isabel Crespo, Mario de Pablo-Sánchez, César Martínez, Paloma Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents |
title | Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents |
title_full | Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents |
title_fullStr | Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents |
title_full_unstemmed | Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents |
title_short | Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents |
title_sort | resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3288782/ https://www.ncbi.nlm.nih.gov/pubmed/20406499 http://dx.doi.org/10.1186/1471-2105-11-S2-S1 |
work_keys_str_mv | AT segurabedmarisabel resolvinganaphorasfortheextractionofdrugdruginteractionsinpharmacologicaldocuments AT crespomario resolvinganaphorasfortheextractionofdrugdruginteractionsinpharmacologicaldocuments AT depablosanchezcesar resolvinganaphorasfortheextractionofdrugdruginteractionsinpharmacologicaldocuments AT martinezpaloma resolvinganaphorasfortheextractionofdrugdruginteractionsinpharmacologicaldocuments |