Cargando…

Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents

BACKGROUND: Drug-drug interactions are frequently reported in the increasing amount of biomedical literature. Information Extraction (IE) techniques have been devised as a useful instrument to manage this knowledge. Nevertheless, IE at the sentence level has a limited effect because of the frequent...

Descripción completa

Detalles Bibliográficos
Autores principales: Segura-Bedmar, Isabel, Crespo, Mario, de Pablo-Sánchez, César, Martínez, Paloma
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3288782/
https://www.ncbi.nlm.nih.gov/pubmed/20406499
http://dx.doi.org/10.1186/1471-2105-11-S2-S1
_version_ 1782224827097022464
author Segura-Bedmar, Isabel
Crespo, Mario
de Pablo-Sánchez, César
Martínez, Paloma
author_facet Segura-Bedmar, Isabel
Crespo, Mario
de Pablo-Sánchez, César
Martínez, Paloma
author_sort Segura-Bedmar, Isabel
collection PubMed
description BACKGROUND: Drug-drug interactions are frequently reported in the increasing amount of biomedical literature. Information Extraction (IE) techniques have been devised as a useful instrument to manage this knowledge. Nevertheless, IE at the sentence level has a limited effect because of the frequent references to previous entities in the discourse, a phenomenon known as 'anaphora'. DrugNerAR, a drug anaphora resolution system is presented to address the problem of co-referring expressions in pharmacological literature. This development is part of a larger and innovative study about automatic drug-drug interaction extraction. METHODS: The system uses a set of linguistic rules drawn by Centering Theory over the analysis provided by a biomedical syntactic parser. Semantic information provided by the Unified Medical Language System (UMLS) is also integrated in order to improve the recognition and the resolution of nominal drug anaphors. Besides, a corpus has been developed in order to analyze the phenomena and evaluate the current approach. Each possible case of anaphoric expression was looked into to determine the most effective way of resolution. RESULTS: An F-score of 0.76 in anaphora resolution was achieved, outperforming significantly the baseline by almost 73%. This ad-hoc reference line was developed to check the results as there is no previous work on anaphora resolution in pharmalogical documents. The obtained results resemble those found in related-semantic domains. CONCLUSIONS: The present approach shows very promising results in the challenge of accounting for anaphoric expressions in pharmacological texts. DrugNerAr obtains similar results to other approaches dealing with anaphora resolution in the biomedical domain, but, unlike these approaches, it focuses on documents reflecting drug interactions. The Centering Theory has proved being effective at the selection of antecedents in anaphora resolution. A key component in the success of this framework is the analysis provided by the MMTx program and the DrugNer system that allows to deal with the complexity of the pharmacological language. It is expected that the positive results of the resolver increases performance of our future drug-drug interaction extraction system.
format Online
Article
Text
id pubmed-3288782
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32887822012-02-29 Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents Segura-Bedmar, Isabel Crespo, Mario de Pablo-Sánchez, César Martínez, Paloma BMC Bioinformatics Proceedings BACKGROUND: Drug-drug interactions are frequently reported in the increasing amount of biomedical literature. Information Extraction (IE) techniques have been devised as a useful instrument to manage this knowledge. Nevertheless, IE at the sentence level has a limited effect because of the frequent references to previous entities in the discourse, a phenomenon known as 'anaphora'. DrugNerAR, a drug anaphora resolution system is presented to address the problem of co-referring expressions in pharmacological literature. This development is part of a larger and innovative study about automatic drug-drug interaction extraction. METHODS: The system uses a set of linguistic rules drawn by Centering Theory over the analysis provided by a biomedical syntactic parser. Semantic information provided by the Unified Medical Language System (UMLS) is also integrated in order to improve the recognition and the resolution of nominal drug anaphors. Besides, a corpus has been developed in order to analyze the phenomena and evaluate the current approach. Each possible case of anaphoric expression was looked into to determine the most effective way of resolution. RESULTS: An F-score of 0.76 in anaphora resolution was achieved, outperforming significantly the baseline by almost 73%. This ad-hoc reference line was developed to check the results as there is no previous work on anaphora resolution in pharmalogical documents. The obtained results resemble those found in related-semantic domains. CONCLUSIONS: The present approach shows very promising results in the challenge of accounting for anaphoric expressions in pharmacological texts. DrugNerAr obtains similar results to other approaches dealing with anaphora resolution in the biomedical domain, but, unlike these approaches, it focuses on documents reflecting drug interactions. The Centering Theory has proved being effective at the selection of antecedents in anaphora resolution. A key component in the success of this framework is the analysis provided by the MMTx program and the DrugNer system that allows to deal with the complexity of the pharmacological language. It is expected that the positive results of the resolver increases performance of our future drug-drug interaction extraction system. BioMed Central 2010-04-16 /pmc/articles/PMC3288782/ /pubmed/20406499 http://dx.doi.org/10.1186/1471-2105-11-S2-S1 Text en Copyright ©2010 Segura-Bedmar et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Segura-Bedmar, Isabel
Crespo, Mario
de Pablo-Sánchez, César
Martínez, Paloma
Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents
title Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents
title_full Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents
title_fullStr Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents
title_full_unstemmed Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents
title_short Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents
title_sort resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3288782/
https://www.ncbi.nlm.nih.gov/pubmed/20406499
http://dx.doi.org/10.1186/1471-2105-11-S2-S1
work_keys_str_mv AT segurabedmarisabel resolvinganaphorasfortheextractionofdrugdruginteractionsinpharmacologicaldocuments
AT crespomario resolvinganaphorasfortheextractionofdrugdruginteractionsinpharmacologicaldocuments
AT depablosanchezcesar resolvinganaphorasfortheextractionofdrugdruginteractionsinpharmacologicaldocuments
AT martinezpaloma resolvinganaphorasfortheextractionofdrugdruginteractionsinpharmacologicaldocuments