Cargando…
Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature
Relation extraction typically involves the extraction of relations between two or more entities occurring within a single or multiple sentences. In this study, we investigated the significance of extracting information from multiple sentences specifically in the context of drug-disease relation disc...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5859928/ https://www.ncbi.nlm.nih.gov/pubmed/26262109 |
_version_ | 1783307927649517568 |
---|---|
author | Rastegar-Mojarad, Majid Elayavilli, Ravikumar Komandur Li, Dingcheng Liu, Hongfang |
author_facet | Rastegar-Mojarad, Majid Elayavilli, Ravikumar Komandur Li, Dingcheng Liu, Hongfang |
author_sort | Rastegar-Mojarad, Majid |
collection | PubMed |
description | Relation extraction typically involves the extraction of relations between two or more entities occurring within a single or multiple sentences. In this study, we investigated the significance of extracting information from multiple sentences specifically in the context of drug-disease relation discovery. We used multiple resources such as Semantic Medline, a literature based resource, and Medline search (for filtering spurious results) and inferred 8,772 potential drug-disease pairs. Our analysis revealed that 6,450 (73.5%) of the 8,772 potential drug-disease relations did not occur in a single sentence. Moreover, only 537 of the drug-disease pairs matched the curated gold standard in Comparative Toxicogenomics Database (CTD), a trusted resource for drug-disease relations. Among the 537, nearly 75% (407) of the drug-disease pairs occur in multiple sentences. Our analysis revealed that the drug-disease pairs inferred from Semantic Medline or retrieved from CTD could be extracted from multiple sentences in the literature. This highlights the significance of the need of discourse-level analysis in extracting the relations from biomedical literature. |
format | Online Article Text |
id | pubmed-5859928 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
record_format | MEDLINE/PubMed |
spelling | pubmed-58599282018-03-20 Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature Rastegar-Mojarad, Majid Elayavilli, Ravikumar Komandur Li, Dingcheng Liu, Hongfang Stud Health Technol Inform Article Relation extraction typically involves the extraction of relations between two or more entities occurring within a single or multiple sentences. In this study, we investigated the significance of extracting information from multiple sentences specifically in the context of drug-disease relation discovery. We used multiple resources such as Semantic Medline, a literature based resource, and Medline search (for filtering spurious results) and inferred 8,772 potential drug-disease pairs. Our analysis revealed that 6,450 (73.5%) of the 8,772 potential drug-disease relations did not occur in a single sentence. Moreover, only 537 of the drug-disease pairs matched the curated gold standard in Comparative Toxicogenomics Database (CTD), a trusted resource for drug-disease relations. Among the 537, nearly 75% (407) of the drug-disease pairs occur in multiple sentences. Our analysis revealed that the drug-disease pairs inferred from Semantic Medline or retrieved from CTD could be extracted from multiple sentences in the literature. This highlights the significance of the need of discourse-level analysis in extracting the relations from biomedical literature. 2015 /pmc/articles/PMC5859928/ /pubmed/26262109 Text en This article is published online with Open Access by IOS Press and distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Article Rastegar-Mojarad, Majid Elayavilli, Ravikumar Komandur Li, Dingcheng Liu, Hongfang Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature |
title | Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature |
title_full | Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature |
title_fullStr | Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature |
title_full_unstemmed | Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature |
title_short | Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature |
title_sort | assessing the need of discourse-level analysis in identifying evidence of drug-disease relations in scientific literature |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5859928/ https://www.ncbi.nlm.nih.gov/pubmed/26262109 |
work_keys_str_mv | AT rastegarmojaradmajid assessingtheneedofdiscourselevelanalysisinidentifyingevidenceofdrugdiseaserelationsinscientificliterature AT elayavilliravikumarkomandur assessingtheneedofdiscourselevelanalysisinidentifyingevidenceofdrugdiseaserelationsinscientificliterature AT lidingcheng assessingtheneedofdiscourselevelanalysisinidentifyingevidenceofdrugdiseaserelationsinscientificliterature AT liuhongfang assessingtheneedofdiscourselevelanalysisinidentifyingevidenceofdrugdiseaserelationsinscientificliterature |