Cargando…

Exploiting document graphs for inter sentence relation extraction

BACKGROUND: Most previous relation extraction (RE) studies have focused on intra sentence relations and have ignored relations that span sentences, i.e. inter sentence relations. Such relations connect entities at the document level rather than as relational facts in a single sentence. Extracting fa...

Descripción completa

Detalles Bibliográficos
Autores principales: Le, Hoang-Quynh, Can, Duy-Cat, Collier, Nigel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9166375/
https://www.ncbi.nlm.nih.gov/pubmed/35659292
http://dx.doi.org/10.1186/s13326-022-00267-3
_version_ 1784720588910100480
author Le, Hoang-Quynh
Can, Duy-Cat
Collier, Nigel
author_facet Le, Hoang-Quynh
Can, Duy-Cat
Collier, Nigel
author_sort Le, Hoang-Quynh
collection PubMed
description BACKGROUND: Most previous relation extraction (RE) studies have focused on intra sentence relations and have ignored relations that span sentences, i.e. inter sentence relations. Such relations connect entities at the document level rather than as relational facts in a single sentence. Extracting facts that are expressed across sentences leads to some challenges and requires different approaches than those usually applied in recent intra sentence relation extraction. Despite recent results, there are still limitations to be overcome. RESULTS: We present a novel representation for a sequence of consecutive sentences, namely document subgraph, to extract inter sentence relations. Experiments on the BioCreative V Chemical-Disease Relation corpus demonstrate the advantages and robustness of our novel system to extract both intra- and inter sentence relations in biomedical literature abstracts. The experimental results are comparable to state-of-the-art approaches and show the potential by demonstrating the effectiveness of graphs, deep learning-based model, and other processing techniques. Experiments were also carried out to verify the rationality and impact of various additional information and model components. CONCLUSIONS: Our proposed graph-based representation helps to extract ∼50% of inter sentence relations and boosts the model performance on both precision and recall compared to the baseline model. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s13326-022-00267-3).
format Online
Article
Text
id pubmed-9166375
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-91663752022-06-05 Exploiting document graphs for inter sentence relation extraction Le, Hoang-Quynh Can, Duy-Cat Collier, Nigel J Biomed Semantics Research BACKGROUND: Most previous relation extraction (RE) studies have focused on intra sentence relations and have ignored relations that span sentences, i.e. inter sentence relations. Such relations connect entities at the document level rather than as relational facts in a single sentence. Extracting facts that are expressed across sentences leads to some challenges and requires different approaches than those usually applied in recent intra sentence relation extraction. Despite recent results, there are still limitations to be overcome. RESULTS: We present a novel representation for a sequence of consecutive sentences, namely document subgraph, to extract inter sentence relations. Experiments on the BioCreative V Chemical-Disease Relation corpus demonstrate the advantages and robustness of our novel system to extract both intra- and inter sentence relations in biomedical literature abstracts. The experimental results are comparable to state-of-the-art approaches and show the potential by demonstrating the effectiveness of graphs, deep learning-based model, and other processing techniques. Experiments were also carried out to verify the rationality and impact of various additional information and model components. CONCLUSIONS: Our proposed graph-based representation helps to extract ∼50% of inter sentence relations and boosts the model performance on both precision and recall compared to the baseline model. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s13326-022-00267-3). BioMed Central 2022-06-03 /pmc/articles/PMC9166375/ /pubmed/35659292 http://dx.doi.org/10.1186/s13326-022-00267-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Le, Hoang-Quynh
Can, Duy-Cat
Collier, Nigel
Exploiting document graphs for inter sentence relation extraction
title Exploiting document graphs for inter sentence relation extraction
title_full Exploiting document graphs for inter sentence relation extraction
title_fullStr Exploiting document graphs for inter sentence relation extraction
title_full_unstemmed Exploiting document graphs for inter sentence relation extraction
title_short Exploiting document graphs for inter sentence relation extraction
title_sort exploiting document graphs for inter sentence relation extraction
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9166375/
https://www.ncbi.nlm.nih.gov/pubmed/35659292
http://dx.doi.org/10.1186/s13326-022-00267-3
work_keys_str_mv AT lehoangquynh exploitingdocumentgraphsforintersentencerelationextraction
AT canduycat exploitingdocumentgraphsforintersentencerelationextraction
AT colliernigel exploitingdocumentgraphsforintersentencerelationextraction