Cargando…
A storytree-based model for inter-document causal relation extraction from news articles
With more and more news articles appearing on the Internet, discovering causal relations between news articles is very important for people to understand the development of news. Extracting the causal relations between news articles is an inter-document relation extraction task. Existing works on re...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer London
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9633033/ https://www.ncbi.nlm.nih.gov/pubmed/36348735 http://dx.doi.org/10.1007/s10115-022-01781-7 |
_version_ | 1784824174439563264 |
---|---|
author | Zhang, Chong Lyu, Jiagao Xu, Ke |
author_facet | Zhang, Chong Lyu, Jiagao Xu, Ke |
author_sort | Zhang, Chong |
collection | PubMed |
description | With more and more news articles appearing on the Internet, discovering causal relations between news articles is very important for people to understand the development of news. Extracting the causal relations between news articles is an inter-document relation extraction task. Existing works on relation extraction cannot solve it well because of the following two reasons: (1) most relation extraction models are intra-document models, which focus on relation extraction between entities. However, news articles are many times longer and more complex than entities, which makes the inter-document relation extraction task harder than intra-document. (2) Existing inter-document relation extraction models rely on similarity information between news articles, which could limit the performance of extraction methods. In this paper, we propose an inter-document model based on storytree information to extract causal relations between news articles. We adopt storytree information to integer linear programming (ILP) and design the storytree constraints for the ILP objective function. Experimental results show that all the constraints are effective and the proposed method outperforms widely used machine learning models and a state-of-the-art deep learning model, with F1 improved by more than 5% on three different datasets. Further analysis shows that five constraints in our model improve the results to varying degrees and the effects on the three datasets are different. The experiment about link features also suggests the positive influence of link information. |
format | Online Article Text |
id | pubmed-9633033 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer London |
record_format | MEDLINE/PubMed |
spelling | pubmed-96330332022-11-04 A storytree-based model for inter-document causal relation extraction from news articles Zhang, Chong Lyu, Jiagao Xu, Ke Knowl Inf Syst Regular Paper With more and more news articles appearing on the Internet, discovering causal relations between news articles is very important for people to understand the development of news. Extracting the causal relations between news articles is an inter-document relation extraction task. Existing works on relation extraction cannot solve it well because of the following two reasons: (1) most relation extraction models are intra-document models, which focus on relation extraction between entities. However, news articles are many times longer and more complex than entities, which makes the inter-document relation extraction task harder than intra-document. (2) Existing inter-document relation extraction models rely on similarity information between news articles, which could limit the performance of extraction methods. In this paper, we propose an inter-document model based on storytree information to extract causal relations between news articles. We adopt storytree information to integer linear programming (ILP) and design the storytree constraints for the ILP objective function. Experimental results show that all the constraints are effective and the proposed method outperforms widely used machine learning models and a state-of-the-art deep learning model, with F1 improved by more than 5% on three different datasets. Further analysis shows that five constraints in our model improve the results to varying degrees and the effects on the three datasets are different. The experiment about link features also suggests the positive influence of link information. Springer London 2022-11-03 2023 /pmc/articles/PMC9633033/ /pubmed/36348735 http://dx.doi.org/10.1007/s10115-022-01781-7 Text en © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2022, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Regular Paper Zhang, Chong Lyu, Jiagao Xu, Ke A storytree-based model for inter-document causal relation extraction from news articles |
title | A storytree-based model for inter-document causal relation extraction from news articles |
title_full | A storytree-based model for inter-document causal relation extraction from news articles |
title_fullStr | A storytree-based model for inter-document causal relation extraction from news articles |
title_full_unstemmed | A storytree-based model for inter-document causal relation extraction from news articles |
title_short | A storytree-based model for inter-document causal relation extraction from news articles |
title_sort | storytree-based model for inter-document causal relation extraction from news articles |
topic | Regular Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9633033/ https://www.ncbi.nlm.nih.gov/pubmed/36348735 http://dx.doi.org/10.1007/s10115-022-01781-7 |
work_keys_str_mv | AT zhangchong astorytreebasedmodelforinterdocumentcausalrelationextractionfromnewsarticles AT lyujiagao astorytreebasedmodelforinterdocumentcausalrelationextractionfromnewsarticles AT xuke astorytreebasedmodelforinterdocumentcausalrelationextractionfromnewsarticles AT zhangchong storytreebasedmodelforinterdocumentcausalrelationextractionfromnewsarticles AT lyujiagao storytreebasedmodelforinterdocumentcausalrelationextractionfromnewsarticles AT xuke storytreebasedmodelforinterdocumentcausalrelationextractionfromnewsarticles |