Cargando…

A storytree-based model for inter-document causal relation extraction from news articles

With more and more news articles appearing on the Internet, discovering causal relations between news articles is very important for people to understand the development of news. Extracting the causal relations between news articles is an inter-document relation extraction task. Existing works on re...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Chong, Lyu, Jiagao, Xu, Ke
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer London 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9633033/
https://www.ncbi.nlm.nih.gov/pubmed/36348735
http://dx.doi.org/10.1007/s10115-022-01781-7
_version_ 1784824174439563264
author Zhang, Chong
Lyu, Jiagao
Xu, Ke
author_facet Zhang, Chong
Lyu, Jiagao
Xu, Ke
author_sort Zhang, Chong
collection PubMed
description With more and more news articles appearing on the Internet, discovering causal relations between news articles is very important for people to understand the development of news. Extracting the causal relations between news articles is an inter-document relation extraction task. Existing works on relation extraction cannot solve it well because of the following two reasons: (1) most relation extraction models are intra-document models, which focus on relation extraction between entities. However, news articles are many times longer and more complex than entities, which makes the inter-document relation extraction task harder than intra-document. (2) Existing inter-document relation extraction models rely on similarity information between news articles, which could limit the performance of extraction methods. In this paper, we propose an inter-document model based on storytree information to extract causal relations between news articles. We adopt storytree information to integer linear programming (ILP) and design the storytree constraints for the ILP objective function. Experimental results show that all the constraints are effective and the proposed method outperforms widely used machine learning models and a state-of-the-art deep learning model, with F1 improved by more than 5% on three different datasets. Further analysis shows that five constraints in our model improve the results to varying degrees and the effects on the three datasets are different. The experiment about link features also suggests the positive influence of link information.
format Online
Article
Text
id pubmed-9633033
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer London
record_format MEDLINE/PubMed
spelling pubmed-96330332022-11-04 A storytree-based model for inter-document causal relation extraction from news articles Zhang, Chong Lyu, Jiagao Xu, Ke Knowl Inf Syst Regular Paper With more and more news articles appearing on the Internet, discovering causal relations between news articles is very important for people to understand the development of news. Extracting the causal relations between news articles is an inter-document relation extraction task. Existing works on relation extraction cannot solve it well because of the following two reasons: (1) most relation extraction models are intra-document models, which focus on relation extraction between entities. However, news articles are many times longer and more complex than entities, which makes the inter-document relation extraction task harder than intra-document. (2) Existing inter-document relation extraction models rely on similarity information between news articles, which could limit the performance of extraction methods. In this paper, we propose an inter-document model based on storytree information to extract causal relations between news articles. We adopt storytree information to integer linear programming (ILP) and design the storytree constraints for the ILP objective function. Experimental results show that all the constraints are effective and the proposed method outperforms widely used machine learning models and a state-of-the-art deep learning model, with F1 improved by more than 5% on three different datasets. Further analysis shows that five constraints in our model improve the results to varying degrees and the effects on the three datasets are different. The experiment about link features also suggests the positive influence of link information. Springer London 2022-11-03 2023 /pmc/articles/PMC9633033/ /pubmed/36348735 http://dx.doi.org/10.1007/s10115-022-01781-7 Text en © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2022, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Regular Paper
Zhang, Chong
Lyu, Jiagao
Xu, Ke
A storytree-based model for inter-document causal relation extraction from news articles
title A storytree-based model for inter-document causal relation extraction from news articles
title_full A storytree-based model for inter-document causal relation extraction from news articles
title_fullStr A storytree-based model for inter-document causal relation extraction from news articles
title_full_unstemmed A storytree-based model for inter-document causal relation extraction from news articles
title_short A storytree-based model for inter-document causal relation extraction from news articles
title_sort storytree-based model for inter-document causal relation extraction from news articles
topic Regular Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9633033/
https://www.ncbi.nlm.nih.gov/pubmed/36348735
http://dx.doi.org/10.1007/s10115-022-01781-7
work_keys_str_mv AT zhangchong astorytreebasedmodelforinterdocumentcausalrelationextractionfromnewsarticles
AT lyujiagao astorytreebasedmodelforinterdocumentcausalrelationextractionfromnewsarticles
AT xuke astorytreebasedmodelforinterdocumentcausalrelationextractionfromnewsarticles
AT zhangchong storytreebasedmodelforinterdocumentcausalrelationextractionfromnewsarticles
AT lyujiagao storytreebasedmodelforinterdocumentcausalrelationextractionfromnewsarticles
AT xuke storytreebasedmodelforinterdocumentcausalrelationextractionfromnewsarticles