Cargando…
Improving extractive document summarization with sentence centrality
Extractive document summarization (EDS) is usually seen as a sequence labeling task, which extracts sentences from a document one by one to form a summary. However, extracting sentences separately ignores the relationship between the sentences and documents. One solution is to use sentence position...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9307201/ https://www.ncbi.nlm.nih.gov/pubmed/35867732 http://dx.doi.org/10.1371/journal.pone.0268278 |
_version_ | 1784752708156129280 |
---|---|
author | Gong, Shuai Zhu, Zhenfang Qi, Jiangtao Tong, Chunling Lu, Qiang Wu, Wenqing |
author_facet | Gong, Shuai Zhu, Zhenfang Qi, Jiangtao Tong, Chunling Lu, Qiang Wu, Wenqing |
author_sort | Gong, Shuai |
collection | PubMed |
description | Extractive document summarization (EDS) is usually seen as a sequence labeling task, which extracts sentences from a document one by one to form a summary. However, extracting sentences separately ignores the relationship between the sentences and documents. One solution is to use sentence position information to enhance sentence representation, but this will cause the sentence-leading bias problem, especially in news datasets. In this paper, we propose a novel sentence centrality for the EDS task to address these two problems. The sentence centrality is based on directed graphs, while reflecting the sentence-document relationship, it also reflects the sentence position information in the document. We implicitly strengthen the relevance of sentences and documents by using sentence centrality to enhance sentence representation. Notably, we replaced the sentence position information with sentence centrality to reduce sentence-leading bias without causing model performance degradation. Experiments on the CNN/Daily Mail dataset showed that EDS models with sentence centrality significantly improved compared with baseline models. |
format | Online Article Text |
id | pubmed-9307201 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-93072012022-07-23 Improving extractive document summarization with sentence centrality Gong, Shuai Zhu, Zhenfang Qi, Jiangtao Tong, Chunling Lu, Qiang Wu, Wenqing PLoS One Research Article Extractive document summarization (EDS) is usually seen as a sequence labeling task, which extracts sentences from a document one by one to form a summary. However, extracting sentences separately ignores the relationship between the sentences and documents. One solution is to use sentence position information to enhance sentence representation, but this will cause the sentence-leading bias problem, especially in news datasets. In this paper, we propose a novel sentence centrality for the EDS task to address these two problems. The sentence centrality is based on directed graphs, while reflecting the sentence-document relationship, it also reflects the sentence position information in the document. We implicitly strengthen the relevance of sentences and documents by using sentence centrality to enhance sentence representation. Notably, we replaced the sentence position information with sentence centrality to reduce sentence-leading bias without causing model performance degradation. Experiments on the CNN/Daily Mail dataset showed that EDS models with sentence centrality significantly improved compared with baseline models. Public Library of Science 2022-07-22 /pmc/articles/PMC9307201/ /pubmed/35867732 http://dx.doi.org/10.1371/journal.pone.0268278 Text en © 2022 Gong et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Gong, Shuai Zhu, Zhenfang Qi, Jiangtao Tong, Chunling Lu, Qiang Wu, Wenqing Improving extractive document summarization with sentence centrality |
title | Improving extractive document summarization with sentence centrality |
title_full | Improving extractive document summarization with sentence centrality |
title_fullStr | Improving extractive document summarization with sentence centrality |
title_full_unstemmed | Improving extractive document summarization with sentence centrality |
title_short | Improving extractive document summarization with sentence centrality |
title_sort | improving extractive document summarization with sentence centrality |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9307201/ https://www.ncbi.nlm.nih.gov/pubmed/35867732 http://dx.doi.org/10.1371/journal.pone.0268278 |
work_keys_str_mv | AT gongshuai improvingextractivedocumentsummarizationwithsentencecentrality AT zhuzhenfang improvingextractivedocumentsummarizationwithsentencecentrality AT qijiangtao improvingextractivedocumentsummarizationwithsentencecentrality AT tongchunling improvingextractivedocumentsummarizationwithsentencecentrality AT luqiang improvingextractivedocumentsummarizationwithsentencecentrality AT wuwenqing improvingextractivedocumentsummarizationwithsentencecentrality |