Cargando…

Improving extractive document summarization with sentence centrality

Extractive document summarization (EDS) is usually seen as a sequence labeling task, which extracts sentences from a document one by one to form a summary. However, extracting sentences separately ignores the relationship between the sentences and documents. One solution is to use sentence position...

Descripción completa

Detalles Bibliográficos
Autores principales: Gong, Shuai, Zhu, Zhenfang, Qi, Jiangtao, Tong, Chunling, Lu, Qiang, Wu, Wenqing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9307201/
https://www.ncbi.nlm.nih.gov/pubmed/35867732
http://dx.doi.org/10.1371/journal.pone.0268278
_version_ 1784752708156129280
author Gong, Shuai
Zhu, Zhenfang
Qi, Jiangtao
Tong, Chunling
Lu, Qiang
Wu, Wenqing
author_facet Gong, Shuai
Zhu, Zhenfang
Qi, Jiangtao
Tong, Chunling
Lu, Qiang
Wu, Wenqing
author_sort Gong, Shuai
collection PubMed
description Extractive document summarization (EDS) is usually seen as a sequence labeling task, which extracts sentences from a document one by one to form a summary. However, extracting sentences separately ignores the relationship between the sentences and documents. One solution is to use sentence position information to enhance sentence representation, but this will cause the sentence-leading bias problem, especially in news datasets. In this paper, we propose a novel sentence centrality for the EDS task to address these two problems. The sentence centrality is based on directed graphs, while reflecting the sentence-document relationship, it also reflects the sentence position information in the document. We implicitly strengthen the relevance of sentences and documents by using sentence centrality to enhance sentence representation. Notably, we replaced the sentence position information with sentence centrality to reduce sentence-leading bias without causing model performance degradation. Experiments on the CNN/Daily Mail dataset showed that EDS models with sentence centrality significantly improved compared with baseline models.
format Online
Article
Text
id pubmed-9307201
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-93072012022-07-23 Improving extractive document summarization with sentence centrality Gong, Shuai Zhu, Zhenfang Qi, Jiangtao Tong, Chunling Lu, Qiang Wu, Wenqing PLoS One Research Article Extractive document summarization (EDS) is usually seen as a sequence labeling task, which extracts sentences from a document one by one to form a summary. However, extracting sentences separately ignores the relationship between the sentences and documents. One solution is to use sentence position information to enhance sentence representation, but this will cause the sentence-leading bias problem, especially in news datasets. In this paper, we propose a novel sentence centrality for the EDS task to address these two problems. The sentence centrality is based on directed graphs, while reflecting the sentence-document relationship, it also reflects the sentence position information in the document. We implicitly strengthen the relevance of sentences and documents by using sentence centrality to enhance sentence representation. Notably, we replaced the sentence position information with sentence centrality to reduce sentence-leading bias without causing model performance degradation. Experiments on the CNN/Daily Mail dataset showed that EDS models with sentence centrality significantly improved compared with baseline models. Public Library of Science 2022-07-22 /pmc/articles/PMC9307201/ /pubmed/35867732 http://dx.doi.org/10.1371/journal.pone.0268278 Text en © 2022 Gong et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Gong, Shuai
Zhu, Zhenfang
Qi, Jiangtao
Tong, Chunling
Lu, Qiang
Wu, Wenqing
Improving extractive document summarization with sentence centrality
title Improving extractive document summarization with sentence centrality
title_full Improving extractive document summarization with sentence centrality
title_fullStr Improving extractive document summarization with sentence centrality
title_full_unstemmed Improving extractive document summarization with sentence centrality
title_short Improving extractive document summarization with sentence centrality
title_sort improving extractive document summarization with sentence centrality
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9307201/
https://www.ncbi.nlm.nih.gov/pubmed/35867732
http://dx.doi.org/10.1371/journal.pone.0268278
work_keys_str_mv AT gongshuai improvingextractivedocumentsummarizationwithsentencecentrality
AT zhuzhenfang improvingextractivedocumentsummarizationwithsentencecentrality
AT qijiangtao improvingextractivedocumentsummarizationwithsentencecentrality
AT tongchunling improvingextractivedocumentsummarizationwithsentencecentrality
AT luqiang improvingextractivedocumentsummarizationwithsentencecentrality
AT wuwenqing improvingextractivedocumentsummarizationwithsentencecentrality