Cargando…

Adapting Static and Contextual Representations for Policy Gradient-Based Summarization

Considering the ever-growing volume of electronic documents made available in our daily lives, the need for an efficient tool to capture their gist increases as well. Automatic text summarization, which is a process of shortening long text and extracting valuable information, has been of great inter...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Ching-Sheng, Jwo, Jung-Sing, Lee, Cheng-Hsiung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10181762/
https://www.ncbi.nlm.nih.gov/pubmed/37177717
http://dx.doi.org/10.3390/s23094513
_version_ 1785041651993935872
author Lin, Ching-Sheng
Jwo, Jung-Sing
Lee, Cheng-Hsiung
author_facet Lin, Ching-Sheng
Jwo, Jung-Sing
Lee, Cheng-Hsiung
author_sort Lin, Ching-Sheng
collection PubMed
description Considering the ever-growing volume of electronic documents made available in our daily lives, the need for an efficient tool to capture their gist increases as well. Automatic text summarization, which is a process of shortening long text and extracting valuable information, has been of great interest for decades. Due to the difficulties of semantic understanding and the requirement of large training data, the development of this research field is still challenging and worth investigating. In this paper, we propose an automated text summarization approach with the adaptation of static and contextual representations based on an extractive approach to address the research gaps. To better obtain the semantic expression of the given text, we explore the combination of static embeddings from GloVe (Global Vectors) and the contextual embeddings from BERT (Bidirectional Encoder Representations from Transformer) and GPT (Generative Pre-trained Transformer) based models. In order to reduce human annotation costs, we employ policy gradient reinforcement learning to perform unsupervised training. We conduct empirical studies on the public dataset, Gigaword. The experimental results show that our approach achieves promising performance and is competitive with various state-of-the-art approaches.
format Online
Article
Text
id pubmed-10181762
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-101817622023-05-13 Adapting Static and Contextual Representations for Policy Gradient-Based Summarization Lin, Ching-Sheng Jwo, Jung-Sing Lee, Cheng-Hsiung Sensors (Basel) Article Considering the ever-growing volume of electronic documents made available in our daily lives, the need for an efficient tool to capture their gist increases as well. Automatic text summarization, which is a process of shortening long text and extracting valuable information, has been of great interest for decades. Due to the difficulties of semantic understanding and the requirement of large training data, the development of this research field is still challenging and worth investigating. In this paper, we propose an automated text summarization approach with the adaptation of static and contextual representations based on an extractive approach to address the research gaps. To better obtain the semantic expression of the given text, we explore the combination of static embeddings from GloVe (Global Vectors) and the contextual embeddings from BERT (Bidirectional Encoder Representations from Transformer) and GPT (Generative Pre-trained Transformer) based models. In order to reduce human annotation costs, we employ policy gradient reinforcement learning to perform unsupervised training. We conduct empirical studies on the public dataset, Gigaword. The experimental results show that our approach achieves promising performance and is competitive with various state-of-the-art approaches. MDPI 2023-05-05 /pmc/articles/PMC10181762/ /pubmed/37177717 http://dx.doi.org/10.3390/s23094513 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Lin, Ching-Sheng
Jwo, Jung-Sing
Lee, Cheng-Hsiung
Adapting Static and Contextual Representations for Policy Gradient-Based Summarization
title Adapting Static and Contextual Representations for Policy Gradient-Based Summarization
title_full Adapting Static and Contextual Representations for Policy Gradient-Based Summarization
title_fullStr Adapting Static and Contextual Representations for Policy Gradient-Based Summarization
title_full_unstemmed Adapting Static and Contextual Representations for Policy Gradient-Based Summarization
title_short Adapting Static and Contextual Representations for Policy Gradient-Based Summarization
title_sort adapting static and contextual representations for policy gradient-based summarization
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10181762/
https://www.ncbi.nlm.nih.gov/pubmed/37177717
http://dx.doi.org/10.3390/s23094513
work_keys_str_mv AT linchingsheng adaptingstaticandcontextualrepresentationsforpolicygradientbasedsummarization
AT jwojungsing adaptingstaticandcontextualrepresentationsforpolicygradientbasedsummarization
AT leechenghsiung adaptingstaticandcontextualrepresentationsforpolicygradientbasedsummarization