Cargando…
Adapting Static and Contextual Representations for Policy Gradient-Based Summarization
Considering the ever-growing volume of electronic documents made available in our daily lives, the need for an efficient tool to capture their gist increases as well. Automatic text summarization, which is a process of shortening long text and extracting valuable information, has been of great inter...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10181762/ https://www.ncbi.nlm.nih.gov/pubmed/37177717 http://dx.doi.org/10.3390/s23094513 |
_version_ | 1785041651993935872 |
---|---|
author | Lin, Ching-Sheng Jwo, Jung-Sing Lee, Cheng-Hsiung |
author_facet | Lin, Ching-Sheng Jwo, Jung-Sing Lee, Cheng-Hsiung |
author_sort | Lin, Ching-Sheng |
collection | PubMed |
description | Considering the ever-growing volume of electronic documents made available in our daily lives, the need for an efficient tool to capture their gist increases as well. Automatic text summarization, which is a process of shortening long text and extracting valuable information, has been of great interest for decades. Due to the difficulties of semantic understanding and the requirement of large training data, the development of this research field is still challenging and worth investigating. In this paper, we propose an automated text summarization approach with the adaptation of static and contextual representations based on an extractive approach to address the research gaps. To better obtain the semantic expression of the given text, we explore the combination of static embeddings from GloVe (Global Vectors) and the contextual embeddings from BERT (Bidirectional Encoder Representations from Transformer) and GPT (Generative Pre-trained Transformer) based models. In order to reduce human annotation costs, we employ policy gradient reinforcement learning to perform unsupervised training. We conduct empirical studies on the public dataset, Gigaword. The experimental results show that our approach achieves promising performance and is competitive with various state-of-the-art approaches. |
format | Online Article Text |
id | pubmed-10181762 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-101817622023-05-13 Adapting Static and Contextual Representations for Policy Gradient-Based Summarization Lin, Ching-Sheng Jwo, Jung-Sing Lee, Cheng-Hsiung Sensors (Basel) Article Considering the ever-growing volume of electronic documents made available in our daily lives, the need for an efficient tool to capture their gist increases as well. Automatic text summarization, which is a process of shortening long text and extracting valuable information, has been of great interest for decades. Due to the difficulties of semantic understanding and the requirement of large training data, the development of this research field is still challenging and worth investigating. In this paper, we propose an automated text summarization approach with the adaptation of static and contextual representations based on an extractive approach to address the research gaps. To better obtain the semantic expression of the given text, we explore the combination of static embeddings from GloVe (Global Vectors) and the contextual embeddings from BERT (Bidirectional Encoder Representations from Transformer) and GPT (Generative Pre-trained Transformer) based models. In order to reduce human annotation costs, we employ policy gradient reinforcement learning to perform unsupervised training. We conduct empirical studies on the public dataset, Gigaword. The experimental results show that our approach achieves promising performance and is competitive with various state-of-the-art approaches. MDPI 2023-05-05 /pmc/articles/PMC10181762/ /pubmed/37177717 http://dx.doi.org/10.3390/s23094513 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Lin, Ching-Sheng Jwo, Jung-Sing Lee, Cheng-Hsiung Adapting Static and Contextual Representations for Policy Gradient-Based Summarization |
title | Adapting Static and Contextual Representations for Policy Gradient-Based Summarization |
title_full | Adapting Static and Contextual Representations for Policy Gradient-Based Summarization |
title_fullStr | Adapting Static and Contextual Representations for Policy Gradient-Based Summarization |
title_full_unstemmed | Adapting Static and Contextual Representations for Policy Gradient-Based Summarization |
title_short | Adapting Static and Contextual Representations for Policy Gradient-Based Summarization |
title_sort | adapting static and contextual representations for policy gradient-based summarization |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10181762/ https://www.ncbi.nlm.nih.gov/pubmed/37177717 http://dx.doi.org/10.3390/s23094513 |
work_keys_str_mv | AT linchingsheng adaptingstaticandcontextualrepresentationsforpolicygradientbasedsummarization AT jwojungsing adaptingstaticandcontextualrepresentationsforpolicygradientbasedsummarization AT leechenghsiung adaptingstaticandcontextualrepresentationsforpolicygradientbasedsummarization |