Cargando…

Semi-supervised Extractive Question Summarization Using Question-Answer Pairs

Neural extractive summarization methods often require much labeled training data, for which headlines or lead summaries of news articles can sometimes be used. Such directly useful summaries are not always available, however, especially for user-generated content, such as questions posted on communi...

Descripción completa

Detalles Bibliográficos
Autores principales: Machida, Kazuya, Ishigaki, Tatsuya, Kobayashi, Hayato, Takamura, Hiroya, Okumura, Manabu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7148067/
http://dx.doi.org/10.1007/978-3-030-45442-5_32
_version_ 1783520524111970304
author Machida, Kazuya
Ishigaki, Tatsuya
Kobayashi, Hayato
Takamura, Hiroya
Okumura, Manabu
author_facet Machida, Kazuya
Ishigaki, Tatsuya
Kobayashi, Hayato
Takamura, Hiroya
Okumura, Manabu
author_sort Machida, Kazuya
collection PubMed
description Neural extractive summarization methods often require much labeled training data, for which headlines or lead summaries of news articles can sometimes be used. Such directly useful summaries are not always available, however, especially for user-generated content, such as questions posted on community question answering services. In this paper, we address an extractive summarization (i.e., headline extraction) task for such questions as a case study and consider how to alleviate the problem by using question-answer pairs, instead of missing-headline pairs. To this end, we propose a framework to examine how to use such unlabeled paired data from the viewpoint of training methods. Experimental results show that multi-task training performs well with undersampling and distant supervision.
format Online
Article
Text
id pubmed-7148067
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-71480672020-04-13 Semi-supervised Extractive Question Summarization Using Question-Answer Pairs Machida, Kazuya Ishigaki, Tatsuya Kobayashi, Hayato Takamura, Hiroya Okumura, Manabu Advances in Information Retrieval Article Neural extractive summarization methods often require much labeled training data, for which headlines or lead summaries of news articles can sometimes be used. Such directly useful summaries are not always available, however, especially for user-generated content, such as questions posted on community question answering services. In this paper, we address an extractive summarization (i.e., headline extraction) task for such questions as a case study and consider how to alleviate the problem by using question-answer pairs, instead of missing-headline pairs. To this end, we propose a framework to examine how to use such unlabeled paired data from the viewpoint of training methods. Experimental results show that multi-task training performs well with undersampling and distant supervision. 2020-03-24 /pmc/articles/PMC7148067/ http://dx.doi.org/10.1007/978-3-030-45442-5_32 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Machida, Kazuya
Ishigaki, Tatsuya
Kobayashi, Hayato
Takamura, Hiroya
Okumura, Manabu
Semi-supervised Extractive Question Summarization Using Question-Answer Pairs
title Semi-supervised Extractive Question Summarization Using Question-Answer Pairs
title_full Semi-supervised Extractive Question Summarization Using Question-Answer Pairs
title_fullStr Semi-supervised Extractive Question Summarization Using Question-Answer Pairs
title_full_unstemmed Semi-supervised Extractive Question Summarization Using Question-Answer Pairs
title_short Semi-supervised Extractive Question Summarization Using Question-Answer Pairs
title_sort semi-supervised extractive question summarization using question-answer pairs
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7148067/
http://dx.doi.org/10.1007/978-3-030-45442-5_32
work_keys_str_mv AT machidakazuya semisupervisedextractivequestionsummarizationusingquestionanswerpairs
AT ishigakitatsuya semisupervisedextractivequestionsummarizationusingquestionanswerpairs
AT kobayashihayato semisupervisedextractivequestionsummarizationusingquestionanswerpairs
AT takamurahiroya semisupervisedextractivequestionsummarizationusingquestionanswerpairs
AT okumuramanabu semisupervisedextractivequestionsummarizationusingquestionanswerpairs