Cargando…
Semi-supervised Extractive Question Summarization Using Question-Answer Pairs
Neural extractive summarization methods often require much labeled training data, for which headlines or lead summaries of news articles can sometimes be used. Such directly useful summaries are not always available, however, especially for user-generated content, such as questions posted on communi...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7148067/ http://dx.doi.org/10.1007/978-3-030-45442-5_32 |
_version_ | 1783520524111970304 |
---|---|
author | Machida, Kazuya Ishigaki, Tatsuya Kobayashi, Hayato Takamura, Hiroya Okumura, Manabu |
author_facet | Machida, Kazuya Ishigaki, Tatsuya Kobayashi, Hayato Takamura, Hiroya Okumura, Manabu |
author_sort | Machida, Kazuya |
collection | PubMed |
description | Neural extractive summarization methods often require much labeled training data, for which headlines or lead summaries of news articles can sometimes be used. Such directly useful summaries are not always available, however, especially for user-generated content, such as questions posted on community question answering services. In this paper, we address an extractive summarization (i.e., headline extraction) task for such questions as a case study and consider how to alleviate the problem by using question-answer pairs, instead of missing-headline pairs. To this end, we propose a framework to examine how to use such unlabeled paired data from the viewpoint of training methods. Experimental results show that multi-task training performs well with undersampling and distant supervision. |
format | Online Article Text |
id | pubmed-7148067 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-71480672020-04-13 Semi-supervised Extractive Question Summarization Using Question-Answer Pairs Machida, Kazuya Ishigaki, Tatsuya Kobayashi, Hayato Takamura, Hiroya Okumura, Manabu Advances in Information Retrieval Article Neural extractive summarization methods often require much labeled training data, for which headlines or lead summaries of news articles can sometimes be used. Such directly useful summaries are not always available, however, especially for user-generated content, such as questions posted on community question answering services. In this paper, we address an extractive summarization (i.e., headline extraction) task for such questions as a case study and consider how to alleviate the problem by using question-answer pairs, instead of missing-headline pairs. To this end, we propose a framework to examine how to use such unlabeled paired data from the viewpoint of training methods. Experimental results show that multi-task training performs well with undersampling and distant supervision. 2020-03-24 /pmc/articles/PMC7148067/ http://dx.doi.org/10.1007/978-3-030-45442-5_32 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Machida, Kazuya Ishigaki, Tatsuya Kobayashi, Hayato Takamura, Hiroya Okumura, Manabu Semi-supervised Extractive Question Summarization Using Question-Answer Pairs |
title | Semi-supervised Extractive Question Summarization Using Question-Answer Pairs |
title_full | Semi-supervised Extractive Question Summarization Using Question-Answer Pairs |
title_fullStr | Semi-supervised Extractive Question Summarization Using Question-Answer Pairs |
title_full_unstemmed | Semi-supervised Extractive Question Summarization Using Question-Answer Pairs |
title_short | Semi-supervised Extractive Question Summarization Using Question-Answer Pairs |
title_sort | semi-supervised extractive question summarization using question-answer pairs |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7148067/ http://dx.doi.org/10.1007/978-3-030-45442-5_32 |
work_keys_str_mv | AT machidakazuya semisupervisedextractivequestionsummarizationusingquestionanswerpairs AT ishigakitatsuya semisupervisedextractivequestionsummarizationusingquestionanswerpairs AT kobayashihayato semisupervisedextractivequestionsummarizationusingquestionanswerpairs AT takamurahiroya semisupervisedextractivequestionsummarizationusingquestionanswerpairs AT okumuramanabu semisupervisedextractivequestionsummarizationusingquestionanswerpairs |