Semi-supervised Extractive Question Summarization Using Question-Answer Pairs

Bibliographic Details
Main authors: Machida, Kazuya; Ishigaki, Tatsuya; Kobayashi, Hayato; Takamura, Hiroya; Okumura, Manabu
Format: Online Article Text
Language: English
Published: 2020
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7148067/
http://dx.doi.org/10.1007/978-3-030-45442-5_32
Description
Summary: Neural extractive summarization methods often require much labeled training data, for which headlines or lead summaries of news articles can sometimes be used. Such directly useful summaries are not always available, however, especially for user-generated content, such as questions posted on community question answering services. In this paper, we address an extractive summarization (i.e., headline extraction) task for such questions as a case study and consider how to alleviate the problem by using question-answer pairs, instead of missing-headline pairs. To this end, we propose a framework to examine how to use such unlabeled paired data from the viewpoint of training methods. Experimental results show that multi-task training performs well with undersampling and distant supervision.