Cargando…

GenPADS: Reinforcing politeness in an end-to-end dialogue system

In a task-oriented dialogue setting, user’s mood and demands can change in an ongoing dialogue, which may lead to a non-informative conversation or may result in conversation drop-off. To rectify such scenarios, a conversational agent should be able to learn the user’s behaviour online, and form inf...

Descripción completa

Detalles Bibliográficos
Autores principales:	Mishra, Kshitij, Firdaus, Mauajama, Ekbal, Asif
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2023
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9821416/ https://www.ncbi.nlm.nih.gov/pubmed/36607963 http://dx.doi.org/10.1371/journal.pone.0278323

_version_	1784865691778678784
author	Mishra, Kshitij Firdaus, Mauajama Ekbal, Asif
author_facet	Mishra, Kshitij Firdaus, Mauajama Ekbal, Asif
author_sort	Mishra, Kshitij
collection	PubMed
description	In a task-oriented dialogue setting, user’s mood and demands can change in an ongoing dialogue, which may lead to a non-informative conversation or may result in conversation drop-off. To rectify such scenarios, a conversational agent should be able to learn the user’s behaviour online, and form informative, empathetic and interactive responses. To incorporate these three aspects, we propose a novel end-to-end dialogue system GenPADS. First, we build and train two models, viz. a politeness classifier to extract polite information present in user’s and agent’s utterances and a generation model (G) to generate varying but semantically correct responses. We then incorporate both of these models in a reinforcement learning (RL) setting using two different politeness oriented reward algorithms to adapt and generate polite responses. To train our politeness classifier, we annotate recently released Taskmaster dataset into four fine-grained classes depicting politeness and impoliteness. Further, to train our generator model, we prepare a GenDD dataset using the same Taskmaster dataset. Lastly, we train GenPADS and perform automatic and human evaluation by building seven different user simulators. Detailed analysis reveals that GenPADS performs better than the two considered baselines,viz. a transformer based seq2seq generator model for user’s and agent’s utterance and a retrieval based politeness adaptive dialogue system (PADS).
format	Online Article Text
id	pubmed-9821416
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-98214162023-01-07 GenPADS: Reinforcing politeness in an end-to-end dialogue system Mishra, Kshitij Firdaus, Mauajama Ekbal, Asif PLoS One Research Article In a task-oriented dialogue setting, user’s mood and demands can change in an ongoing dialogue, which may lead to a non-informative conversation or may result in conversation drop-off. To rectify such scenarios, a conversational agent should be able to learn the user’s behaviour online, and form informative, empathetic and interactive responses. To incorporate these three aspects, we propose a novel end-to-end dialogue system GenPADS. First, we build and train two models, viz. a politeness classifier to extract polite information present in user’s and agent’s utterances and a generation model (G) to generate varying but semantically correct responses. We then incorporate both of these models in a reinforcement learning (RL) setting using two different politeness oriented reward algorithms to adapt and generate polite responses. To train our politeness classifier, we annotate recently released Taskmaster dataset into four fine-grained classes depicting politeness and impoliteness. Further, to train our generator model, we prepare a GenDD dataset using the same Taskmaster dataset. Lastly, we train GenPADS and perform automatic and human evaluation by building seven different user simulators. Detailed analysis reveals that GenPADS performs better than the two considered baselines,viz. a transformer based seq2seq generator model for user’s and agent’s utterance and a retrieval based politeness adaptive dialogue system (PADS). Public Library of Science 2023-01-06 /pmc/articles/PMC9821416/ /pubmed/36607963 http://dx.doi.org/10.1371/journal.pone.0278323 Text en © 2023 Mishra et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Mishra, Kshitij Firdaus, Mauajama Ekbal, Asif GenPADS: Reinforcing politeness in an end-to-end dialogue system
title	GenPADS: Reinforcing politeness in an end-to-end dialogue system
title_full	GenPADS: Reinforcing politeness in an end-to-end dialogue system
title_fullStr	GenPADS: Reinforcing politeness in an end-to-end dialogue system
title_full_unstemmed	GenPADS: Reinforcing politeness in an end-to-end dialogue system
title_short	GenPADS: Reinforcing politeness in an end-to-end dialogue system
title_sort	genpads: reinforcing politeness in an end-to-end dialogue system
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9821416/ https://www.ncbi.nlm.nih.gov/pubmed/36607963 http://dx.doi.org/10.1371/journal.pone.0278323
work_keys_str_mv	AT mishrakshitij genpadsreinforcingpolitenessinanendtoenddialoguesystem AT firdausmauajama genpadsreinforcingpolitenessinanendtoenddialoguesystem AT ekbalasif genpadsreinforcingpolitenessinanendtoenddialoguesystem

GenPADS: Reinforcing politeness in an end-to-end dialogue system

Ejemplares similares