Cargando…

Reinforcing personalized persuasion in task-oriented virtual sales assistant

PURPOSE: Existing task-oriented virtual agents can assist users with simple tasks like ticket booking, hotel reservations, etc. effectively and with high confidence. These virtual assistants, however, assume specific, predictable end-user behavior, such as predefined/servable objectives, which resul...

Descripción completa

Detalles Bibliográficos
Autores principales: Raut, Aritra, Tiwari, Abhisek, Das, Subrata, Saha, Sriparna, Maitra, Anutosh, Ramnani, Roshni, Sengupta, Shubhashis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9815581/
https://www.ncbi.nlm.nih.gov/pubmed/36602995
http://dx.doi.org/10.1371/journal.pone.0275750
Descripción
Sumario:PURPOSE: Existing task-oriented virtual agents can assist users with simple tasks like ticket booking, hotel reservations, etc. effectively and with high confidence. These virtual assistants, however, assume specific, predictable end-user behavior, such as predefined/servable objectives, which results in conversation failures in challenging situations, such as when goals are unavailable. METHODOLOGY: Inspired by the practice and its efficacy, we propose an end-to-end framework for task-oriented persuasive dialogue generation that combines pre-training and reinforcement learning for generating context-aware persuasive responses. We utilize four novel rewards to improve consistency and repetitiveness in generated responses. Additionally, a meta-learning strategy has also been utilized to make the model parameters better for domain adaptation. Furthermore, we also curate a personalized persuasive dialogue (PPD) corpus, which contains utterance-level intent, slot, sentiment, and persuasion strategy annotation. FINDINGS: The obtained results and detailed analysis firmly establish the effectiveness of the proposed persuasive virtual assistant over traditional task-oriented virtual assistants. The proposed framework considerably increases the quality of dialogue generation in terms of consistency and repetitiveness. Additionally, our experiment with a few shot and zero-shot settings proves that our meta-learned model learns to quickly adopt new domains with a few or even zero no. of training epochs. It outperforms the non-meta-learning-based approaches keeping the base model constant. ORIGINALITY: To the best of our knowledge, this is the first effort to improve a task-oriented virtual agent’s persuasiveness and domain adaptation.