Cargando…
Can a robot laugh with you?: Shared laughter generation for empathetic spoken dialogue
Spoken dialogue systems must be able to express empathy to achieve natural interaction with human users. However, laughter generation requires a high level of dialogue understanding. Thus, implementing laughter in existing systems, such as in conversational robots, has been challenging. As a first s...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9522467/ https://www.ncbi.nlm.nih.gov/pubmed/36185977 http://dx.doi.org/10.3389/frobt.2022.933261 |
_version_ | 1784800073157181440 |
---|---|
author | Inoue, Koji Lala, Divesh Kawahara, Tatsuya |
author_facet | Inoue, Koji Lala, Divesh Kawahara, Tatsuya |
author_sort | Inoue, Koji |
collection | PubMed |
description | Spoken dialogue systems must be able to express empathy to achieve natural interaction with human users. However, laughter generation requires a high level of dialogue understanding. Thus, implementing laughter in existing systems, such as in conversational robots, has been challenging. As a first step toward solving this problem, rather than generating laughter from user dialogue, we focus on “shared laughter,” where a user laughs using either solo or speech laughs (initial laugh), and the system laughs in turn (response laugh). The proposed system consists of three models: 1) initial laugh detection, 2) shared laughter prediction, and 3) laugh type selection. We trained each model using a human-robot speed dating dialogue corpus. For the first model, a recurrent neural network was applied, and the detection performance achieved an F1 score of 82.6%. The second model used the acoustic and prosodic features of the initial laugh and achieved a prediction accuracy above that of the random prediction. The third model selects the type of system’s response laugh as social or mirthful laugh based on the same features of the initial laugh. We then implemented the full shared laughter generation system in an attentive listening dialogue system and conducted a dialogue listening experiment. The proposed system improved the impression of the dialogue system such as empathy perception compared to a naive baseline without laughter and a reactive system that always responded with only social laughs. We propose that our system can be used for situated robot interaction and also emphasize the need for integrating proper empathetic laughs into conversational robots and agents. |
format | Online Article Text |
id | pubmed-9522467 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-95224672022-09-30 Can a robot laugh with you?: Shared laughter generation for empathetic spoken dialogue Inoue, Koji Lala, Divesh Kawahara, Tatsuya Front Robot AI Robotics and AI Spoken dialogue systems must be able to express empathy to achieve natural interaction with human users. However, laughter generation requires a high level of dialogue understanding. Thus, implementing laughter in existing systems, such as in conversational robots, has been challenging. As a first step toward solving this problem, rather than generating laughter from user dialogue, we focus on “shared laughter,” where a user laughs using either solo or speech laughs (initial laugh), and the system laughs in turn (response laugh). The proposed system consists of three models: 1) initial laugh detection, 2) shared laughter prediction, and 3) laugh type selection. We trained each model using a human-robot speed dating dialogue corpus. For the first model, a recurrent neural network was applied, and the detection performance achieved an F1 score of 82.6%. The second model used the acoustic and prosodic features of the initial laugh and achieved a prediction accuracy above that of the random prediction. The third model selects the type of system’s response laugh as social or mirthful laugh based on the same features of the initial laugh. We then implemented the full shared laughter generation system in an attentive listening dialogue system and conducted a dialogue listening experiment. The proposed system improved the impression of the dialogue system such as empathy perception compared to a naive baseline without laughter and a reactive system that always responded with only social laughs. We propose that our system can be used for situated robot interaction and also emphasize the need for integrating proper empathetic laughs into conversational robots and agents. Frontiers Media S.A. 2022-09-15 /pmc/articles/PMC9522467/ /pubmed/36185977 http://dx.doi.org/10.3389/frobt.2022.933261 Text en Copyright © 2022 Inoue, Lala and Kawahara. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Robotics and AI Inoue, Koji Lala, Divesh Kawahara, Tatsuya Can a robot laugh with you?: Shared laughter generation for empathetic spoken dialogue |
title | Can a robot laugh with you?: Shared laughter generation for empathetic spoken dialogue |
title_full | Can a robot laugh with you?: Shared laughter generation for empathetic spoken dialogue |
title_fullStr | Can a robot laugh with you?: Shared laughter generation for empathetic spoken dialogue |
title_full_unstemmed | Can a robot laugh with you?: Shared laughter generation for empathetic spoken dialogue |
title_short | Can a robot laugh with you?: Shared laughter generation for empathetic spoken dialogue |
title_sort | can a robot laugh with you?: shared laughter generation for empathetic spoken dialogue |
topic | Robotics and AI |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9522467/ https://www.ncbi.nlm.nih.gov/pubmed/36185977 http://dx.doi.org/10.3389/frobt.2022.933261 |
work_keys_str_mv | AT inouekoji canarobotlaughwithyousharedlaughtergenerationforempatheticspokendialogue AT laladivesh canarobotlaughwithyousharedlaughtergenerationforempatheticspokendialogue AT kawaharatatsuya canarobotlaughwithyousharedlaughtergenerationforempatheticspokendialogue |