Cargando…

Trimodal prediction of speaking and listening willingness to help improve turn-changing modeling

Participants in a conversation must carefully monitor the turn-management (speaking and listening) willingness of other conversational partners and adjust their turn-changing behaviors accordingly to have smooth conversation. Many studies have focused on developing actual turn-changing (i.e., next s...

Descripción completa

Detalles Bibliográficos
Autores principales: Ishii, Ryo, Ren, Xutong, Muszynski, Michal, Morency, Louis-Philippe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9624172/
https://www.ncbi.nlm.nih.gov/pubmed/36329758
http://dx.doi.org/10.3389/fpsyg.2022.774547
_version_ 1784822175430082560
author Ishii, Ryo
Ren, Xutong
Muszynski, Michal
Morency, Louis-Philippe
author_facet Ishii, Ryo
Ren, Xutong
Muszynski, Michal
Morency, Louis-Philippe
author_sort Ishii, Ryo
collection PubMed
description Participants in a conversation must carefully monitor the turn-management (speaking and listening) willingness of other conversational partners and adjust their turn-changing behaviors accordingly to have smooth conversation. Many studies have focused on developing actual turn-changing (i.e., next speaker or end-of-turn) models that can predict whether turn-keeping or turn-changing will occur. Participants' verbal and non-verbal behaviors have been used as input features for predictive models. To the best of our knowledge, these studies only model the relationship between participant behavior and turn-changing. Thus, there is no model that takes into account participants' willingness to acquire a turn (turn-management willingness). In this paper, we address the challenge of building such models to predict the willingness of both speakers and listeners. Firstly, we find that dissonance exists between willingness and actual turn-changing. Secondly, we propose predictive models that are based on trimodal inputs, including acoustic, linguistic, and visual cues distilled from conversations. Additionally, we study the impact of modeling willingness to help improve the task of turn-changing prediction. To do so, we introduce a dyadic conversation corpus with annotated scores of speaker/listener turn-management willingness. Our results show that using all three modalities (i.e., acoustic, linguistic, and visual cues) of the speaker and listener is critically important for predicting turn-management willingness. Furthermore, explicitly adding willingness as a prediction task improves the performance of turn-changing prediction. Moreover, turn-management willingness prediction becomes more accurate when this joint prediction of turn-management willingness and turn-changing is performed by using multi-task learning techniques.
format Online
Article
Text
id pubmed-9624172
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-96241722022-11-02 Trimodal prediction of speaking and listening willingness to help improve turn-changing modeling Ishii, Ryo Ren, Xutong Muszynski, Michal Morency, Louis-Philippe Front Psychol Psychology Participants in a conversation must carefully monitor the turn-management (speaking and listening) willingness of other conversational partners and adjust their turn-changing behaviors accordingly to have smooth conversation. Many studies have focused on developing actual turn-changing (i.e., next speaker or end-of-turn) models that can predict whether turn-keeping or turn-changing will occur. Participants' verbal and non-verbal behaviors have been used as input features for predictive models. To the best of our knowledge, these studies only model the relationship between participant behavior and turn-changing. Thus, there is no model that takes into account participants' willingness to acquire a turn (turn-management willingness). In this paper, we address the challenge of building such models to predict the willingness of both speakers and listeners. Firstly, we find that dissonance exists between willingness and actual turn-changing. Secondly, we propose predictive models that are based on trimodal inputs, including acoustic, linguistic, and visual cues distilled from conversations. Additionally, we study the impact of modeling willingness to help improve the task of turn-changing prediction. To do so, we introduce a dyadic conversation corpus with annotated scores of speaker/listener turn-management willingness. Our results show that using all three modalities (i.e., acoustic, linguistic, and visual cues) of the speaker and listener is critically important for predicting turn-management willingness. Furthermore, explicitly adding willingness as a prediction task improves the performance of turn-changing prediction. Moreover, turn-management willingness prediction becomes more accurate when this joint prediction of turn-management willingness and turn-changing is performed by using multi-task learning techniques. Frontiers Media S.A. 2022-10-18 /pmc/articles/PMC9624172/ /pubmed/36329758 http://dx.doi.org/10.3389/fpsyg.2022.774547 Text en Copyright © 2022 Ishii, Ren, Muszynski and Morency. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychology
Ishii, Ryo
Ren, Xutong
Muszynski, Michal
Morency, Louis-Philippe
Trimodal prediction of speaking and listening willingness to help improve turn-changing modeling
title Trimodal prediction of speaking and listening willingness to help improve turn-changing modeling
title_full Trimodal prediction of speaking and listening willingness to help improve turn-changing modeling
title_fullStr Trimodal prediction of speaking and listening willingness to help improve turn-changing modeling
title_full_unstemmed Trimodal prediction of speaking and listening willingness to help improve turn-changing modeling
title_short Trimodal prediction of speaking and listening willingness to help improve turn-changing modeling
title_sort trimodal prediction of speaking and listening willingness to help improve turn-changing modeling
topic Psychology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9624172/
https://www.ncbi.nlm.nih.gov/pubmed/36329758
http://dx.doi.org/10.3389/fpsyg.2022.774547
work_keys_str_mv AT ishiiryo trimodalpredictionofspeakingandlisteningwillingnesstohelpimproveturnchangingmodeling
AT renxutong trimodalpredictionofspeakingandlisteningwillingnesstohelpimproveturnchangingmodeling
AT muszynskimichal trimodalpredictionofspeakingandlisteningwillingnesstohelpimproveturnchangingmodeling
AT morencylouisphilippe trimodalpredictionofspeakingandlisteningwillingnesstohelpimproveturnchangingmodeling