Cargando…

SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants

Intelligent agents that can interact with users using natural language are becoming increasingly common. Sometimes an intelligent agent may not correctly understand a user command or may not perform it properly. In such cases, the user might try a second time by giving the agent another, slightly di...

Descripción completa

Detalles Bibliográficos
Autores principales: Azaria, Amos, Nivasch, Keren
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7582502/
https://www.ncbi.nlm.nih.gov/pubmed/33003380
http://dx.doi.org/10.3390/s20195577
_version_ 1783599206558072832
author Azaria, Amos
Nivasch, Keren
author_facet Azaria, Amos
Nivasch, Keren
author_sort Azaria, Amos
collection PubMed
description Intelligent agents that can interact with users using natural language are becoming increasingly common. Sometimes an intelligent agent may not correctly understand a user command or may not perform it properly. In such cases, the user might try a second time by giving the agent another, slightly different command. Giving an agent the ability to detect such user corrections might help it fix its own mistakes and avoid making them in the future. In this work, we consider the problem of automatically detecting user corrections using deep learning. We develop a multimodal architecture called SAIF, which detects such user corrections, taking as inputs the user’s voice commands as well as their transcripts. Voice inputs allow SAIF to take advantage of sound cues, such as tone, speed, and word emphasis. In addition to sound cues, our model uses transcripts to determine whether a command is a correction to the previous command. Our model also obtains internal input from the agent, indicating whether the previous command was executed successfully or not. Finally, we release a unique dataset in which users interacted with an intelligent agent assistant, by giving it commands. This dataset includes labels on pairs of consecutive commands, which indicate whether the latter command is in fact a correction of the former command. We show that SAIF outperforms current state-of-the-art methods on this dataset.
format Online
Article
Text
id pubmed-7582502
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75825022020-10-29 SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants Azaria, Amos Nivasch, Keren Sensors (Basel) Communication Intelligent agents that can interact with users using natural language are becoming increasingly common. Sometimes an intelligent agent may not correctly understand a user command or may not perform it properly. In such cases, the user might try a second time by giving the agent another, slightly different command. Giving an agent the ability to detect such user corrections might help it fix its own mistakes and avoid making them in the future. In this work, we consider the problem of automatically detecting user corrections using deep learning. We develop a multimodal architecture called SAIF, which detects such user corrections, taking as inputs the user’s voice commands as well as their transcripts. Voice inputs allow SAIF to take advantage of sound cues, such as tone, speed, and word emphasis. In addition to sound cues, our model uses transcripts to determine whether a command is a correction to the previous command. Our model also obtains internal input from the agent, indicating whether the previous command was executed successfully or not. Finally, we release a unique dataset in which users interacted with an intelligent agent assistant, by giving it commands. This dataset includes labels on pairs of consecutive commands, which indicate whether the latter command is in fact a correction of the former command. We show that SAIF outperforms current state-of-the-art methods on this dataset. MDPI 2020-09-29 /pmc/articles/PMC7582502/ /pubmed/33003380 http://dx.doi.org/10.3390/s20195577 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Communication
Azaria, Amos
Nivasch, Keren
SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants
title SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants
title_full SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants
title_fullStr SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants
title_full_unstemmed SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants
title_short SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants
title_sort saif: a correction-detection deep-learning architecture for personal assistants
topic Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7582502/
https://www.ncbi.nlm.nih.gov/pubmed/33003380
http://dx.doi.org/10.3390/s20195577
work_keys_str_mv AT azariaamos saifacorrectiondetectiondeeplearningarchitectureforpersonalassistants
AT nivaschkeren saifacorrectiondetectiondeeplearningarchitectureforpersonalassistants