Cargando…

SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants

Intelligent agents that can interact with users using natural language are becoming increasingly common. Sometimes an intelligent agent may not correctly understand a user command or may not perform it properly. In such cases, the user might try a second time by giving the agent another, slightly di...

Descripción completa

Detalles Bibliográficos
Autores principales:	Azaria, Amos, Nivasch, Keren
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Communication
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7582502/ https://www.ncbi.nlm.nih.gov/pubmed/33003380 http://dx.doi.org/10.3390/s20195577

_version_	1783599206558072832
author	Azaria, Amos Nivasch, Keren
author_facet	Azaria, Amos Nivasch, Keren
author_sort	Azaria, Amos
collection	PubMed
description	Intelligent agents that can interact with users using natural language are becoming increasingly common. Sometimes an intelligent agent may not correctly understand a user command or may not perform it properly. In such cases, the user might try a second time by giving the agent another, slightly different command. Giving an agent the ability to detect such user corrections might help it fix its own mistakes and avoid making them in the future. In this work, we consider the problem of automatically detecting user corrections using deep learning. We develop a multimodal architecture called SAIF, which detects such user corrections, taking as inputs the user’s voice commands as well as their transcripts. Voice inputs allow SAIF to take advantage of sound cues, such as tone, speed, and word emphasis. In addition to sound cues, our model uses transcripts to determine whether a command is a correction to the previous command. Our model also obtains internal input from the agent, indicating whether the previous command was executed successfully or not. Finally, we release a unique dataset in which users interacted with an intelligent agent assistant, by giving it commands. This dataset includes labels on pairs of consecutive commands, which indicate whether the latter command is in fact a correction of the former command. We show that SAIF outperforms current state-of-the-art methods on this dataset.
format	Online Article Text
id	pubmed-7582502
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-75825022020-10-29 SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants Azaria, Amos Nivasch, Keren Sensors (Basel) Communication Intelligent agents that can interact with users using natural language are becoming increasingly common. Sometimes an intelligent agent may not correctly understand a user command or may not perform it properly. In such cases, the user might try a second time by giving the agent another, slightly different command. Giving an agent the ability to detect such user corrections might help it fix its own mistakes and avoid making them in the future. In this work, we consider the problem of automatically detecting user corrections using deep learning. We develop a multimodal architecture called SAIF, which detects such user corrections, taking as inputs the user’s voice commands as well as their transcripts. Voice inputs allow SAIF to take advantage of sound cues, such as tone, speed, and word emphasis. In addition to sound cues, our model uses transcripts to determine whether a command is a correction to the previous command. Our model also obtains internal input from the agent, indicating whether the previous command was executed successfully or not. Finally, we release a unique dataset in which users interacted with an intelligent agent assistant, by giving it commands. This dataset includes labels on pairs of consecutive commands, which indicate whether the latter command is in fact a correction of the former command. We show that SAIF outperforms current state-of-the-art methods on this dataset. MDPI 2020-09-29 /pmc/articles/PMC7582502/ /pubmed/33003380 http://dx.doi.org/10.3390/s20195577 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Communication Azaria, Amos Nivasch, Keren SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants
title	SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants
title_full	SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants
title_fullStr	SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants
title_full_unstemmed	SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants
title_short	SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants
title_sort	saif: a correction-detection deep-learning architecture for personal assistants
topic	Communication
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7582502/ https://www.ncbi.nlm.nih.gov/pubmed/33003380 http://dx.doi.org/10.3390/s20195577
work_keys_str_mv	AT azariaamos saifacorrectiondetectiondeeplearningarchitectureforpersonalassistants AT nivaschkeren saifacorrectiondetectiondeeplearningarchitectureforpersonalassistants

SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants

Ejemplares similares