Cargando…
SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants
Intelligent agents that can interact with users using natural language are becoming increasingly common. Sometimes an intelligent agent may not correctly understand a user command or may not perform it properly. In such cases, the user might try a second time by giving the agent another, slightly di...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7582502/ https://www.ncbi.nlm.nih.gov/pubmed/33003380 http://dx.doi.org/10.3390/s20195577 |
_version_ | 1783599206558072832 |
---|---|
author | Azaria, Amos Nivasch, Keren |
author_facet | Azaria, Amos Nivasch, Keren |
author_sort | Azaria, Amos |
collection | PubMed |
description | Intelligent agents that can interact with users using natural language are becoming increasingly common. Sometimes an intelligent agent may not correctly understand a user command or may not perform it properly. In such cases, the user might try a second time by giving the agent another, slightly different command. Giving an agent the ability to detect such user corrections might help it fix its own mistakes and avoid making them in the future. In this work, we consider the problem of automatically detecting user corrections using deep learning. We develop a multimodal architecture called SAIF, which detects such user corrections, taking as inputs the user’s voice commands as well as their transcripts. Voice inputs allow SAIF to take advantage of sound cues, such as tone, speed, and word emphasis. In addition to sound cues, our model uses transcripts to determine whether a command is a correction to the previous command. Our model also obtains internal input from the agent, indicating whether the previous command was executed successfully or not. Finally, we release a unique dataset in which users interacted with an intelligent agent assistant, by giving it commands. This dataset includes labels on pairs of consecutive commands, which indicate whether the latter command is in fact a correction of the former command. We show that SAIF outperforms current state-of-the-art methods on this dataset. |
format | Online Article Text |
id | pubmed-7582502 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-75825022020-10-29 SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants Azaria, Amos Nivasch, Keren Sensors (Basel) Communication Intelligent agents that can interact with users using natural language are becoming increasingly common. Sometimes an intelligent agent may not correctly understand a user command or may not perform it properly. In such cases, the user might try a second time by giving the agent another, slightly different command. Giving an agent the ability to detect such user corrections might help it fix its own mistakes and avoid making them in the future. In this work, we consider the problem of automatically detecting user corrections using deep learning. We develop a multimodal architecture called SAIF, which detects such user corrections, taking as inputs the user’s voice commands as well as their transcripts. Voice inputs allow SAIF to take advantage of sound cues, such as tone, speed, and word emphasis. In addition to sound cues, our model uses transcripts to determine whether a command is a correction to the previous command. Our model also obtains internal input from the agent, indicating whether the previous command was executed successfully or not. Finally, we release a unique dataset in which users interacted with an intelligent agent assistant, by giving it commands. This dataset includes labels on pairs of consecutive commands, which indicate whether the latter command is in fact a correction of the former command. We show that SAIF outperforms current state-of-the-art methods on this dataset. MDPI 2020-09-29 /pmc/articles/PMC7582502/ /pubmed/33003380 http://dx.doi.org/10.3390/s20195577 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Communication Azaria, Amos Nivasch, Keren SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants |
title | SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants |
title_full | SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants |
title_fullStr | SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants |
title_full_unstemmed | SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants |
title_short | SAIF: A Correction-Detection Deep-Learning Architecture for Personal Assistants |
title_sort | saif: a correction-detection deep-learning architecture for personal assistants |
topic | Communication |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7582502/ https://www.ncbi.nlm.nih.gov/pubmed/33003380 http://dx.doi.org/10.3390/s20195577 |
work_keys_str_mv | AT azariaamos saifacorrectiondetectiondeeplearningarchitectureforpersonalassistants AT nivaschkeren saifacorrectiondetectiondeeplearningarchitectureforpersonalassistants |