Cargando…

Automatic Labeled Dialogue Generation for Nursing Record Systems

The integration of digital voice assistants in nursing residences is becoming increasingly important to facilitate nursing productivity with documentation. A key idea behind this system is training natural language understanding (NLU) modules that enable the machine to classify the purpose of the us...

Descripción completa

Detalles Bibliográficos
Autores principales: Mairittha, Tittaya, Mairittha, Nattaya, Inoue, Sozo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7564988/
https://www.ncbi.nlm.nih.gov/pubmed/32708593
http://dx.doi.org/10.3390/jpm10030062
_version_ 1783595838003478528
author Mairittha, Tittaya
Mairittha, Nattaya
Inoue, Sozo
author_facet Mairittha, Tittaya
Mairittha, Nattaya
Inoue, Sozo
author_sort Mairittha, Tittaya
collection PubMed
description The integration of digital voice assistants in nursing residences is becoming increasingly important to facilitate nursing productivity with documentation. A key idea behind this system is training natural language understanding (NLU) modules that enable the machine to classify the purpose of the user utterance (intent) and extract pieces of valuable information present in the utterance (entity). One of the main obstacles when creating robust NLU is the lack of sufficient labeled data, which generally relies on human labeling. This process is cost-intensive and time-consuming, particularly in the high-level nursing care domain, which requires abstract knowledge. In this paper, we propose an automatic dialogue labeling framework of NLU tasks, specifically for nursing record systems. First, we apply data augmentation techniques to create a collection of variant sample utterances. The individual evaluation result strongly shows a stratification rate, with regard to both fluency and accuracy in utterances. We also investigate the possibility of applying deep generative models for our augmented dataset. The preliminary character-based model based on long short-term memory (LSTM) obtains an accuracy of 90% and generates various reasonable texts with BLEU scores of 0.76. Secondly, we introduce an idea for intent and entity labeling by using feature embeddings and semantic similarity-based clustering. We also empirically evaluate different embedding methods for learning good representations that are most suitable to use with our data and clustering tasks. Experimental results show that fastText embeddings produce strong performances both for intent labeling and on entity labeling, which achieves an accuracy level of 0.79 and 0.78 f1-scores and 0.67 and 0.61 silhouette scores, respectively.
format Online
Article
Text
id pubmed-7564988
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75649882020-10-26 Automatic Labeled Dialogue Generation for Nursing Record Systems Mairittha, Tittaya Mairittha, Nattaya Inoue, Sozo J Pers Med Article The integration of digital voice assistants in nursing residences is becoming increasingly important to facilitate nursing productivity with documentation. A key idea behind this system is training natural language understanding (NLU) modules that enable the machine to classify the purpose of the user utterance (intent) and extract pieces of valuable information present in the utterance (entity). One of the main obstacles when creating robust NLU is the lack of sufficient labeled data, which generally relies on human labeling. This process is cost-intensive and time-consuming, particularly in the high-level nursing care domain, which requires abstract knowledge. In this paper, we propose an automatic dialogue labeling framework of NLU tasks, specifically for nursing record systems. First, we apply data augmentation techniques to create a collection of variant sample utterances. The individual evaluation result strongly shows a stratification rate, with regard to both fluency and accuracy in utterances. We also investigate the possibility of applying deep generative models for our augmented dataset. The preliminary character-based model based on long short-term memory (LSTM) obtains an accuracy of 90% and generates various reasonable texts with BLEU scores of 0.76. Secondly, we introduce an idea for intent and entity labeling by using feature embeddings and semantic similarity-based clustering. We also empirically evaluate different embedding methods for learning good representations that are most suitable to use with our data and clustering tasks. Experimental results show that fastText embeddings produce strong performances both for intent labeling and on entity labeling, which achieves an accuracy level of 0.79 and 0.78 f1-scores and 0.67 and 0.61 silhouette scores, respectively. MDPI 2020-07-16 /pmc/articles/PMC7564988/ /pubmed/32708593 http://dx.doi.org/10.3390/jpm10030062 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Mairittha, Tittaya
Mairittha, Nattaya
Inoue, Sozo
Automatic Labeled Dialogue Generation for Nursing Record Systems
title Automatic Labeled Dialogue Generation for Nursing Record Systems
title_full Automatic Labeled Dialogue Generation for Nursing Record Systems
title_fullStr Automatic Labeled Dialogue Generation for Nursing Record Systems
title_full_unstemmed Automatic Labeled Dialogue Generation for Nursing Record Systems
title_short Automatic Labeled Dialogue Generation for Nursing Record Systems
title_sort automatic labeled dialogue generation for nursing record systems
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7564988/
https://www.ncbi.nlm.nih.gov/pubmed/32708593
http://dx.doi.org/10.3390/jpm10030062
work_keys_str_mv AT mairitthatittaya automaticlabeleddialoguegenerationfornursingrecordsystems
AT mairitthanattaya automaticlabeleddialoguegenerationfornursingrecordsystems
AT inouesozo automaticlabeleddialoguegenerationfornursingrecordsystems