Cargando…
Automatic Labeled Dialogue Generation for Nursing Record Systems
The integration of digital voice assistants in nursing residences is becoming increasingly important to facilitate nursing productivity with documentation. A key idea behind this system is training natural language understanding (NLU) modules that enable the machine to classify the purpose of the us...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7564988/ https://www.ncbi.nlm.nih.gov/pubmed/32708593 http://dx.doi.org/10.3390/jpm10030062 |
_version_ | 1783595838003478528 |
---|---|
author | Mairittha, Tittaya Mairittha, Nattaya Inoue, Sozo |
author_facet | Mairittha, Tittaya Mairittha, Nattaya Inoue, Sozo |
author_sort | Mairittha, Tittaya |
collection | PubMed |
description | The integration of digital voice assistants in nursing residences is becoming increasingly important to facilitate nursing productivity with documentation. A key idea behind this system is training natural language understanding (NLU) modules that enable the machine to classify the purpose of the user utterance (intent) and extract pieces of valuable information present in the utterance (entity). One of the main obstacles when creating robust NLU is the lack of sufficient labeled data, which generally relies on human labeling. This process is cost-intensive and time-consuming, particularly in the high-level nursing care domain, which requires abstract knowledge. In this paper, we propose an automatic dialogue labeling framework of NLU tasks, specifically for nursing record systems. First, we apply data augmentation techniques to create a collection of variant sample utterances. The individual evaluation result strongly shows a stratification rate, with regard to both fluency and accuracy in utterances. We also investigate the possibility of applying deep generative models for our augmented dataset. The preliminary character-based model based on long short-term memory (LSTM) obtains an accuracy of 90% and generates various reasonable texts with BLEU scores of 0.76. Secondly, we introduce an idea for intent and entity labeling by using feature embeddings and semantic similarity-based clustering. We also empirically evaluate different embedding methods for learning good representations that are most suitable to use with our data and clustering tasks. Experimental results show that fastText embeddings produce strong performances both for intent labeling and on entity labeling, which achieves an accuracy level of 0.79 and 0.78 f1-scores and 0.67 and 0.61 silhouette scores, respectively. |
format | Online Article Text |
id | pubmed-7564988 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-75649882020-10-26 Automatic Labeled Dialogue Generation for Nursing Record Systems Mairittha, Tittaya Mairittha, Nattaya Inoue, Sozo J Pers Med Article The integration of digital voice assistants in nursing residences is becoming increasingly important to facilitate nursing productivity with documentation. A key idea behind this system is training natural language understanding (NLU) modules that enable the machine to classify the purpose of the user utterance (intent) and extract pieces of valuable information present in the utterance (entity). One of the main obstacles when creating robust NLU is the lack of sufficient labeled data, which generally relies on human labeling. This process is cost-intensive and time-consuming, particularly in the high-level nursing care domain, which requires abstract knowledge. In this paper, we propose an automatic dialogue labeling framework of NLU tasks, specifically for nursing record systems. First, we apply data augmentation techniques to create a collection of variant sample utterances. The individual evaluation result strongly shows a stratification rate, with regard to both fluency and accuracy in utterances. We also investigate the possibility of applying deep generative models for our augmented dataset. The preliminary character-based model based on long short-term memory (LSTM) obtains an accuracy of 90% and generates various reasonable texts with BLEU scores of 0.76. Secondly, we introduce an idea for intent and entity labeling by using feature embeddings and semantic similarity-based clustering. We also empirically evaluate different embedding methods for learning good representations that are most suitable to use with our data and clustering tasks. Experimental results show that fastText embeddings produce strong performances both for intent labeling and on entity labeling, which achieves an accuracy level of 0.79 and 0.78 f1-scores and 0.67 and 0.61 silhouette scores, respectively. MDPI 2020-07-16 /pmc/articles/PMC7564988/ /pubmed/32708593 http://dx.doi.org/10.3390/jpm10030062 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Mairittha, Tittaya Mairittha, Nattaya Inoue, Sozo Automatic Labeled Dialogue Generation for Nursing Record Systems |
title | Automatic Labeled Dialogue Generation for Nursing Record Systems |
title_full | Automatic Labeled Dialogue Generation for Nursing Record Systems |
title_fullStr | Automatic Labeled Dialogue Generation for Nursing Record Systems |
title_full_unstemmed | Automatic Labeled Dialogue Generation for Nursing Record Systems |
title_short | Automatic Labeled Dialogue Generation for Nursing Record Systems |
title_sort | automatic labeled dialogue generation for nursing record systems |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7564988/ https://www.ncbi.nlm.nih.gov/pubmed/32708593 http://dx.doi.org/10.3390/jpm10030062 |
work_keys_str_mv | AT mairitthatittaya automaticlabeleddialoguegenerationfornursingrecordsystems AT mairitthanattaya automaticlabeleddialoguegenerationfornursingrecordsystems AT inouesozo automaticlabeleddialoguegenerationfornursingrecordsystems |