Cargando…

Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation

BACKGROUND: Taiwan has insufficient nursing resources due to the high turnover rate of health care providers. Therefore, reducing the heavy workload of these employees is essential. Herein, speech transcription, which has various potential clinical applications, was employed for the documentation of...

Descripción completa

Detalles Bibliográficos
Autores principales: Hou, Shih-Yen, Wu, Ya-Lun, Chen, Kai-Ching, Chang, Ting-An, Hsu, Yi-Min, Chuang, Su-Jung, Chang, Ying, Hsu, Kai-Cheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9773023/
https://www.ncbi.nlm.nih.gov/pubmed/36476781
http://dx.doi.org/10.2196/37562
_version_ 1784855108614356992
author Hou, Shih-Yen
Wu, Ya-Lun
Chen, Kai-Ching
Chang, Ting-An
Hsu, Yi-Min
Chuang, Su-Jung
Chang, Ying
Hsu, Kai-Cheng
author_facet Hou, Shih-Yen
Wu, Ya-Lun
Chen, Kai-Ching
Chang, Ting-An
Hsu, Yi-Min
Chuang, Su-Jung
Chang, Ying
Hsu, Kai-Cheng
author_sort Hou, Shih-Yen
collection PubMed
description BACKGROUND: Taiwan has insufficient nursing resources due to the high turnover rate of health care providers. Therefore, reducing the heavy workload of these employees is essential. Herein, speech transcription, which has various potential clinical applications, was employed for the documentation of nursing records. The requirement of including only one speaker per transcription facilitated data collection and system development. Moreover, authorization from patients was unnecessary. OBJECTIVE: The aim of this study was to construct a speech recognition system for nursing records such that health care providers can complete nursing records without typing or with only a few edits. METHODS: Nursing records in Taiwan are mainly written in Mandarin, with technical terms and abbreviations presented in both Mandarin and English. Therefore, the training set consisted of English code-switching information. Next, transfer learning (TL) and meta-TL (MTL) methods, which perform favorably in code-switching scenarios, were applied. RESULTS: As of September 2021, the China Medical University Hospital Artificial Intelligence Speech (CMaiSpeech) data set was established by manually annotating approximately 100 hours of recordings from 525 speakers. The word error rate (WER) of the benchmark model of syllable-based TL was 29.54% in code-switching. The WER of the proposed model of syllable-based MTL was 22.20% in code-switching. The test set comprised 17,247 words. Moreover, in a clinical case, the proposed model of syllable-based MTL yielded a WER of 31.06% in code-switching. The clinical test set contained 1159 words. CONCLUSIONS: This paper has two main contributions. First, the CMaiSpeech data set—a Mandarin-English corpus—has been established. Health care providers in Taiwan are often compelled to use a mixture of Mandarin and English in nursing records. Second, an automatic speech recognition system for nursing record document conversion was proposed. The proposed system can shorten the work handover time and further reduce the workload of health care providers.
format Online
Article
Text
id pubmed-9773023
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-97730232022-12-23 Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation Hou, Shih-Yen Wu, Ya-Lun Chen, Kai-Ching Chang, Ting-An Hsu, Yi-Min Chuang, Su-Jung Chang, Ying Hsu, Kai-Cheng JMIR Nurs Original Paper BACKGROUND: Taiwan has insufficient nursing resources due to the high turnover rate of health care providers. Therefore, reducing the heavy workload of these employees is essential. Herein, speech transcription, which has various potential clinical applications, was employed for the documentation of nursing records. The requirement of including only one speaker per transcription facilitated data collection and system development. Moreover, authorization from patients was unnecessary. OBJECTIVE: The aim of this study was to construct a speech recognition system for nursing records such that health care providers can complete nursing records without typing or with only a few edits. METHODS: Nursing records in Taiwan are mainly written in Mandarin, with technical terms and abbreviations presented in both Mandarin and English. Therefore, the training set consisted of English code-switching information. Next, transfer learning (TL) and meta-TL (MTL) methods, which perform favorably in code-switching scenarios, were applied. RESULTS: As of September 2021, the China Medical University Hospital Artificial Intelligence Speech (CMaiSpeech) data set was established by manually annotating approximately 100 hours of recordings from 525 speakers. The word error rate (WER) of the benchmark model of syllable-based TL was 29.54% in code-switching. The WER of the proposed model of syllable-based MTL was 22.20% in code-switching. The test set comprised 17,247 words. Moreover, in a clinical case, the proposed model of syllable-based MTL yielded a WER of 31.06% in code-switching. The clinical test set contained 1159 words. CONCLUSIONS: This paper has two main contributions. First, the CMaiSpeech data set—a Mandarin-English corpus—has been established. Health care providers in Taiwan are often compelled to use a mixture of Mandarin and English in nursing records. Second, an automatic speech recognition system for nursing record document conversion was proposed. The proposed system can shorten the work handover time and further reduce the workload of health care providers. JMIR Publications 2022-12-07 /pmc/articles/PMC9773023/ /pubmed/36476781 http://dx.doi.org/10.2196/37562 Text en ©Shih-Yen Hou, Ya-Lun Wu, Kai-Ching Chen, Ting-An Chang, Yi-Min Hsu, Su-Jung Chuang, Ying Chang, Kai-Cheng Hsu. Originally published in JMIR Nursing (https://nursing.jmir.org), 07.12.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Nursing, is properly cited. The complete bibliographic information, a link to the original publication on https://nursing.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Hou, Shih-Yen
Wu, Ya-Lun
Chen, Kai-Ching
Chang, Ting-An
Hsu, Yi-Min
Chuang, Su-Jung
Chang, Ying
Hsu, Kai-Cheng
Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation
title Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation
title_full Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation
title_fullStr Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation
title_full_unstemmed Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation
title_short Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation
title_sort code-switching automatic speech recognition for nursing record documentation: system development and evaluation
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9773023/
https://www.ncbi.nlm.nih.gov/pubmed/36476781
http://dx.doi.org/10.2196/37562
work_keys_str_mv AT houshihyen codeswitchingautomaticspeechrecognitionfornursingrecorddocumentationsystemdevelopmentandevaluation
AT wuyalun codeswitchingautomaticspeechrecognitionfornursingrecorddocumentationsystemdevelopmentandevaluation
AT chenkaiching codeswitchingautomaticspeechrecognitionfornursingrecorddocumentationsystemdevelopmentandevaluation
AT changtingan codeswitchingautomaticspeechrecognitionfornursingrecorddocumentationsystemdevelopmentandevaluation
AT hsuyimin codeswitchingautomaticspeechrecognitionfornursingrecorddocumentationsystemdevelopmentandevaluation
AT chuangsujung codeswitchingautomaticspeechrecognitionfornursingrecorddocumentationsystemdevelopmentandevaluation
AT changying codeswitchingautomaticspeechrecognitionfornursingrecorddocumentationsystemdevelopmentandevaluation
AT hsukaicheng codeswitchingautomaticspeechrecognitionfornursingrecorddocumentationsystemdevelopmentandevaluation