Cargando…
Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation
BACKGROUND: Taiwan has insufficient nursing resources due to the high turnover rate of health care providers. Therefore, reducing the heavy workload of these employees is essential. Herein, speech transcription, which has various potential clinical applications, was employed for the documentation of...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9773023/ https://www.ncbi.nlm.nih.gov/pubmed/36476781 http://dx.doi.org/10.2196/37562 |
_version_ | 1784855108614356992 |
---|---|
author | Hou, Shih-Yen Wu, Ya-Lun Chen, Kai-Ching Chang, Ting-An Hsu, Yi-Min Chuang, Su-Jung Chang, Ying Hsu, Kai-Cheng |
author_facet | Hou, Shih-Yen Wu, Ya-Lun Chen, Kai-Ching Chang, Ting-An Hsu, Yi-Min Chuang, Su-Jung Chang, Ying Hsu, Kai-Cheng |
author_sort | Hou, Shih-Yen |
collection | PubMed |
description | BACKGROUND: Taiwan has insufficient nursing resources due to the high turnover rate of health care providers. Therefore, reducing the heavy workload of these employees is essential. Herein, speech transcription, which has various potential clinical applications, was employed for the documentation of nursing records. The requirement of including only one speaker per transcription facilitated data collection and system development. Moreover, authorization from patients was unnecessary. OBJECTIVE: The aim of this study was to construct a speech recognition system for nursing records such that health care providers can complete nursing records without typing or with only a few edits. METHODS: Nursing records in Taiwan are mainly written in Mandarin, with technical terms and abbreviations presented in both Mandarin and English. Therefore, the training set consisted of English code-switching information. Next, transfer learning (TL) and meta-TL (MTL) methods, which perform favorably in code-switching scenarios, were applied. RESULTS: As of September 2021, the China Medical University Hospital Artificial Intelligence Speech (CMaiSpeech) data set was established by manually annotating approximately 100 hours of recordings from 525 speakers. The word error rate (WER) of the benchmark model of syllable-based TL was 29.54% in code-switching. The WER of the proposed model of syllable-based MTL was 22.20% in code-switching. The test set comprised 17,247 words. Moreover, in a clinical case, the proposed model of syllable-based MTL yielded a WER of 31.06% in code-switching. The clinical test set contained 1159 words. CONCLUSIONS: This paper has two main contributions. First, the CMaiSpeech data set—a Mandarin-English corpus—has been established. Health care providers in Taiwan are often compelled to use a mixture of Mandarin and English in nursing records. Second, an automatic speech recognition system for nursing record document conversion was proposed. The proposed system can shorten the work handover time and further reduce the workload of health care providers. |
format | Online Article Text |
id | pubmed-9773023 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-97730232022-12-23 Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation Hou, Shih-Yen Wu, Ya-Lun Chen, Kai-Ching Chang, Ting-An Hsu, Yi-Min Chuang, Su-Jung Chang, Ying Hsu, Kai-Cheng JMIR Nurs Original Paper BACKGROUND: Taiwan has insufficient nursing resources due to the high turnover rate of health care providers. Therefore, reducing the heavy workload of these employees is essential. Herein, speech transcription, which has various potential clinical applications, was employed for the documentation of nursing records. The requirement of including only one speaker per transcription facilitated data collection and system development. Moreover, authorization from patients was unnecessary. OBJECTIVE: The aim of this study was to construct a speech recognition system for nursing records such that health care providers can complete nursing records without typing or with only a few edits. METHODS: Nursing records in Taiwan are mainly written in Mandarin, with technical terms and abbreviations presented in both Mandarin and English. Therefore, the training set consisted of English code-switching information. Next, transfer learning (TL) and meta-TL (MTL) methods, which perform favorably in code-switching scenarios, were applied. RESULTS: As of September 2021, the China Medical University Hospital Artificial Intelligence Speech (CMaiSpeech) data set was established by manually annotating approximately 100 hours of recordings from 525 speakers. The word error rate (WER) of the benchmark model of syllable-based TL was 29.54% in code-switching. The WER of the proposed model of syllable-based MTL was 22.20% in code-switching. The test set comprised 17,247 words. Moreover, in a clinical case, the proposed model of syllable-based MTL yielded a WER of 31.06% in code-switching. The clinical test set contained 1159 words. CONCLUSIONS: This paper has two main contributions. First, the CMaiSpeech data set—a Mandarin-English corpus—has been established. Health care providers in Taiwan are often compelled to use a mixture of Mandarin and English in nursing records. Second, an automatic speech recognition system for nursing record document conversion was proposed. The proposed system can shorten the work handover time and further reduce the workload of health care providers. JMIR Publications 2022-12-07 /pmc/articles/PMC9773023/ /pubmed/36476781 http://dx.doi.org/10.2196/37562 Text en ©Shih-Yen Hou, Ya-Lun Wu, Kai-Ching Chen, Ting-An Chang, Yi-Min Hsu, Su-Jung Chuang, Ying Chang, Kai-Cheng Hsu. Originally published in JMIR Nursing (https://nursing.jmir.org), 07.12.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Nursing, is properly cited. The complete bibliographic information, a link to the original publication on https://nursing.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Hou, Shih-Yen Wu, Ya-Lun Chen, Kai-Ching Chang, Ting-An Hsu, Yi-Min Chuang, Su-Jung Chang, Ying Hsu, Kai-Cheng Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation |
title | Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation |
title_full | Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation |
title_fullStr | Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation |
title_full_unstemmed | Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation |
title_short | Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation |
title_sort | code-switching automatic speech recognition for nursing record documentation: system development and evaluation |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9773023/ https://www.ncbi.nlm.nih.gov/pubmed/36476781 http://dx.doi.org/10.2196/37562 |
work_keys_str_mv | AT houshihyen codeswitchingautomaticspeechrecognitionfornursingrecorddocumentationsystemdevelopmentandevaluation AT wuyalun codeswitchingautomaticspeechrecognitionfornursingrecorddocumentationsystemdevelopmentandevaluation AT chenkaiching codeswitchingautomaticspeechrecognitionfornursingrecorddocumentationsystemdevelopmentandevaluation AT changtingan codeswitchingautomaticspeechrecognitionfornursingrecorddocumentationsystemdevelopmentandevaluation AT hsuyimin codeswitchingautomaticspeechrecognitionfornursingrecorddocumentationsystemdevelopmentandevaluation AT chuangsujung codeswitchingautomaticspeechrecognitionfornursingrecorddocumentationsystemdevelopmentandevaluation AT changying codeswitchingautomaticspeechrecognitionfornursingrecorddocumentationsystemdevelopmentandevaluation AT hsukaicheng codeswitchingautomaticspeechrecognitionfornursingrecorddocumentationsystemdevelopmentandevaluation |