Cargando…
Using Automatic Speech Recognition to Assess Thai Speech Language Fluency in the Montreal Cognitive Assessment (MoCA)
The Montreal cognitive assessment (MoCA), a widely accepted screening tool for identifying patients with mild cognitive impairment (MCI), includes a language fluency test of verbal functioning; its scores are based on the number of unique correct words produced by the test taker. However, it is poss...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8875410/ https://www.ncbi.nlm.nih.gov/pubmed/35214483 http://dx.doi.org/10.3390/s22041583 |
_version_ | 1784657905534894080 |
---|---|
author | Kantithammakorn, Pimarn Punyabukkana, Proadpran Pratanwanich, Ploy N. Hemrungrojn, Solaphat Chunharas, Chaipat Wanvarie, Dittaya |
author_facet | Kantithammakorn, Pimarn Punyabukkana, Proadpran Pratanwanich, Ploy N. Hemrungrojn, Solaphat Chunharas, Chaipat Wanvarie, Dittaya |
author_sort | Kantithammakorn, Pimarn |
collection | PubMed |
description | The Montreal cognitive assessment (MoCA), a widely accepted screening tool for identifying patients with mild cognitive impairment (MCI), includes a language fluency test of verbal functioning; its scores are based on the number of unique correct words produced by the test taker. However, it is possible that unique words may be counted differently for various languages. This study focuses on Thai as a language that differs from English in terms of word combinations. We applied various automatic speech recognition (ASR) techniques to develop an assisted scoring system for the MoCA language fluency test with Thai language support. This was a challenge because Thai is a low-resource language for which domain-specific data are not publicly available, especially speech data from patients with MCIs. Furthermore, the great variety of pronunciation, intonation, tone, and accent of the patients, all of which might differ from healthy controls, bring more complexity to the model. We propose a hybrid time delay neural network hidden Markov model (TDNN-HMM) architecture for acoustic model training to create our ASR system that is robust to environmental noise and to the variation of voice quality impacted by MCI. The LOTUS Thai speech corpus was incorporated into the training set to improve the model’s generalization. A preprocessing algorithm was implemented to reduce the background noise and improve the overall data quality before feeding data into the TDNN-HMM system for automatic word detection and language fluency score calculation. The results show that the TDNN-HMM model in combination with data augmentation using lattice-free maximum mutual information (LF-MMI) objective function provides a word error rate (WER) of 30.77%. To our knowledge, this is the first study to develop an ASR with Thai language support to automate the scoring system of MoCA’s language fluency assessment. |
format | Online Article Text |
id | pubmed-8875410 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-88754102022-02-26 Using Automatic Speech Recognition to Assess Thai Speech Language Fluency in the Montreal Cognitive Assessment (MoCA) Kantithammakorn, Pimarn Punyabukkana, Proadpran Pratanwanich, Ploy N. Hemrungrojn, Solaphat Chunharas, Chaipat Wanvarie, Dittaya Sensors (Basel) Article The Montreal cognitive assessment (MoCA), a widely accepted screening tool for identifying patients with mild cognitive impairment (MCI), includes a language fluency test of verbal functioning; its scores are based on the number of unique correct words produced by the test taker. However, it is possible that unique words may be counted differently for various languages. This study focuses on Thai as a language that differs from English in terms of word combinations. We applied various automatic speech recognition (ASR) techniques to develop an assisted scoring system for the MoCA language fluency test with Thai language support. This was a challenge because Thai is a low-resource language for which domain-specific data are not publicly available, especially speech data from patients with MCIs. Furthermore, the great variety of pronunciation, intonation, tone, and accent of the patients, all of which might differ from healthy controls, bring more complexity to the model. We propose a hybrid time delay neural network hidden Markov model (TDNN-HMM) architecture for acoustic model training to create our ASR system that is robust to environmental noise and to the variation of voice quality impacted by MCI. The LOTUS Thai speech corpus was incorporated into the training set to improve the model’s generalization. A preprocessing algorithm was implemented to reduce the background noise and improve the overall data quality before feeding data into the TDNN-HMM system for automatic word detection and language fluency score calculation. The results show that the TDNN-HMM model in combination with data augmentation using lattice-free maximum mutual information (LF-MMI) objective function provides a word error rate (WER) of 30.77%. To our knowledge, this is the first study to develop an ASR with Thai language support to automate the scoring system of MoCA’s language fluency assessment. MDPI 2022-02-17 /pmc/articles/PMC8875410/ /pubmed/35214483 http://dx.doi.org/10.3390/s22041583 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Kantithammakorn, Pimarn Punyabukkana, Proadpran Pratanwanich, Ploy N. Hemrungrojn, Solaphat Chunharas, Chaipat Wanvarie, Dittaya Using Automatic Speech Recognition to Assess Thai Speech Language Fluency in the Montreal Cognitive Assessment (MoCA) |
title | Using Automatic Speech Recognition to Assess Thai Speech Language Fluency in the Montreal Cognitive Assessment (MoCA) |
title_full | Using Automatic Speech Recognition to Assess Thai Speech Language Fluency in the Montreal Cognitive Assessment (MoCA) |
title_fullStr | Using Automatic Speech Recognition to Assess Thai Speech Language Fluency in the Montreal Cognitive Assessment (MoCA) |
title_full_unstemmed | Using Automatic Speech Recognition to Assess Thai Speech Language Fluency in the Montreal Cognitive Assessment (MoCA) |
title_short | Using Automatic Speech Recognition to Assess Thai Speech Language Fluency in the Montreal Cognitive Assessment (MoCA) |
title_sort | using automatic speech recognition to assess thai speech language fluency in the montreal cognitive assessment (moca) |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8875410/ https://www.ncbi.nlm.nih.gov/pubmed/35214483 http://dx.doi.org/10.3390/s22041583 |
work_keys_str_mv | AT kantithammakornpimarn usingautomaticspeechrecognitiontoassessthaispeechlanguagefluencyinthemontrealcognitiveassessmentmoca AT punyabukkanaproadpran usingautomaticspeechrecognitiontoassessthaispeechlanguagefluencyinthemontrealcognitiveassessmentmoca AT pratanwanichployn usingautomaticspeechrecognitiontoassessthaispeechlanguagefluencyinthemontrealcognitiveassessmentmoca AT hemrungrojnsolaphat usingautomaticspeechrecognitiontoassessthaispeechlanguagefluencyinthemontrealcognitiveassessmentmoca AT chunharaschaipat usingautomaticspeechrecognitiontoassessthaispeechlanguagefluencyinthemontrealcognitiveassessmentmoca AT wanvariedittaya usingautomaticspeechrecognitiontoassessthaispeechlanguagefluencyinthemontrealcognitiveassessmentmoca |