Cargando…

Using Automatic Speech Recognition to Assess Thai Speech Language Fluency in the Montreal Cognitive Assessment (MoCA)

The Montreal cognitive assessment (MoCA), a widely accepted screening tool for identifying patients with mild cognitive impairment (MCI), includes a language fluency test of verbal functioning; its scores are based on the number of unique correct words produced by the test taker. However, it is poss...

Descripción completa

Detalles Bibliográficos
Autores principales: Kantithammakorn, Pimarn, Punyabukkana, Proadpran, Pratanwanich, Ploy N., Hemrungrojn, Solaphat, Chunharas, Chaipat, Wanvarie, Dittaya
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8875410/
https://www.ncbi.nlm.nih.gov/pubmed/35214483
http://dx.doi.org/10.3390/s22041583
_version_ 1784657905534894080
author Kantithammakorn, Pimarn
Punyabukkana, Proadpran
Pratanwanich, Ploy N.
Hemrungrojn, Solaphat
Chunharas, Chaipat
Wanvarie, Dittaya
author_facet Kantithammakorn, Pimarn
Punyabukkana, Proadpran
Pratanwanich, Ploy N.
Hemrungrojn, Solaphat
Chunharas, Chaipat
Wanvarie, Dittaya
author_sort Kantithammakorn, Pimarn
collection PubMed
description The Montreal cognitive assessment (MoCA), a widely accepted screening tool for identifying patients with mild cognitive impairment (MCI), includes a language fluency test of verbal functioning; its scores are based on the number of unique correct words produced by the test taker. However, it is possible that unique words may be counted differently for various languages. This study focuses on Thai as a language that differs from English in terms of word combinations. We applied various automatic speech recognition (ASR) techniques to develop an assisted scoring system for the MoCA language fluency test with Thai language support. This was a challenge because Thai is a low-resource language for which domain-specific data are not publicly available, especially speech data from patients with MCIs. Furthermore, the great variety of pronunciation, intonation, tone, and accent of the patients, all of which might differ from healthy controls, bring more complexity to the model. We propose a hybrid time delay neural network hidden Markov model (TDNN-HMM) architecture for acoustic model training to create our ASR system that is robust to environmental noise and to the variation of voice quality impacted by MCI. The LOTUS Thai speech corpus was incorporated into the training set to improve the model’s generalization. A preprocessing algorithm was implemented to reduce the background noise and improve the overall data quality before feeding data into the TDNN-HMM system for automatic word detection and language fluency score calculation. The results show that the TDNN-HMM model in combination with data augmentation using lattice-free maximum mutual information (LF-MMI) objective function provides a word error rate (WER) of 30.77%. To our knowledge, this is the first study to develop an ASR with Thai language support to automate the scoring system of MoCA’s language fluency assessment.
format Online
Article
Text
id pubmed-8875410
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-88754102022-02-26 Using Automatic Speech Recognition to Assess Thai Speech Language Fluency in the Montreal Cognitive Assessment (MoCA) Kantithammakorn, Pimarn Punyabukkana, Proadpran Pratanwanich, Ploy N. Hemrungrojn, Solaphat Chunharas, Chaipat Wanvarie, Dittaya Sensors (Basel) Article The Montreal cognitive assessment (MoCA), a widely accepted screening tool for identifying patients with mild cognitive impairment (MCI), includes a language fluency test of verbal functioning; its scores are based on the number of unique correct words produced by the test taker. However, it is possible that unique words may be counted differently for various languages. This study focuses on Thai as a language that differs from English in terms of word combinations. We applied various automatic speech recognition (ASR) techniques to develop an assisted scoring system for the MoCA language fluency test with Thai language support. This was a challenge because Thai is a low-resource language for which domain-specific data are not publicly available, especially speech data from patients with MCIs. Furthermore, the great variety of pronunciation, intonation, tone, and accent of the patients, all of which might differ from healthy controls, bring more complexity to the model. We propose a hybrid time delay neural network hidden Markov model (TDNN-HMM) architecture for acoustic model training to create our ASR system that is robust to environmental noise and to the variation of voice quality impacted by MCI. The LOTUS Thai speech corpus was incorporated into the training set to improve the model’s generalization. A preprocessing algorithm was implemented to reduce the background noise and improve the overall data quality before feeding data into the TDNN-HMM system for automatic word detection and language fluency score calculation. The results show that the TDNN-HMM model in combination with data augmentation using lattice-free maximum mutual information (LF-MMI) objective function provides a word error rate (WER) of 30.77%. To our knowledge, this is the first study to develop an ASR with Thai language support to automate the scoring system of MoCA’s language fluency assessment. MDPI 2022-02-17 /pmc/articles/PMC8875410/ /pubmed/35214483 http://dx.doi.org/10.3390/s22041583 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kantithammakorn, Pimarn
Punyabukkana, Proadpran
Pratanwanich, Ploy N.
Hemrungrojn, Solaphat
Chunharas, Chaipat
Wanvarie, Dittaya
Using Automatic Speech Recognition to Assess Thai Speech Language Fluency in the Montreal Cognitive Assessment (MoCA)
title Using Automatic Speech Recognition to Assess Thai Speech Language Fluency in the Montreal Cognitive Assessment (MoCA)
title_full Using Automatic Speech Recognition to Assess Thai Speech Language Fluency in the Montreal Cognitive Assessment (MoCA)
title_fullStr Using Automatic Speech Recognition to Assess Thai Speech Language Fluency in the Montreal Cognitive Assessment (MoCA)
title_full_unstemmed Using Automatic Speech Recognition to Assess Thai Speech Language Fluency in the Montreal Cognitive Assessment (MoCA)
title_short Using Automatic Speech Recognition to Assess Thai Speech Language Fluency in the Montreal Cognitive Assessment (MoCA)
title_sort using automatic speech recognition to assess thai speech language fluency in the montreal cognitive assessment (moca)
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8875410/
https://www.ncbi.nlm.nih.gov/pubmed/35214483
http://dx.doi.org/10.3390/s22041583
work_keys_str_mv AT kantithammakornpimarn usingautomaticspeechrecognitiontoassessthaispeechlanguagefluencyinthemontrealcognitiveassessmentmoca
AT punyabukkanaproadpran usingautomaticspeechrecognitiontoassessthaispeechlanguagefluencyinthemontrealcognitiveassessmentmoca
AT pratanwanichployn usingautomaticspeechrecognitiontoassessthaispeechlanguagefluencyinthemontrealcognitiveassessmentmoca
AT hemrungrojnsolaphat usingautomaticspeechrecognitiontoassessthaispeechlanguagefluencyinthemontrealcognitiveassessmentmoca
AT chunharaschaipat usingautomaticspeechrecognitiontoassessthaispeechlanguagefluencyinthemontrealcognitiveassessmentmoca
AT wanvariedittaya usingautomaticspeechrecognitiontoassessthaispeechlanguagefluencyinthemontrealcognitiveassessmentmoca