Cargando…

Relative Weights of Temporal Envelope Cues in Different Frequency Regions for Mandarin Vowel, Consonant, and Lexical Tone Recognition

Objectives: Mandarin-speaking users of cochlear implants (CI) perform poorer than their English counterpart. This may be because present CI speech coding schemes are largely based on English. This study aims to evaluate the relative contributions of temporal envelope (E) cues to Mandarin phoneme (in...

Descripción completa

Detalles Bibliográficos
Autores principales: Zheng, Zhong, Li, Keyi, Feng, Gang, Guo, Yang, Li, Yinan, Xiao, Lili, Liu, Chengqi, He, Shouhuan, Zhang, Zhen, Qian, Di, Feng, Yanmei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8678109/
https://www.ncbi.nlm.nih.gov/pubmed/34924928
http://dx.doi.org/10.3389/fnins.2021.744959
_version_ 1784616282173210624
author Zheng, Zhong
Li, Keyi
Feng, Gang
Guo, Yang
Li, Yinan
Xiao, Lili
Liu, Chengqi
He, Shouhuan
Zhang, Zhen
Qian, Di
Feng, Yanmei
author_facet Zheng, Zhong
Li, Keyi
Feng, Gang
Guo, Yang
Li, Yinan
Xiao, Lili
Liu, Chengqi
He, Shouhuan
Zhang, Zhen
Qian, Di
Feng, Yanmei
author_sort Zheng, Zhong
collection PubMed
description Objectives: Mandarin-speaking users of cochlear implants (CI) perform poorer than their English counterpart. This may be because present CI speech coding schemes are largely based on English. This study aims to evaluate the relative contributions of temporal envelope (E) cues to Mandarin phoneme (including vowel, and consonant) and lexical tone recognition to provide information for speech coding schemes specific to Mandarin. Design: Eleven normal hearing subjects were studied using acoustic temporal E cues that were extracted from 30 continuous frequency bands between 80 and 7,562 Hz using the Hilbert transform and divided into five frequency regions. Percent-correct recognition scores were obtained with acoustic E cues presented in three, four, and five frequency regions and their relative weights calculated using the least-square approach. Results: For stimuli with three, four, and five frequency regions, percent-correct scores for vowel recognition using E cues were 50.43–84.82%, 76.27–95.24%, and 96.58%, respectively; for consonant recognition 35.49–63.77%, 67.75–78.87%, and 87.87%; for lexical tone recognition 60.80–97.15%, 73.16–96.87%, and 96.73%. For frequency region 1 to frequency region 5, the mean weights in vowel recognition were 0.17, 0.31, 0.22, 0.18, and 0.12, respectively; in consonant recognition 0.10, 0.16, 0.18, 0.23, and 0.33; in lexical tone recognition 0.38, 0.18, 0.14, 0.16, and 0.14. Conclusion: Regions that contributed most for vowel recognition was Region 2 (502–1,022 Hz) that contains first formant (F1) information; Region 5 (3,856–7,562 Hz) contributed most to consonant recognition; Region 1 (80–502 Hz) that contains fundamental frequency (F0) information contributed most to lexical tone recognition.
format Online
Article
Text
id pubmed-8678109
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-86781092021-12-18 Relative Weights of Temporal Envelope Cues in Different Frequency Regions for Mandarin Vowel, Consonant, and Lexical Tone Recognition Zheng, Zhong Li, Keyi Feng, Gang Guo, Yang Li, Yinan Xiao, Lili Liu, Chengqi He, Shouhuan Zhang, Zhen Qian, Di Feng, Yanmei Front Neurosci Neuroscience Objectives: Mandarin-speaking users of cochlear implants (CI) perform poorer than their English counterpart. This may be because present CI speech coding schemes are largely based on English. This study aims to evaluate the relative contributions of temporal envelope (E) cues to Mandarin phoneme (including vowel, and consonant) and lexical tone recognition to provide information for speech coding schemes specific to Mandarin. Design: Eleven normal hearing subjects were studied using acoustic temporal E cues that were extracted from 30 continuous frequency bands between 80 and 7,562 Hz using the Hilbert transform and divided into five frequency regions. Percent-correct recognition scores were obtained with acoustic E cues presented in three, four, and five frequency regions and their relative weights calculated using the least-square approach. Results: For stimuli with three, four, and five frequency regions, percent-correct scores for vowel recognition using E cues were 50.43–84.82%, 76.27–95.24%, and 96.58%, respectively; for consonant recognition 35.49–63.77%, 67.75–78.87%, and 87.87%; for lexical tone recognition 60.80–97.15%, 73.16–96.87%, and 96.73%. For frequency region 1 to frequency region 5, the mean weights in vowel recognition were 0.17, 0.31, 0.22, 0.18, and 0.12, respectively; in consonant recognition 0.10, 0.16, 0.18, 0.23, and 0.33; in lexical tone recognition 0.38, 0.18, 0.14, 0.16, and 0.14. Conclusion: Regions that contributed most for vowel recognition was Region 2 (502–1,022 Hz) that contains first formant (F1) information; Region 5 (3,856–7,562 Hz) contributed most to consonant recognition; Region 1 (80–502 Hz) that contains fundamental frequency (F0) information contributed most to lexical tone recognition. Frontiers Media S.A. 2021-12-02 /pmc/articles/PMC8678109/ /pubmed/34924928 http://dx.doi.org/10.3389/fnins.2021.744959 Text en Copyright © 2021 Zheng, Li, Feng, Guo, Li, Xiao, Liu, He, Zhang, Qian and Feng. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Zheng, Zhong
Li, Keyi
Feng, Gang
Guo, Yang
Li, Yinan
Xiao, Lili
Liu, Chengqi
He, Shouhuan
Zhang, Zhen
Qian, Di
Feng, Yanmei
Relative Weights of Temporal Envelope Cues in Different Frequency Regions for Mandarin Vowel, Consonant, and Lexical Tone Recognition
title Relative Weights of Temporal Envelope Cues in Different Frequency Regions for Mandarin Vowel, Consonant, and Lexical Tone Recognition
title_full Relative Weights of Temporal Envelope Cues in Different Frequency Regions for Mandarin Vowel, Consonant, and Lexical Tone Recognition
title_fullStr Relative Weights of Temporal Envelope Cues in Different Frequency Regions for Mandarin Vowel, Consonant, and Lexical Tone Recognition
title_full_unstemmed Relative Weights of Temporal Envelope Cues in Different Frequency Regions for Mandarin Vowel, Consonant, and Lexical Tone Recognition
title_short Relative Weights of Temporal Envelope Cues in Different Frequency Regions for Mandarin Vowel, Consonant, and Lexical Tone Recognition
title_sort relative weights of temporal envelope cues in different frequency regions for mandarin vowel, consonant, and lexical tone recognition
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8678109/
https://www.ncbi.nlm.nih.gov/pubmed/34924928
http://dx.doi.org/10.3389/fnins.2021.744959
work_keys_str_mv AT zhengzhong relativeweightsoftemporalenvelopecuesindifferentfrequencyregionsformandarinvowelconsonantandlexicaltonerecognition
AT likeyi relativeweightsoftemporalenvelopecuesindifferentfrequencyregionsformandarinvowelconsonantandlexicaltonerecognition
AT fenggang relativeweightsoftemporalenvelopecuesindifferentfrequencyregionsformandarinvowelconsonantandlexicaltonerecognition
AT guoyang relativeweightsoftemporalenvelopecuesindifferentfrequencyregionsformandarinvowelconsonantandlexicaltonerecognition
AT liyinan relativeweightsoftemporalenvelopecuesindifferentfrequencyregionsformandarinvowelconsonantandlexicaltonerecognition
AT xiaolili relativeweightsoftemporalenvelopecuesindifferentfrequencyregionsformandarinvowelconsonantandlexicaltonerecognition
AT liuchengqi relativeweightsoftemporalenvelopecuesindifferentfrequencyregionsformandarinvowelconsonantandlexicaltonerecognition
AT heshouhuan relativeweightsoftemporalenvelopecuesindifferentfrequencyregionsformandarinvowelconsonantandlexicaltonerecognition
AT zhangzhen relativeweightsoftemporalenvelopecuesindifferentfrequencyregionsformandarinvowelconsonantandlexicaltonerecognition
AT qiandi relativeweightsoftemporalenvelopecuesindifferentfrequencyregionsformandarinvowelconsonantandlexicaltonerecognition
AT fengyanmei relativeweightsoftemporalenvelopecuesindifferentfrequencyregionsformandarinvowelconsonantandlexicaltonerecognition