Cargando…

Impacts of multicollinearity on CAPT modalities: An heterogeneous machine learning framework for computer-assisted French phoneme pronunciation training

Phoneme pronunciations are usually considered as basic skills for learning a foreign language. Practicing the pronunciations in a computer-assisted way is helpful in a self-directed or long-distance learning environment. Recent researches indicate that machine learning is a promising method to build...

Descripción completa

Detalles Bibliográficos
Autores principales: Bi, Yanjing, Li, Chao, Benezeth, Yannick, Yang, Fan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8523060/
https://www.ncbi.nlm.nih.gov/pubmed/34662367
http://dx.doi.org/10.1371/journal.pone.0257901
_version_ 1784585215542296576
author Bi, Yanjing
Li, Chao
Benezeth, Yannick
Yang, Fan
author_facet Bi, Yanjing
Li, Chao
Benezeth, Yannick
Yang, Fan
author_sort Bi, Yanjing
collection PubMed
description Phoneme pronunciations are usually considered as basic skills for learning a foreign language. Practicing the pronunciations in a computer-assisted way is helpful in a self-directed or long-distance learning environment. Recent researches indicate that machine learning is a promising method to build high-performance computer-assisted pronunciation training modalities. Many data-driven classifying models, such as support vector machines, back-propagation networks, deep neural networks and convolutional neural networks, are increasingly widely used for it. Yet, the acoustic waveforms of phoneme are essentially modulated from the base vibrations of vocal cords, and this fact somehow makes the predictors collinear, distorting the classifying models. A commonly-used solution to address this issue is to suppressing the collinearity of predictors via partial least square regressing algorithm. It allows to obtain high-quality predictor weighting results via predictor relationship analysis. However, as a linear regressor, the classifiers of this type possess very simple topology structures, constraining the universality of the regressors. For this issue, this paper presents an heterogeneous phoneme recognition framework which can further benefit the phoneme pronunciation diagnostic tasks by combining the partial least square with support vector machines. A French phoneme data set containing 4830 samples is established for the evaluation experiments. The experiments of this paper demonstrates that the new method improves the accuracy performance of the phoneme classifiers by 0.21 − 8.47% comparing to state-of-the-arts with different data training data density.
format Online
Article
Text
id pubmed-8523060
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-85230602021-10-19 Impacts of multicollinearity on CAPT modalities: An heterogeneous machine learning framework for computer-assisted French phoneme pronunciation training Bi, Yanjing Li, Chao Benezeth, Yannick Yang, Fan PLoS One Research Article Phoneme pronunciations are usually considered as basic skills for learning a foreign language. Practicing the pronunciations in a computer-assisted way is helpful in a self-directed or long-distance learning environment. Recent researches indicate that machine learning is a promising method to build high-performance computer-assisted pronunciation training modalities. Many data-driven classifying models, such as support vector machines, back-propagation networks, deep neural networks and convolutional neural networks, are increasingly widely used for it. Yet, the acoustic waveforms of phoneme are essentially modulated from the base vibrations of vocal cords, and this fact somehow makes the predictors collinear, distorting the classifying models. A commonly-used solution to address this issue is to suppressing the collinearity of predictors via partial least square regressing algorithm. It allows to obtain high-quality predictor weighting results via predictor relationship analysis. However, as a linear regressor, the classifiers of this type possess very simple topology structures, constraining the universality of the regressors. For this issue, this paper presents an heterogeneous phoneme recognition framework which can further benefit the phoneme pronunciation diagnostic tasks by combining the partial least square with support vector machines. A French phoneme data set containing 4830 samples is established for the evaluation experiments. The experiments of this paper demonstrates that the new method improves the accuracy performance of the phoneme classifiers by 0.21 − 8.47% comparing to state-of-the-arts with different data training data density. Public Library of Science 2021-10-18 /pmc/articles/PMC8523060/ /pubmed/34662367 http://dx.doi.org/10.1371/journal.pone.0257901 Text en © 2021 Bi et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Bi, Yanjing
Li, Chao
Benezeth, Yannick
Yang, Fan
Impacts of multicollinearity on CAPT modalities: An heterogeneous machine learning framework for computer-assisted French phoneme pronunciation training
title Impacts of multicollinearity on CAPT modalities: An heterogeneous machine learning framework for computer-assisted French phoneme pronunciation training
title_full Impacts of multicollinearity on CAPT modalities: An heterogeneous machine learning framework for computer-assisted French phoneme pronunciation training
title_fullStr Impacts of multicollinearity on CAPT modalities: An heterogeneous machine learning framework for computer-assisted French phoneme pronunciation training
title_full_unstemmed Impacts of multicollinearity on CAPT modalities: An heterogeneous machine learning framework for computer-assisted French phoneme pronunciation training
title_short Impacts of multicollinearity on CAPT modalities: An heterogeneous machine learning framework for computer-assisted French phoneme pronunciation training
title_sort impacts of multicollinearity on capt modalities: an heterogeneous machine learning framework for computer-assisted french phoneme pronunciation training
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8523060/
https://www.ncbi.nlm.nih.gov/pubmed/34662367
http://dx.doi.org/10.1371/journal.pone.0257901
work_keys_str_mv AT biyanjing impactsofmulticollinearityoncaptmodalitiesanheterogeneousmachinelearningframeworkforcomputerassistedfrenchphonemepronunciationtraining
AT lichao impactsofmulticollinearityoncaptmodalitiesanheterogeneousmachinelearningframeworkforcomputerassistedfrenchphonemepronunciationtraining
AT benezethyannick impactsofmulticollinearityoncaptmodalitiesanheterogeneousmachinelearningframeworkforcomputerassistedfrenchphonemepronunciationtraining
AT yangfan impactsofmulticollinearityoncaptmodalitiesanheterogeneousmachinelearningframeworkforcomputerassistedfrenchphonemepronunciationtraining