Cargando…

The Use of Phonetic Motor Invariants Can Improve Automatic Phoneme Discrimination

We investigate the use of phonetic motor invariants (MIs), that is, recurring kinematic patterns of the human phonetic articulators, to improve automatic phoneme discrimination. Using a multi-subject database of synchronized speech and lips/tongue trajectories, we first identify MIs commonly associa...

Descripción completa

Detalles Bibliográficos
Autores principales: Castellini, Claudio, Badino, Leonardo, Metta, Giorgio, Sandini, Giulio, Tavella, Michele, Grimaldi, Mirko, Fadiga, Luciano
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3164679/
https://www.ncbi.nlm.nih.gov/pubmed/21912661
http://dx.doi.org/10.1371/journal.pone.0024055
_version_ 1782211069362569216
author Castellini, Claudio
Badino, Leonardo
Metta, Giorgio
Sandini, Giulio
Tavella, Michele
Grimaldi, Mirko
Fadiga, Luciano
author_facet Castellini, Claudio
Badino, Leonardo
Metta, Giorgio
Sandini, Giulio
Tavella, Michele
Grimaldi, Mirko
Fadiga, Luciano
author_sort Castellini, Claudio
collection PubMed
description We investigate the use of phonetic motor invariants (MIs), that is, recurring kinematic patterns of the human phonetic articulators, to improve automatic phoneme discrimination. Using a multi-subject database of synchronized speech and lips/tongue trajectories, we first identify MIs commonly associated with bilabial and dental consonants, and use them to simultaneously segment speech and motor signals. We then build a simple neural network-based regression schema (called Audio-Motor Map, AMM) mapping audio features of these segments to the corresponding MIs. Extensive experimental results show that [Image: see text] a small set of features extracted from the MIs, as originally gathered from articulatory sensors, are dramatically more effective than a large, state-of-the-art set of audio features, in automatically discriminating bilabials from dentals; [Image: see text] the same features, extracted from AMM-reconstructed MIs, are as effective as or better than the audio features, when testing across speakers and coarticulating phonemes; and dramatically better as noise is added to the speech signal. These results seem to support some of the claims of the motor theory of speech perception and add experimental evidence of the actual usefulness of MIs in the more general framework of automated speech recognition.
format Online
Article
Text
id pubmed-3164679
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31646792011-09-12 The Use of Phonetic Motor Invariants Can Improve Automatic Phoneme Discrimination Castellini, Claudio Badino, Leonardo Metta, Giorgio Sandini, Giulio Tavella, Michele Grimaldi, Mirko Fadiga, Luciano PLoS One Research Article We investigate the use of phonetic motor invariants (MIs), that is, recurring kinematic patterns of the human phonetic articulators, to improve automatic phoneme discrimination. Using a multi-subject database of synchronized speech and lips/tongue trajectories, we first identify MIs commonly associated with bilabial and dental consonants, and use them to simultaneously segment speech and motor signals. We then build a simple neural network-based regression schema (called Audio-Motor Map, AMM) mapping audio features of these segments to the corresponding MIs. Extensive experimental results show that [Image: see text] a small set of features extracted from the MIs, as originally gathered from articulatory sensors, are dramatically more effective than a large, state-of-the-art set of audio features, in automatically discriminating bilabials from dentals; [Image: see text] the same features, extracted from AMM-reconstructed MIs, are as effective as or better than the audio features, when testing across speakers and coarticulating phonemes; and dramatically better as noise is added to the speech signal. These results seem to support some of the claims of the motor theory of speech perception and add experimental evidence of the actual usefulness of MIs in the more general framework of automated speech recognition. Public Library of Science 2011-09-01 /pmc/articles/PMC3164679/ /pubmed/21912661 http://dx.doi.org/10.1371/journal.pone.0024055 Text en Castellini et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Castellini, Claudio
Badino, Leonardo
Metta, Giorgio
Sandini, Giulio
Tavella, Michele
Grimaldi, Mirko
Fadiga, Luciano
The Use of Phonetic Motor Invariants Can Improve Automatic Phoneme Discrimination
title The Use of Phonetic Motor Invariants Can Improve Automatic Phoneme Discrimination
title_full The Use of Phonetic Motor Invariants Can Improve Automatic Phoneme Discrimination
title_fullStr The Use of Phonetic Motor Invariants Can Improve Automatic Phoneme Discrimination
title_full_unstemmed The Use of Phonetic Motor Invariants Can Improve Automatic Phoneme Discrimination
title_short The Use of Phonetic Motor Invariants Can Improve Automatic Phoneme Discrimination
title_sort use of phonetic motor invariants can improve automatic phoneme discrimination
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3164679/
https://www.ncbi.nlm.nih.gov/pubmed/21912661
http://dx.doi.org/10.1371/journal.pone.0024055
work_keys_str_mv AT castelliniclaudio theuseofphoneticmotorinvariantscanimproveautomaticphonemediscrimination
AT badinoleonardo theuseofphoneticmotorinvariantscanimproveautomaticphonemediscrimination
AT mettagiorgio theuseofphoneticmotorinvariantscanimproveautomaticphonemediscrimination
AT sandinigiulio theuseofphoneticmotorinvariantscanimproveautomaticphonemediscrimination
AT tavellamichele theuseofphoneticmotorinvariantscanimproveautomaticphonemediscrimination
AT grimaldimirko theuseofphoneticmotorinvariantscanimproveautomaticphonemediscrimination
AT fadigaluciano theuseofphoneticmotorinvariantscanimproveautomaticphonemediscrimination
AT castelliniclaudio useofphoneticmotorinvariantscanimproveautomaticphonemediscrimination
AT badinoleonardo useofphoneticmotorinvariantscanimproveautomaticphonemediscrimination
AT mettagiorgio useofphoneticmotorinvariantscanimproveautomaticphonemediscrimination
AT sandinigiulio useofphoneticmotorinvariantscanimproveautomaticphonemediscrimination
AT tavellamichele useofphoneticmotorinvariantscanimproveautomaticphonemediscrimination
AT grimaldimirko useofphoneticmotorinvariantscanimproveautomaticphonemediscrimination
AT fadigaluciano useofphoneticmotorinvariantscanimproveautomaticphonemediscrimination