Cargando…

Modeling speech imitation and ecological learning of auditory-motor maps

Classical models of speech consider an antero-posterior distinction between perceptive and productive functions. However, the selective alteration of neural activity in speech motor centers, via transcranial magnetic stimulation, was shown to affect speech discrimination. On the automatic speech rec...

Descripción completa

Detalles Bibliográficos
Autores principales:	Canevari, Claudia, Badino, Leonardo, D'Ausilio, Alessandro, Fadiga, Luciano, Metta, Giorgio
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2013
Materias:	Psychology
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694210/ https://www.ncbi.nlm.nih.gov/pubmed/23818883 http://dx.doi.org/10.3389/fpsyg.2013.00364

_version_	1782274830447411200
author	Canevari, Claudia Badino, Leonardo D'Ausilio, Alessandro Fadiga, Luciano Metta, Giorgio
author_facet	Canevari, Claudia Badino, Leonardo D'Ausilio, Alessandro Fadiga, Luciano Metta, Giorgio
author_sort	Canevari, Claudia
collection	PubMed
description	Classical models of speech consider an antero-posterior distinction between perceptive and productive functions. However, the selective alteration of neural activity in speech motor centers, via transcranial magnetic stimulation, was shown to affect speech discrimination. On the automatic speech recognition (ASR) side, the recognition systems have classically relied solely on acoustic data, achieving rather good performance in optimal listening conditions. The main limitations of current ASR are mainly evident in the realistic use of such systems. These limitations can be partly reduced by using normalization strategies that minimize inter-speaker variability by either explicitly removing speakers' peculiarities or adapting different speakers to a reference model. In this paper we aim at modeling a motor-based imitation learning mechanism in ASR. We tested the utility of a speaker normalization strategy that uses motor representations of speech and compare it with strategies that ignore the motor domain. Specifically, we first trained a regressor through state-of-the-art machine learning techniques to build an auditory-motor mapping, in a sense mimicking a human learner that tries to reproduce utterances produced by other speakers. This auditory-motor mapping maps the speech acoustics of a speaker into the motor plans of a reference speaker. Since, during recognition, only speech acoustics are available, the mapping is necessary to “recover” motor information. Subsequently, in a phone classification task, we tested the system on either one of the speakers that was used during training or a new one. Results show that in both cases the motor-based speaker normalization strategy slightly but significantly outperforms all other strategies where only acoustics is taken into account.
format	Online Article Text
id	pubmed-3694210
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-36942102013-07-01 Modeling speech imitation and ecological learning of auditory-motor maps Canevari, Claudia Badino, Leonardo D'Ausilio, Alessandro Fadiga, Luciano Metta, Giorgio Front Psychol Psychology Classical models of speech consider an antero-posterior distinction between perceptive and productive functions. However, the selective alteration of neural activity in speech motor centers, via transcranial magnetic stimulation, was shown to affect speech discrimination. On the automatic speech recognition (ASR) side, the recognition systems have classically relied solely on acoustic data, achieving rather good performance in optimal listening conditions. The main limitations of current ASR are mainly evident in the realistic use of such systems. These limitations can be partly reduced by using normalization strategies that minimize inter-speaker variability by either explicitly removing speakers' peculiarities or adapting different speakers to a reference model. In this paper we aim at modeling a motor-based imitation learning mechanism in ASR. We tested the utility of a speaker normalization strategy that uses motor representations of speech and compare it with strategies that ignore the motor domain. Specifically, we first trained a regressor through state-of-the-art machine learning techniques to build an auditory-motor mapping, in a sense mimicking a human learner that tries to reproduce utterances produced by other speakers. This auditory-motor mapping maps the speech acoustics of a speaker into the motor plans of a reference speaker. Since, during recognition, only speech acoustics are available, the mapping is necessary to “recover” motor information. Subsequently, in a phone classification task, we tested the system on either one of the speakers that was used during training or a new one. Results show that in both cases the motor-based speaker normalization strategy slightly but significantly outperforms all other strategies where only acoustics is taken into account. Frontiers Media S.A. 2013-06-27 /pmc/articles/PMC3694210/ /pubmed/23818883 http://dx.doi.org/10.3389/fpsyg.2013.00364 Text en Copyright © 2013 Canevari, Badino, D'Ausilio, Fadiga and Metta. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.
spellingShingle	Psychology Canevari, Claudia Badino, Leonardo D'Ausilio, Alessandro Fadiga, Luciano Metta, Giorgio Modeling speech imitation and ecological learning of auditory-motor maps
title	Modeling speech imitation and ecological learning of auditory-motor maps
title_full	Modeling speech imitation and ecological learning of auditory-motor maps
title_fullStr	Modeling speech imitation and ecological learning of auditory-motor maps
title_full_unstemmed	Modeling speech imitation and ecological learning of auditory-motor maps
title_short	Modeling speech imitation and ecological learning of auditory-motor maps
title_sort	modeling speech imitation and ecological learning of auditory-motor maps
topic	Psychology
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694210/ https://www.ncbi.nlm.nih.gov/pubmed/23818883 http://dx.doi.org/10.3389/fpsyg.2013.00364
work_keys_str_mv	AT canevariclaudia modelingspeechimitationandecologicallearningofauditorymotormaps AT badinoleonardo modelingspeechimitationandecologicallearningofauditorymotormaps AT dausilioalessandro modelingspeechimitationandecologicallearningofauditorymotormaps AT fadigaluciano modelingspeechimitationandecologicallearningofauditorymotormaps AT mettagiorgio modelingspeechimitationandecologicallearningofauditorymotormaps

Modeling speech imitation and ecological learning of auditory-motor maps

Ejemplares similares