Cargando…

Speech reconstruction using a deep partially supervised neural network

Statistical speech reconstruction for larynx-related dysphonia has achieved good performance using Gaussian mixture models and, more recently, restricted Boltzmann machine arrays; however, deep neural network (DNN)-based systems have been hampered by the limited amount of training data available fro...

Descripción completa

Detalles Bibliográficos
Autores principales:	McLoughlin, Ian, Li, Jingjie, Song, Yan, Sharifzadeh, Hamid R.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	The Institution of Engineering and Technology 2017
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5569940/ https://www.ncbi.nlm.nih.gov/pubmed/28868149 http://dx.doi.org/10.1049/htl.2016.0103

Descripción
Sumario:	Statistical speech reconstruction for larynx-related dysphonia has achieved good performance using Gaussian mixture models and, more recently, restricted Boltzmann machine arrays; however, deep neural network (DNN)-based systems have been hampered by the limited amount of training data available from individual voice-loss patients. The authors propose a novel DNN structure that allows a partially supervised training approach on spectral features from smaller data sets, yielding very good results compared with the current state-of-the-art.

Speech reconstruction using a deep partially supervised neural network

Ejemplares similares