Cargando…

A Real-Time Dual-Microphone Speech Enhancement Algorithm Assisted by Bone Conduction Sensor

The quality and intelligibility of the speech are usually impaired by the interference of background noise when using internet voice calls. To solve this problem in the context of wearable smart devices, this paper introduces a dual-microphone, bone-conduction (BC) sensor assisted beamformer and a s...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Yi, Chen, Yufan, Ma, Yongbao, Liu, Hongqing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7571026/
https://www.ncbi.nlm.nih.gov/pubmed/32899533
http://dx.doi.org/10.3390/s20185050
_version_ 1783597081810698240
author Zhou, Yi
Chen, Yufan
Ma, Yongbao
Liu, Hongqing
author_facet Zhou, Yi
Chen, Yufan
Ma, Yongbao
Liu, Hongqing
author_sort Zhou, Yi
collection PubMed
description The quality and intelligibility of the speech are usually impaired by the interference of background noise when using internet voice calls. To solve this problem in the context of wearable smart devices, this paper introduces a dual-microphone, bone-conduction (BC) sensor assisted beamformer and a simple recurrent unit (SRU)-based neural network postfilter for real-time speech enhancement. Assisted by the BC sensor, which is insensitive to the environmental noise compared to the regular air-conduction (AC) microphone, the accurate voice activity detection (VAD) can be obtained from the BC signal and incorporated into the adaptive noise canceller (ANC) and adaptive block matrix (ABM). The SRU-based postfilter consists of a recurrent neural network with a small number of parameters, which improves the computational efficiency. The sub-band signal processing is designed to compress the input features of the neural network, and the scale-invariant signal-to-distortion ratio (SI-SDR) is developed as the loss function to minimize the distortion of the desired speech signal. Experimental results demonstrate that the proposed real-time speech enhancement system provides significant speech sound quality and intelligibility improvements for all noise types and levels when compared with the AC-only beamformer with a postfiltering algorithm.
format Online
Article
Text
id pubmed-7571026
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75710262020-10-28 A Real-Time Dual-Microphone Speech Enhancement Algorithm Assisted by Bone Conduction Sensor Zhou, Yi Chen, Yufan Ma, Yongbao Liu, Hongqing Sensors (Basel) Article The quality and intelligibility of the speech are usually impaired by the interference of background noise when using internet voice calls. To solve this problem in the context of wearable smart devices, this paper introduces a dual-microphone, bone-conduction (BC) sensor assisted beamformer and a simple recurrent unit (SRU)-based neural network postfilter for real-time speech enhancement. Assisted by the BC sensor, which is insensitive to the environmental noise compared to the regular air-conduction (AC) microphone, the accurate voice activity detection (VAD) can be obtained from the BC signal and incorporated into the adaptive noise canceller (ANC) and adaptive block matrix (ABM). The SRU-based postfilter consists of a recurrent neural network with a small number of parameters, which improves the computational efficiency. The sub-band signal processing is designed to compress the input features of the neural network, and the scale-invariant signal-to-distortion ratio (SI-SDR) is developed as the loss function to minimize the distortion of the desired speech signal. Experimental results demonstrate that the proposed real-time speech enhancement system provides significant speech sound quality and intelligibility improvements for all noise types and levels when compared with the AC-only beamformer with a postfiltering algorithm. MDPI 2020-09-05 /pmc/articles/PMC7571026/ /pubmed/32899533 http://dx.doi.org/10.3390/s20185050 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhou, Yi
Chen, Yufan
Ma, Yongbao
Liu, Hongqing
A Real-Time Dual-Microphone Speech Enhancement Algorithm Assisted by Bone Conduction Sensor
title A Real-Time Dual-Microphone Speech Enhancement Algorithm Assisted by Bone Conduction Sensor
title_full A Real-Time Dual-Microphone Speech Enhancement Algorithm Assisted by Bone Conduction Sensor
title_fullStr A Real-Time Dual-Microphone Speech Enhancement Algorithm Assisted by Bone Conduction Sensor
title_full_unstemmed A Real-Time Dual-Microphone Speech Enhancement Algorithm Assisted by Bone Conduction Sensor
title_short A Real-Time Dual-Microphone Speech Enhancement Algorithm Assisted by Bone Conduction Sensor
title_sort real-time dual-microphone speech enhancement algorithm assisted by bone conduction sensor
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7571026/
https://www.ncbi.nlm.nih.gov/pubmed/32899533
http://dx.doi.org/10.3390/s20185050
work_keys_str_mv AT zhouyi arealtimedualmicrophonespeechenhancementalgorithmassistedbyboneconductionsensor
AT chenyufan arealtimedualmicrophonespeechenhancementalgorithmassistedbyboneconductionsensor
AT mayongbao arealtimedualmicrophonespeechenhancementalgorithmassistedbyboneconductionsensor
AT liuhongqing arealtimedualmicrophonespeechenhancementalgorithmassistedbyboneconductionsensor
AT zhouyi realtimedualmicrophonespeechenhancementalgorithmassistedbyboneconductionsensor
AT chenyufan realtimedualmicrophonespeechenhancementalgorithmassistedbyboneconductionsensor
AT mayongbao realtimedualmicrophonespeechenhancementalgorithmassistedbyboneconductionsensor
AT liuhongqing realtimedualmicrophonespeechenhancementalgorithmassistedbyboneconductionsensor