Kyle Hoefer

Multi-Channel Speech Denoising for Machine Ears

Feb 17, 2022

Cong Han, E. Merve Kaya, Kyle Hoefer, Malcolm Slaney, Simon Carlile

Abstract: This work describes a speech denoising system for machine ears that aims to improve speech intelligibility and the overall listening experience in noisy environments. We recorded approximately 100 hours of audio data with reverberation and moderate environmental noise using a pair of microphone arrays placed around each of the two ears and then mixed sound recordings to simulate adverse acoustic scenes. Then, we trained a multi-channel speech denoising network (MCSDN) on the mixture of recordings. To improve the training, we employ an unsupervised method, the complex angular central Gaussian mixture model (cACGMM), to acquire cleaner speech from noisy recordings to serve as the learning target. We propose an MCSDN-Beamforming-MCSDN framework for the inference stage. The results of the subjective evaluation show that the cACGMM improves the training data, resulting in better noise reduction and user preference, and that the entire system improves intelligibility and the listening experience in noisy situations.
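The abstract mentions mixing recordings to simulate adverse acoustic scenes but gives no details of the procedure. As a minimal sketch of one common way to do this, the snippet below scales a multi-channel noise recording so that the mixture reaches a chosen signal-to-noise ratio. The function names (`mean_power`, `mix_at_snr`), the -5 dB target, the four channels, and the synthetic signals are all assumptions for illustration and are not taken from the paper.

```python
import math
import random


def mean_power(signal):
    """Average power over all channels and samples of a [channels][samples] list."""
    total = sum(x * x for channel in signal for x in channel)
    count = sum(len(channel) for channel in signal)
    return total / count


def mix_at_snr(speech, noise, snr_db):
    """Scale `noise` so the speech-to-noise power ratio equals `snr_db`, then add.

    Hypothetical helper: the paper does not specify its mixing procedure.
    """
    same_shape = len(speech) == len(noise) and all(
        len(s) == len(n) for s, n in zip(speech, noise)
    )
    if not same_shape:
        raise ValueError("speech and noise must have the same shape")
    noise_power = mean_power(noise)
    if noise_power == 0.0:
        raise ValueError("noise is silent; SNR is undefined")
    # Target noise power is P_speech / 10^(SNR/10). A single gain shared by all
    # channels keeps the inter-channel level differences of the noise intact.
    gain = math.sqrt(mean_power(speech) / (noise_power * 10 ** (snr_db / 10)))
    scaled = [[gain * x for x in channel] for channel in noise]
    mixture = [[s + n for s, n in zip(sc, nc)] for sc, nc in zip(speech, scaled)]
    return mixture, scaled


# Synthetic stand-ins for real recordings (assumed: 4 channels, 1600 samples).
rng = random.Random(0)
speech = [[math.sin(0.05 * t + 0.3 * c) for t in range(1600)] for c in range(4)]
noise = [[rng.gauss(0.0, 1.0) for _ in range(1600)] for _ in range(4)]

mixture, scaled_noise = mix_at_snr(speech, noise, snr_db=-5.0)
achieved = 10 * math.log10(mean_power(speech) / mean_power(scaled_noise))
print(f"achieved SNR: {achieved:.2f} dB")
```

Applying one gain across all channels, rather than normalizing each channel separately, is a deliberate choice in this sketch: it leaves the spatial cues between microphones unchanged, which matters for any downstream beamforming stage.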

* Accepted to ICASSP 2022
