Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Attention-Based Beamformer For Multi-Channel Speech Enhancement

Sep 10, 2024

Jinglin Bai, Hao Li, Xueliang Zhang, Fei Chen

Figure 1 for Attention-Based Beamformer For Multi-Channel Speech Enhancement

Figure 2 for Attention-Based Beamformer For Multi-Channel Speech Enhancement

Figure 3 for Attention-Based Beamformer For Multi-Channel Speech Enhancement

Figure 4 for Attention-Based Beamformer For Multi-Channel Speech Enhancement

Share this with someone who'll enjoy it:

Abstract:Minimum Variance Distortionless Response (MVDR) is a classical adaptive beamformer that theoretically ensures the distortionless transmission of signals in the target direction. Its performance in noise reduction actually depends on the accuracy of the noise spatial covariance matrix (SCM) estimate. Although recent deep learning has shown remarkable performance in multi-channel speech enhancement, the property of distortionless response still makes MVDR highly popular in real applications. In this paper, we propose an attention-based mechanism to calculate the speech and noise SCM and then apply MVDR to obtain the enhanced speech. Moreover, a deep learning architecture using the inplace convolution operator and frequency-independent LSTM has proven effective in facilitating SCM estimation. The model is optimized in an end-to-end manner. Experimental results indicate that the proposed method is extremely effective in tracking moving or stationary speakers under non-causal and causal conditions, outperforming other baselines. It is worth mentioning that our model has only 0.35 million parameters, making it easy to be deployed on edge devices.

View paper on

Share this with someone who'll enjoy it:

Title:Attention-Based Beamformer For Multi-Channel Speech Enhancement

Paper and Code