Abstract: We propose a novel Neural Steering technique that adapts the target area of a spatially aware multi-microphone sound source separation algorithm during inference without retraining the deep neural network (DNN). To achieve this, we first train a DNN to retain speech within a target region, defined by an angular span, while suppressing sound sources originating from other directions. Afterward, a phase shift is applied to the microphone signals, allowing us to shift the center of the target area during inference at negligible additional computational cost. Further, we show that the proposed approach performs well in a wide variety of acoustic scenarios, including ones with several speakers inside and outside the target area and additional noise. In particular, the proposed approach performs on par with DNNs trained explicitly for the steered target area in terms of DNSMOS and SI-SDR.
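To make the steering step concrete, below is a minimal sketch of one way such a per-microphone phase shift could be realized in the STFT domain. It assumes a far-field plane-wave model, known microphone coordinates, and a fixed trained center direction; all names (`steer_stft`, `mic_pos`, `theta_trained`, `theta_new`) are hypothetical illustrations, not the paper's actual implementation, and the sign convention depends on the array geometry and STFT conventions used.

```python
import numpy as np

C = 343.0  # speed of sound [m/s] (assumed)

def unit_vec(azimuth_rad):
    """2-D propagation direction for a given azimuth (assumed planar array)."""
    return np.array([np.cos(azimuth_rad), np.sin(azimuth_rad)])

def steer_stft(X, mic_pos, fs, n_fft, theta_trained, theta_new):
    """Phase-shift multichannel STFT frames so that sources arriving from
    `theta_new` appear to the fixed DNN as if they came from `theta_trained`,
    i.e., shift the center of the target area without retraining.

    X        : complex STFT, shape (mics, freq_bins, frames)
    mic_pos  : microphone coordinates in meters, shape (mics, 2)
    """
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / fs)  # (freq_bins,), must match X
    # Per-microphone far-field delay difference between the two directions
    tau = mic_pos @ (unit_vec(theta_new) - unit_vec(theta_trained)) / C  # (mics,)
    # Compensating phase term; sign may need flipping for other conventions
    phase = np.exp(2j * np.pi * np.outer(tau, freqs))  # (mics, freq_bins)
    return X * phase[:, :, None]
```

Because the operation is a single elementwise complex multiplication per time-frequency bin, its cost is negligible compared to a DNN forward pass, which is consistent with the abstract's claim of negligible additional computational cost.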