Title: A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement

Feb 23, 2023

Zhepei Wang, Ritwik Giri, Devansh Shah, Jean-Marc Valin, Michael M. Goodwin, Paris Smaragdis


Abstract: In this study, we present an approach to train a single speech enhancement network that can perform both personalized and non-personalized speech enhancement. This is achieved by incorporating a frame-wise conditioning input that specifies the type of enhancement output. To improve the quality of the enhanced output and mitigate oversuppression, we experiment with re-weighting frames according to the presence or absence of speech activity and with applying augmentations to speaker embeddings. By training in a multi-task learning setting, we empirically show that the proposed unified model obtains promising results on both personalized and non-personalized speech enhancement benchmarks and performs comparably to models trained specifically for either task. The strong performance of the proposed method demonstrates that the unified model is a more economical alternative to keeping separate task-specific models during inference.
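The two mechanisms named in the abstract, frame-wise conditioning and re-weighting frames by speech activity, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the use of an all-zero vector to request non-personalized output, and the weight values are all assumptions made for illustration only.

```python
from typing import List, Optional, Sequence


def build_conditioned_frames(
    features: Sequence[Sequence[float]],
    speaker_embedding: Optional[Sequence[float]],
    embedding_dim: int,
) -> List[List[float]]:
    """Append a conditioning vector to every feature frame.

    Assumption (not from the paper): a speaker embedding requests
    personalized output, and an all-zero vector of the same size
    requests non-personalized output.
    """
    if speaker_embedding is None:
        cond = [0.0] * embedding_dim
    else:
        if len(speaker_embedding) != embedding_dim:
            raise ValueError("speaker embedding has the wrong dimension")
        cond = list(speaker_embedding)
    # The same conditioning vector is attached to each frame.
    return [list(frame) + cond for frame in features]


def vad_weighted_loss(
    per_frame_error: Sequence[float],
    speech_active: Sequence[bool],
    speech_weight: float = 1.0,
    silence_weight: float = 1.0,
) -> float:
    """Weighted mean of per-frame errors.

    Frames are weighted by whether speech is active in them. The weight
    values are hypothetical hyperparameters, not values from the paper.
    """
    if len(per_frame_error) != len(speech_active):
        raise ValueError("error and activity sequences must match in length")
    weights = [speech_weight if active else silence_weight
               for active in speech_active]
    total = sum(weights)
    if total == 0:
        return 0.0
    return sum(w * e for w, e in zip(weights, per_frame_error)) / total


if __name__ == "__main__":
    frames = [[0.1, 0.2], [0.3, 0.4]]
    personalized = build_conditioned_frames(frames, [1.0, -1.0], 2)
    generic = build_conditioned_frames(frames, None, 2)
    print(personalized[0], generic[0])
    print(vad_weighted_loss([1.0, 3.0], [True, False], 1.0, 3.0))
```

Because the conditioning is attached per frame, a single set of network weights can serve both modes in this sketch, and the mode can in principle change from one frame to the next.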

* Accepted by ICASSP 2023
