Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:On Using Transformers for Speech-Separation

Feb 06, 2022

Cem Subakan, Mirco Ravanelli, Samuele Cornell, Francois Grondin, Mirko Bronzi

Figure 1 for On Using Transformers for Speech-Separation

Figure 2 for On Using Transformers for Speech-Separation

Figure 3 for On Using Transformers for Speech-Separation

Figure 4 for On Using Transformers for Speech-Separation

Share this with someone who'll enjoy it:

Abstract:Transformers have enabled major improvements in deep learning. They often outperform recurrent and convolutional models in many tasks while taking advantage of parallel processing. Recently, we have proposed SepFormer, which uses self-attention and obtains state-of-the art results on WSJ0-2/3 Mix datasets for speech separation. In this paper, we extend our previous work by providing results on more datasets including LibriMix, and WHAM!, WHAMR! which include noisy and noisy-reverberant conditions. Moreover we provide denoising, and denoising+dereverberation results in the context of speech enhancement, respectively on WHAM! and WHAMR! datasets. We also investigate incorporating recently proposed efficient self-attention mechanisms inside the SepFormer model, and show that by using efficient self-attention mechanisms it is possible to reduce the memory requirements significantly while performing better than the popular convtasnet model on WSJ0-2Mix dataset.

* arXiv admin note: text overlap with arXiv:2010.13154

View paper on

Share this with someone who'll enjoy it:

Title:On Using Transformers for Speech-Separation

Paper and Code