Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:PILOT: Introducing Transformers for Probabilistic Sound Event Localization

Jun 07, 2021

Christopher Schymura, Benedikt Bönninghoff, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Shoko Araki, Dorothea Kolossa

Figure 1 for PILOT: Introducing Transformers for Probabilistic Sound Event Localization

Figure 2 for PILOT: Introducing Transformers for Probabilistic Sound Event Localization

Figure 3 for PILOT: Introducing Transformers for Probabilistic Sound Event Localization

Figure 4 for PILOT: Introducing Transformers for Probabilistic Sound Event Localization

Share this with someone who'll enjoy it:

Abstract:Sound event localization aims at estimating the positions of sound sources in the environment with respect to an acoustic receiver (e.g. a microphone array). Recent advances in this domain most prominently focused on utilizing deep recurrent neural networks. Inspired by the success of transformer architectures as a suitable alternative to classical recurrent neural networks, this paper introduces a novel transformer-based sound event localization framework, where temporal dependencies in the received multi-channel audio signals are captured via self-attention mechanisms. Additionally, the estimated sound event positions are represented as multivariate Gaussian variables, yielding an additional notion of uncertainty, which many previously proposed deep learning-based systems designed for this application do not provide. The framework is evaluated on three publicly available multi-source sound event localization datasets and compared against state-of-the-art methods in terms of localization error and event detection accuracy. It outperforms all competing systems on all datasets with statistical significant differences in performance.

* Accepted at INTERSPEECH 2021

View paper on

Share this with someone who'll enjoy it:

Title:PILOT: Introducing Transformers for Probabilistic Sound Event Localization

Paper and Code