Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Improved Zero-Shot Audio Tagging & Classification with Patchout Spectrogram Transformers

Aug 24, 2022

Paul Primus, Gerhard Widmer

Figure 1 for Improved Zero-Shot Audio Tagging & Classification with Patchout Spectrogram Transformers

Figure 2 for Improved Zero-Shot Audio Tagging & Classification with Patchout Spectrogram Transformers

Figure 3 for Improved Zero-Shot Audio Tagging & Classification with Patchout Spectrogram Transformers

Figure 4 for Improved Zero-Shot Audio Tagging & Classification with Patchout Spectrogram Transformers

Share this with someone who'll enjoy it:

Abstract:Standard machine learning models for tagging and classifying acoustic signals cannot handle classes that were not seen during training. Zero-Shot (ZS) learning overcomes this restriction by predicting classes based on adaptable class descriptions. This study sets out to investigate the effectiveness of self-attention-based audio embedding architectures for ZS learning. To this end, we compare the very recent patchout spectrogram transformer with two classic convolutional architectures. We evaluate these three architectures on three tasks and on three different benchmark datasets: general-purpose tagging on AudioSet, environmental sound classification on ESC-50, and instrument tagging on OpenMIC. Our results show that the self-attention-based embedding methods outperform both compared convolutional architectures in all of these settings. By designing training and test data accordingly, we observe that prediction performance suffers significantly when the `semantic distance' between training and new test classes is large, an effect that will deserve more detailed investigations.

* published in EUSIPCO 2022

View paper on

Share this with someone who'll enjoy it:

Title:Improved Zero-Shot Audio Tagging & Classification with Patchout Spectrogram Transformers

Paper and Code