Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Memory Controlled Sequential Self Attention for Sound Recognition

Jun 11, 2020

Arjun Pankajakshan, Helen L. Bear, Vinod Subramanian, Emmanouil Benetos

Figure 1 for Memory Controlled Sequential Self Attention for Sound Recognition

Figure 2 for Memory Controlled Sequential Self Attention for Sound Recognition

Figure 3 for Memory Controlled Sequential Self Attention for Sound Recognition

Share this with someone who'll enjoy it:

Abstract:In this paper we investigate the importance of the extent of memory in sequential self attention for sound recognition. We propose to use a memory controlled sequential self attention mechanism on top of a convolutional recurrent neural network (CRNN) model for polyphonic sound event detection (SED). Experiments on the URBAN-SED dataset demonstrate the impact of the extent of memory on sound recognition performance with the self attention induced SED model. We extend the proposed idea with a multi-head self attention mechanism where each attention head processes the audio embedding with explicit attention width values. The proposed use of memory controlled sequential self attention offers a way to induce relations among frames of sound event tokens. We show that our memory controlled self attention model achieves an event based F -score of 33.92% on the URBAN-SED dataset, outperforming the F -score of 20.10% reported by the model without self attention.

* Submitted to INTERSPEECH 2020

View paper on

Share this with someone who'll enjoy it:

Title:Memory Controlled Sequential Self Attention for Sound Recognition

Paper and Code