Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:A Multi-grained based Attention Network for Semi-supervised Sound Event Detection

Jun 21, 2022

Ying Hu, Xiujuan Zhu, Yunlong Li, Hao Huang, Liang He

Figure 1 for A Multi-grained based Attention Network for Semi-supervised Sound Event Detection

Figure 2 for A Multi-grained based Attention Network for Semi-supervised Sound Event Detection

Figure 3 for A Multi-grained based Attention Network for Semi-supervised Sound Event Detection

Figure 4 for A Multi-grained based Attention Network for Semi-supervised Sound Event Detection

Share this with someone who'll enjoy it:

Abstract:Sound event detection (SED) is an interesting but challenging task due to the scarcity of data and diverse sound events in real life. This paper presents a multi-grained based attention network (MGA-Net) for semi-supervised sound event detection. To obtain the feature representations related to sound events, a residual hybrid convolution (RH-Conv) block is designed to boost the vanilla convolution's ability to extract the time-frequency features. Moreover, a multi-grained attention (MGA) module is designed to learn temporal resolution features from coarse-level to fine-level. With the MGA module,the network could capture the characteristics of target events with short- or long-duration, resulting in more accurately determining the onset and offset of sound events. Furthermore, to effectively boost the performance of the Mean Teacher (MT) method, a spatial shift (SS) module as a data perturbation mechanism is introduced to increase the diversity of data. Experimental results show that the MGA-Net outperforms the published state-of-the-art competitors, achieving 53.27% and 56.96% event-based macro F1 (EB-F1) score, 0.709 and 0.739 polyphonic sound detection score (PSDS) on the validation and public set respectively.

* INTERSPEECH 2022

View paper on

Share this with someone who'll enjoy it:

Title:A Multi-grained based Attention Network for Semi-supervised Sound Event Detection

Paper and Code