Title: Heterogeneous Graph Learning for Acoustic Event Classification

Mar 12, 2023

Amir Shirian, Mona Ahmadian, Krishna Somandepalli, Tanaya Guha

Figures 1-3 for Heterogeneous Graph Learning for Acoustic Event Classification

Abstract: Heterogeneous graphs provide a compact, efficient, and scalable way to model data involving multiple disparate modalities. This makes modeling audiovisual data using heterogeneous graphs an attractive option. However, graph structure does not appear naturally in audiovisual data. Graphs for audiovisual data are constructed manually, which is both difficult and sub-optimal. In this work, we address this problem by (i) proposing a parametric graph construction strategy for the intra-modal edges, and (ii) learning the crossmodal edges. To this end, we develop a new model, the heterogeneous graph crossmodal network (HGCN), that learns the crossmodal edges. Our proposed model can adapt to various spatial and temporal scales owing to its parametric construction, while the learnable crossmodal edges effectively connect the relevant nodes across modalities. Experiments on a large benchmark dataset (AudioSet) show that our model is state-of-the-art (0.53 mean average precision), outperforming transformer-based models and other graph-based models.
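
The two ideas in the abstract, parametric intra-modal edges and data-driven crossmodal edges, can be illustrated with a minimal sketch. This is not the authors' HGCN implementation: the function names, the `span` and `dilation` parameters, and the dot-product scoring with top-k selection are all assumptions chosen for illustration. In an actual model the crossmodal scores would presumably come from trainable parameters rather than from raw features.

```python
import math


def intra_modal_edges(num_nodes, span, dilation=1):
    """Hypothetical parametric construction: link node i to the nodes
    i + k * dilation for k = 1..span, as long as they are in range.
    Changing span and dilation changes the temporal scale of the graph."""
    edges = set()
    for i in range(num_nodes):
        for k in range(1, span + 1):
            j = i + k * dilation
            if j < num_nodes:
                edges.add((i, j))
    return sorted(edges)


def crossmodal_edge_weights(audio_feats, video_feats, top_k):
    """Illustrative stand-in for learned crossmodal edges: score every
    audio/video node pair by dot product, keep the top_k video nodes per
    audio node, and normalise the kept scores with a softmax."""
    weights = {}
    for a_idx, a in enumerate(audio_feats):
        scores = [sum(x * y for x, y in zip(a, v)) for v in video_feats]
        # Indices of the top_k highest-scoring video nodes.
        keep = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)[:top_k]
        # Numerically stable softmax over the kept scores only.
        peak = max(scores[j] for j in keep)
        exps = {j: math.exp(scores[j] - peak) for j in keep}
        total = sum(exps.values())
        for j in keep:
            weights[(a_idx, j)] = exps[j] / total
    return weights


# Toy usage: 5 audio nodes linked at two temporal offsets, and one audio
# node connected to its 2 most similar video nodes.
audio_edges = intra_modal_edges(num_nodes=5, span=2, dilation=2)
cross = crossmodal_edge_weights(
    audio_feats=[[1.0, 0.0]],
    video_feats=[[1.0, 0.0], [0.0, 1.0], [2.0, 0.0]],
    top_k=2,
)
```

In this sketch the intra-modal graph is fully determined by two integers, which is one way to read "parametric construction", while the crossmodal edges depend on the node features themselves.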

* arXiv admin note: text overlap with arXiv:2207.07935
