Title: MetaAudio: A Few-Shot Audio Classification Benchmark

Apr 10, 2022

Calum Heggan, Sam Budgett, Timothy Hospedales, Mehrdad Yaghoobi

Abstract: Currently available benchmarks for few-shot learning (machine learning with few training examples) are limited in the domains they cover, primarily focusing on image classification. This work aims to alleviate this reliance on image-based benchmarks by offering the first comprehensive, public and fully reproducible audio-based alternative, covering a variety of sound domains and experimental settings. We compare the few-shot classification performance of a variety of techniques on seven audio datasets (spanning environmental sounds to human speech). Extending this, we carry out in-depth analyses of joint training (where all datasets are used during training) and cross-dataset adaptation protocols, establishing the possibility of a generalised audio few-shot classification algorithm. Our experimentation shows that gradient-based meta-learning methods such as MAML and Meta-Curvature consistently outperform both metric and baseline methods. We also demonstrate that the joint training routine helps overall generalisation for the environmental sound databases included, and that it is a somewhat effective method of tackling the cross-dataset/domain setting.

* 9 pages with 1 figure and 2 main results tables. V1 Preprint
