Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:PALM: Few-Shot Prompt Learning for Audio Language Models

Sep 29, 2024

Asif Hanif, Maha Tufail Agro, Mohammad Areeb Qazi, Hanan Aldarmaki

Figure 1 for PALM: Few-Shot Prompt Learning for Audio Language Models

Figure 2 for PALM: Few-Shot Prompt Learning for Audio Language Models

Figure 3 for PALM: Few-Shot Prompt Learning for Audio Language Models

Figure 4 for PALM: Few-Shot Prompt Learning for Audio Language Models

Share this with someone who'll enjoy it:

Abstract:Audio-Language Models (ALMs) have recently achieved remarkable success in zero-shot audio recognition tasks, which match features of audio waveforms with class-specific text prompt features, inspired by advancements in Vision-Language Models (VLMs). Given the sensitivity of zero-shot performance to the choice of hand-crafted text prompts, many prompt learning techniques have been developed for VLMs. We explore the efficacy of these approaches in ALMs and propose a novel method, Prompt Learning in Audio Language Models (PALM), which optimizes the feature space of the text encoder branch. Unlike existing methods that work in the input space, our approach results in greater training efficiency. We demonstrate the effectiveness of our approach on 11 audio recognition datasets, encompassing a variety of speech-processing tasks, and compare the results with three baselines in a few-shot learning setup. Our method is either on par with or outperforms other approaches while being computationally less demanding. Code is available at https://asif-hanif.github.io/palm/

* EMNLP 2024 (Main)

View paper on

Share this with someone who'll enjoy it:

Title:PALM: Few-Shot Prompt Learning for Audio Language Models

Paper and Code