Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Dong Pang

FILM: How can Few-Shot Image Classification Benefit from Pre-Trained Language Models?

Jul 09, 2023

Zihao Jiang, Yunkai Dang, Dong Pang, Huishuai Zhang, Weiran Huang

Figure 1 for FILM: How can Few-Shot Image Classification Benefit from Pre-Trained Language Models?

Figure 2 for FILM: How can Few-Shot Image Classification Benefit from Pre-Trained Language Models?

Figure 3 for FILM: How can Few-Shot Image Classification Benefit from Pre-Trained Language Models?

Figure 4 for FILM: How can Few-Shot Image Classification Benefit from Pre-Trained Language Models?

Abstract:Few-shot learning aims to train models that can be generalized to novel classes with only a few samples. Recently, a line of works are proposed to enhance few-shot learning with accessible semantic information from class names. However, these works focus on improving existing modules such as visual prototypes and feature extractors of the standard few-shot learning framework. This limits the full potential use of semantic information. In this paper, we propose a novel few-shot learning framework that uses pre-trained language models based on contrastive learning. To address the challenge of alignment between visual features and textual embeddings obtained from text-based pre-trained language model, we carefully design the textual branch of our framework and introduce a metric module to generalize the cosine similarity. For better transferability, we let the metric module adapt to different few-shot tasks and adopt MAML to train the model via bi-level optimization. Moreover, we conduct extensive experiments on multiple benchmarks to demonstrate the effectiveness of our method.

Via

Access Paper or Ask Questions