Title: Vision-Language Pseudo-Labels for Single-Positive Multi-Label Learning

Oct 24, 2023

Xin Xing, Zhexiao Xiong, Abby Stylianou, Srikumar Sastry, Liyu Gong, Nathan Jacobs

Abstract: This paper presents a novel approach to Single-Positive Multi-label Learning. In general multi-label learning, a model learns to predict multiple labels or categories for a single input image. This contrasts with standard multi-class image classification, where the task is to predict a single label for an image from many possible labels. Single-Positive Multi-label Learning (SPML) specifically considers learning to predict multiple labels when the training data contains only a single annotation per image. Multi-label learning is in many ways a more realistic task than single-label learning, as real-world data often involves instances belonging to multiple categories simultaneously; however, most common computer vision datasets contain predominantly single labels because collecting multiple high-quality annotations for each instance is complex and costly. We propose a novel approach called Vision-Language Pseudo-Labeling (VLPL), which uses a vision-language model to suggest strong positive and negative pseudo-labels, and outperforms the current SOTA methods by 5.5% on Pascal VOC, 18.4% on MS-COCO, 15.2% on NUS-WIDE, and 8.4% on CUB-Birds. Our code and data are available at https://github.com/mvrl/VLPL.
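
To make the pseudo-labeling idea in the abstract concrete, here is a minimal sketch, not the authors' implementation. It assumes a vision-language model has already produced one embedding for the image and one text embedding per candidate label; the function names, the toy 2-D embeddings, and both thresholds (`pos_thresh`, `neg_thresh`) are hypothetical choices made for illustration only.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def pseudo_labels(image_emb, label_embs, observed, pos_thresh=0.8, neg_thresh=0.2):
    """Return one entry per label: 1 (positive), -1 (negative), 0 (unknown).

    `observed` is the index of the single annotated positive label.
    The thresholds are illustrative assumptions, not values from the paper.
    """
    labels = []
    for idx, text_emb in enumerate(label_embs):
        if idx == observed:
            labels.append(1)  # the one ground-truth annotation is always kept
            continue
        score = cosine(image_emb, text_emb)
        if score >= pos_thresh:
            labels.append(1)   # high similarity: positive pseudo-label
        elif score <= neg_thresh:
            labels.append(-1)  # low similarity: negative pseudo-label
        else:
            labels.append(0)   # ambiguous: leave unlabeled
    return labels


# Toy embeddings standing in for real vision-language model outputs.
image_emb = [1.0, 0.0]
label_embs = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.6, 0.8]]
print(pseudo_labels(image_emb, label_embs, observed=0))  # prints [1, 1, -1, 0]
```

In this toy run the single observed label is kept, one additional label is promoted to a positive pseudo-label, one is marked negative, and one stays unknown. The actual scoring and selection rules used by VLPL are in the repository linked in the abstract.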
