Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:ZegOT: Zero-shot Segmentation Through Optimal Transport of Text Prompts

Jan 28, 2023

Kwanyoung Kim, Yujin Oh, Jong Chul Ye

Figure 1 for ZegOT: Zero-shot Segmentation Through Optimal Transport of Text Prompts

Figure 2 for ZegOT: Zero-shot Segmentation Through Optimal Transport of Text Prompts

Figure 3 for ZegOT: Zero-shot Segmentation Through Optimal Transport of Text Prompts

Figure 4 for ZegOT: Zero-shot Segmentation Through Optimal Transport of Text Prompts

Share this with someone who'll enjoy it:

Abstract:Recent success of large-scale Contrastive Language-Image Pre-training (CLIP) has led to great promise in zero-shot semantic segmentation by transferring image-text aligned knowledge to pixel-level classification. However, existing methods usually require an additional image encoder or retraining/tuning the CLIP module. Here, we present a cost-effective strategy using text-prompt learning that keeps the entire CLIP module frozen while fully leveraging its rich information. Specifically, we propose a novel Zero-shot segmentation with Optimal Transport (ZegOT) method that matches multiple text prompts with frozen image embeddings through optimal transport, which allows each text prompt to efficiently focus on specific semantic attributes. Additionally, we propose Deep Local Feature Alignment (DLFA) that deeply aligns the text prompts with intermediate local feature of the frozen image encoder layers, which significantly boosts the zero-shot segmentation performance. Through extensive experiments on benchmark datasets, we show that our method achieves the state-of-the-art (SOTA) performance with only x7 lighter parameters compared to previous SOTA approaches.

* 16pages, 9 figures

View paper on

Share this with someone who'll enjoy it:

Title:ZegOT: Zero-shot Segmentation Through Optimal Transport of Text Prompts

Paper and Code