Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:CAT-SAM: Conditional Tuning Network for Few-Shot Adaptation of Segmentation Anything Model

Feb 06, 2024

Aoran Xiao, Weihao Xuan, Heli Qi, Yun Xing, Ruijie Ren, Xiaoqin Zhang, Shijian Lu

Figure 1 for CAT-SAM: Conditional Tuning Network for Few-Shot Adaptation of Segmentation Anything Model

Figure 2 for CAT-SAM: Conditional Tuning Network for Few-Shot Adaptation of Segmentation Anything Model

Figure 3 for CAT-SAM: Conditional Tuning Network for Few-Shot Adaptation of Segmentation Anything Model

Figure 4 for CAT-SAM: Conditional Tuning Network for Few-Shot Adaptation of Segmentation Anything Model

Share this with someone who'll enjoy it:

Abstract:The recent Segment Anything Model (SAM) has demonstrated remarkable zero-shot capability and flexible geometric prompting in general image segmentation. However, SAM often struggles when handling various unconventional images, such as aerial, medical, and non-RGB images. This paper presents CAT-SAM, a ConditionAl Tuning network that adapts SAM toward various unconventional target tasks with just few-shot target samples. CAT-SAM freezes the entire SAM and adapts its mask decoder and image encoder simultaneously with a small number of learnable parameters. The core design is a prompt bridge structure that enables decoder-conditioned joint tuning of the heavyweight image encoder and the lightweight mask decoder. The bridging maps the prompt token of the mask decoder to the image encoder, fostering synergic adaptation of the encoder and the decoder with mutual benefits. We develop two representative tuning strategies for the image encoder which leads to two CAT-SAM variants: one injecting learnable prompt tokens in the input space and the other inserting lightweight adapter networks. Extensive experiments over 11 unconventional tasks show that both CAT-SAM variants achieve superior target segmentation performance consistently even under the very challenging one-shot adaptation setup. Project page: \url{https://xiaoaoran.github.io/projects/CAT-SAM}

* Project page: https://xiaoaoran.github.io/projects/CAT-SAM

View paper on

Share this with someone who'll enjoy it:

Title:CAT-SAM: Conditional Tuning Network for Few-Shot Adaptation of Segmentation Anything Model

Paper and Code