Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training

Oct 10, 2022

Adrian Bulat, Ricardo Guerrero, Brais Martinez, Georgios Tzimiropoulos

Figure 1 for FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training

Figure 2 for FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training

Figure 3 for FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training

Figure 4 for FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training

Share this with someone who'll enjoy it:

Abstract:This paper is on Few-Shot Object Detection (FSOD), where given a few templates (examples) depicting a novel class (not seen during training), the goal is to detect all of its occurrences within a set of images. From a practical perspective, an FSOD system must fulfil the following desiderata: (a) it must be used as is, without requiring any fine-tuning at test time, (b) it must be able to process an arbitrary number of novel objects concurrently while supporting an arbitrary number of examples from each class and (c) it must achieve accuracy comparable to a closed system. While there are (relatively) few systems that support (a), to our knowledge, there is no system supporting (b) and (c). In this work, we make the following contributions: We introduce, for the first time, a simple, yet powerful, few-shot detection transformer (FS-DETR) that can address both desiderata (a) and (b). Our system builds upon the DETR framework, extending it based on two key ideas: (1) feed the provided visual templates of the novel classes as visual prompts during test time, and (2) ``stamp'' these prompts with pseudo-class embeddings, which are then predicted at the output of the decoder. Importantly, we show that our system is not only more flexible than existing methods, but also, making a step towards satisfying desideratum (c), it is more accurate, matching and outperforming the current state-of-the-art on the most well-established benchmarks (PASCAL VOC & MSCOCO) for FSOD. Code will be made available.

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training

Paper and Code