Title: Phrase Grounding-based Style Transfer for Single-Domain Generalized Object Detection

Feb 05, 2024

Hao Li, Wei Wang, Cong Wang, Zhigang Luo, Xinwang Liu, Kenli Li, Xiaochun Cao

Abstract: Single-domain generalized object detection aims to improve a model's generalizability to multiple unseen target domains using only data from a single source domain during training. This is a practical yet challenging task, as it requires the model to address domain shift without incorporating target domain data into training. In this paper, we propose a novel phrase grounding-based style transfer (PGST) approach for the task. Specifically, we first define textual prompts to describe potential objects for each unseen target domain. Then, we leverage the grounded language-image pre-training (GLIP) model to learn the style of these target domains and achieve style transfer from the source to the target domain. The style-transferred source visual features are semantically rich and could be close to imaginary counterparts in the target domain. Finally, we employ these style-transferred visual features to fine-tune GLIP. By introducing imaginary counterparts, the detector can be effectively generalized to unseen target domains using only a single source domain for training. Extensive experimental results on five driving benchmarks with diverse weather conditions demonstrate that our proposed approach achieves state-of-the-art performance, even surpassing some domain adaptive methods that incorporate target domain images into the training process. The source code and pre-trained models will be made available.

* 16 pages, 7 figures
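
The first two steps in the abstract (writing textual prompts per target domain, then re-styling source features) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the class names, the domain descriptions, the prompt template, and the use of per-channel mean and standard deviation replacement (an AdaIN-style operation) as the style-transfer mechanism. The abstract does not specify any of these, and this is not the authors' implementation. In the described method, the style would be learned with GLIP against the prompts; here the style statistics are simply passed in.

```python
import numpy as np

# Hypothetical class names and target-domain descriptions (not from the paper).
CLASSES = ["car", "person", "bus"]
TARGET_DOMAINS = {
    "foggy": "in a foggy scene",
    "rainy_night": "on a rainy night",
}


def build_prompts(classes, domains):
    """Build one textual prompt per (target domain, class) pair."""
    return {
        name: [f"a {cls} {description}" for cls in classes]
        for name, description in domains.items()
    }


def transfer_style(features, style_mean, style_std, eps=1e-5):
    """Re-style a (C, H, W) feature map by swapping per-channel statistics.

    This is an AdaIN-style operation chosen for illustration only:
    normalize each channel of the source features, then rescale and shift
    with the given per-channel style statistics of shape (C,).
    """
    mu = features.mean(axis=(1, 2), keepdims=True)
    sigma = features.std(axis=(1, 2), keepdims=True)
    normalized = (features - mu) / (sigma + eps)
    return normalized * style_std.reshape(-1, 1, 1) + style_mean.reshape(-1, 1, 1)


if __name__ == "__main__":
    prompts = build_prompts(CLASSES, TARGET_DOMAINS)
    print(prompts["foggy"])

    rng = np.random.default_rng(0)
    source_features = rng.normal(size=(4, 8, 8))  # stand-in for backbone features
    style_mean = np.full(4, 0.5)                  # placeholder style statistics
    style_std = np.full(4, 2.0)
    styled = transfer_style(source_features, style_mean, style_std)
    print(styled.shape)
```

Under these assumptions, the re-styled features keep the spatial layout of the source features while taking on the target statistics, which is one simple way to picture the "imaginary counterparts" that the abstract says are used to fine-tune the detector.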
