Title: Learning Referring Video Object Segmentation from Weak Annotation

Aug 04, 2023

Wangbo Zhao, Kepan Nan, Songyang Zhang, Kai Chen, Dahua Lin, Yang You

Abstract: Referring video object segmentation (RVOS) aims to segment the target object in every frame of a video, given a sentence describing that object. Previous RVOS methods have achieved strong performance using densely annotated datasets, which are expensive and time-consuming to construct. To relieve the annotation burden while maintaining sufficient supervision for segmentation, we propose a new annotation scheme in which the frame where the object first appears is labeled with a mask and subsequent frames are labeled with bounding boxes. Based on this scheme, we propose a method that learns from this weak annotation. Specifically, we design a cross-frame segmentation method that uses language-guided dynamic filters to fully exploit the mask annotation and the bounding boxes. We further develop a bi-level contrastive learning method to encourage the model to learn discriminative representations at the pixel level. Extensive experiments and ablation analyses show that our method achieves competitive performance without requiring dense mask annotation. The code will be available at https://github.com/wangbo-zhao/WRVOS/.
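
The annotation scheme described in the abstract (a mask on the frame where the object first appears, bounding boxes afterwards) can be illustrated with a minimal sketch that derives such weak labels from dense per-frame masks. This is not the authors' code: the function names, the dictionary layout, the inclusive `(x_min, y_min, x_max, y_max)` box convention, and the `"none"` label for frames without the object are all assumptions made for illustration.

```python
from typing import Any, Dict, List, Optional, Tuple

Mask = List[List[int]]           # binary mask as rows of 0/1 (assumed layout)
Box = Tuple[int, int, int, int]  # (x_min, y_min, x_max, y_max), inclusive (assumed)


def mask_to_box(mask: Mask) -> Optional[Box]:
    """Return the tight bounding box of nonzero pixels, or None if the mask is empty."""
    ys = [y for y, row in enumerate(mask) if any(row)]
    if not ys:
        return None
    xs = [x for row in mask for x, v in enumerate(row) if v]
    return (min(xs), ys[0], max(xs), ys[-1])


def to_weak_annotation(masks: List[Mask]) -> List[Dict[str, Any]]:
    """Keep the full mask only for the first frame containing the object.

    Later frames containing the object keep just a bounding box.
    Frames where the object is absent are labeled "none" (an assumption;
    the abstract does not say how such frames are handled).
    """
    weak: List[Dict[str, Any]] = []
    seen = False
    for t, mask in enumerate(masks):
        box = mask_to_box(mask)
        if box is None:
            weak.append({"frame": t, "type": "none"})
        elif not seen:
            weak.append({"frame": t, "type": "mask", "mask": mask})
            seen = True
        else:
            weak.append({"frame": t, "type": "box", "box": box})
    return weak
```

Applied to a three-frame clip in which the object appears in the second frame, this would yield one `"none"` entry, one `"mask"` entry, and one `"box"` entry, which reflects the reduced labeling cost the abstract argues for.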
