Armando Domi

TrickVOS: A Bag of Tricks for Video Object Segmentation

Jun 28, 2023

Evangelos Skartados, Konstantinos Georgiadis, Mehmet Kerim Yucel, Koskinas Ioannis, Armando Domi, Anastasios Drosou, Bruno Manganelli, Albert Saa-Garriga

Abstract: Space-time memory (STM) network methods have been dominant in semi-supervised video object segmentation (SVOS) due to their remarkable performance. In this work, we identify three key aspects where such methods can be improved: i) supervisory signal, ii) pretraining and iii) spatial awareness. We then propose TrickVOS, a generic, method-agnostic bag of tricks addressing each aspect with i) a structure-aware hybrid loss, ii) a simple decoder pretraining regime and iii) a cheap tracker that imposes spatial constraints on model predictions. Finally, we propose a lightweight network and show that, when trained with TrickVOS, it achieves results competitive with state-of-the-art methods on the DAVIS and YouTube benchmarks, while being one of the first STM-based SVOS methods that can run in real time on a mobile device.

* Accepted to ICIP 2023
