Multiscale video transformers have been explored for a wide variety of vision tasks. To date, however, multiscale processing has been confined to the encoder or decoder alone. We present a unified multiscale encoder-decoder transformer focused on dense prediction tasks in videos. Multiscale representation at both the encoder and decoder yields key benefits: implicit extraction of spatiotemporal features (i.e. without reliance on input optical flow) and temporal consistency at encoding, as well as coarse-to-fine detection of high-level (e.g. object) semantics to guide precise localization at decoding. Moreover, we propose a transductive learning scheme through many-to-many label propagation to provide temporally consistent predictions. We showcase our Multiscale Encoder-Decoder Video Transformer (MED-VT) on Automatic Video Object Segmentation (AVOS) and actor/action segmentation, where we outperform state-of-the-art approaches on multiple benchmarks using only raw images, without optical flow.
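
To make the core idea concrete, the following is a minimal PyTorch sketch of a multiscale encoder-decoder video transformer, not the authors' MED-VT implementation: a toy two-stage backbone produces features at two spatial scales, spatiotemporal tokens at each scale are encoded with a shared transformer layer, and a set of object-level queries is decoded coarse-to-fine before being correlated with the finest-scale features to yield per-frame mask logits. All module names, dimensions, and the toy backbone are illustrative assumptions.

```python
import torch
import torch.nn as nn

class ToyMultiscaleEncoderDecoder(nn.Module):
    """Illustrative sketch of multiscale encoding and coarse-to-fine decoding for video."""

    def __init__(self, channels=(64, 128), d_model=128, num_queries=8):
        super().__init__()
        # Toy per-frame convolutional backbone producing two spatial scales.
        self.stage1 = nn.Conv2d(3, channels[0], 3, stride=4, padding=1)
        self.stage2 = nn.Conv2d(channels[0], channels[1], 3, stride=2, padding=1)
        self.proj = nn.ModuleList([nn.Conv2d(c, d_model, 1) for c in channels])
        # Shared transformer encoder layer applied to each scale's spatiotemporal tokens.
        self.encoder = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        # Object-level queries refined coarse-to-fine against the encoded features.
        self.queries = nn.Parameter(torch.randn(num_queries, d_model))
        self.decoder = nn.TransformerDecoderLayer(d_model, nhead=4, batch_first=True)

    def forward(self, video):                          # video: (B, T, 3, H, W)
        b, t = video.shape[:2]
        x = video.flatten(0, 1)                        # fold time into batch for the backbone
        feats = []
        x = torch.relu(self.stage1(x)); feats.append(x)
        x = torch.relu(self.stage2(x)); feats.append(x)

        # Encode each scale as a sequence of spatiotemporal tokens: (B, T*H*W, C).
        memories = []
        for f, proj in zip(feats, self.proj):
            tok = proj(f).flatten(2).transpose(1, 2)   # (B*T, HW, C)
            tok = tok.reshape(b, -1, tok.shape[-1])    # (B, T*HW, C)
            memories.append(self.encoder(tok))

        # Decode coarse-to-fine: start from the coarsest memory, refine on finer ones.
        q = self.queries.unsqueeze(0).expand(b, -1, -1)   # (B, Q, C)
        for mem in reversed(memories):                     # coarse -> fine
            q = self.decoder(q, mem)

        # Correlate refined queries with finest-scale features to get per-frame mask logits.
        fine = memories[0].reshape(b, t, -1, q.shape[-1])  # (B, T, HW, C)
        masks = torch.einsum("bqc,bthc->btqh", q, fine)    # (B, T, Q, HW)
        return masks

clip = torch.randn(2, 4, 3, 64, 64)                        # two clips of four 64x64 frames
print(ToyMultiscaleEncoderDecoder()(clip).shape)           # torch.Size([2, 4, 8, 256])
```

The sketch operates on raw frames only; temporal context enters solely through attention over spatiotemporal tokens, mirroring the abstract's claim that spatiotemporal features are extracted implicitly rather than from input optical flow. The many-to-many label propagation scheme mentioned above is a separate inference-time component and is not depicted here.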