Title: Improving Video Instance Segmentation via Temporal Pyramid Routing

Jul 28, 2021

Xiangtai Li, Hao He, Henghui Ding, Kuiyuan Yang, Guangliang Cheng, Jianping Shi, Yunhai Tong

Abstract: Video Instance Segmentation (VIS) is a new and inherently multi-task problem that aims to detect, segment, and track each instance in a video sequence. Existing approaches are mainly based on single-frame features or on single-scale features of multiple frames, so either temporal information or multi-scale information is ignored. To incorporate both temporal and scale information, we propose a Temporal Pyramid Routing (TPR) strategy that conditionally aligns and performs pixel-level aggregation over a pair of feature pyramids from two adjacent frames. Specifically, TPR contains two novel components: Dynamic Aligned Cell Routing (DACR) and Cross Pyramid Routing (CPR). DACR aligns and gates pyramid features across the temporal dimension, while CPR transfers the temporally aggregated features across the scale dimension. Moreover, our approach is a plug-and-play module that can be easily applied to existing instance segmentation methods. Extensive experiments on the YouTube-VIS dataset demonstrate the effectiveness and efficiency of the proposed approach on several state-of-the-art instance segmentation methods. Code and trained models will be made publicly available to facilitate future research: https://github.com/lxtGH/TemporalPyramidRouting
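The two-stage idea in the abstract (aggregate across time first, then pass the result across scales) can be illustrated with a rough NumPy sketch. This is not the authors' implementation: the function names, the single-vector gate `w_gate`, and the nearest-neighbour upsampling are all assumptions made for illustration, and the alignment step that DACR performs is omitted entirely. Pyramids are assumed to be ordered fine to coarse, with each level half the resolution of the previous one.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def gated_temporal_fusion(cur, ref, w_gate):
    """Fuse one pyramid level of the current frame with the same level
    of an adjacent (reference) frame via a per-pixel gate.

    cur, ref: arrays of shape (C, H, W).
    w_gate:   array of shape (2C,), standing in for a hypothetical
              1x1 convolution that predicts the gate (an assumption).
    """
    stacked = np.concatenate([cur, ref], axis=0)            # (2C, H, W)
    gate = sigmoid(np.tensordot(w_gate, stacked, axes=([0], [0])))  # (H, W)
    # The gate decides, per pixel, how much reference-frame feature to add.
    return cur + gate[None] * ref


def upsample2x(x):
    """Nearest-neighbour 2x upsampling of a (C, H, W) array."""
    return x.repeat(2, axis=1).repeat(2, axis=2)


def route_pyramids(cur_pyr, ref_pyr, w_gates):
    """Temporal step per level, then a top-down pass across scales."""
    fused = [
        gated_temporal_fusion(c, r, w)
        for c, r, w in zip(cur_pyr, ref_pyr, w_gates)
    ]
    # Scale step: start at the coarsest level and add each upsampled
    # result into the next finer level.
    out = [fused[-1]]
    for level in reversed(fused[:-1]):
        out.insert(0, level + upsample2x(out[0]))
    return out


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    sizes = (16, 8, 4)
    cur_pyr = [rng.normal(size=(4, s, s)) for s in sizes]
    ref_pyr = [rng.normal(size=(4, s, s)) for s in sizes]
    w_gates = [rng.normal(size=8) for _ in sizes]
    for level in route_pyramids(cur_pyr, ref_pyr, w_gates):
        print(level.shape)
```

The output pyramid has the same shapes as the input pyramid, which is what allows a module of this kind to be dropped in between a backbone's feature pyramid and an existing instance segmentation head.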
