The objective of this paper is to develop a model that can discover, track and segment multiple moving objects in a video. We make four contributions: First, we introduce an object-centric segmentation model with a depth-ordered layer representation. This is implemented using a variant of the transformer architecture that ingests optical flow, where each query vector specifies an object and its layer for the entire video. The model can effectively discover multiple moving objects and handle mutual occlusions; Second, we introduce a scalable pipeline for generating synthetic training data with multiple objects, significantly reducing the need for labour-intensive annotations and supporting Sim2Real generalisation; Third, we show that the model learns object permanence and temporal shape consistency, and can predict amodal segmentation masks; Fourth, we evaluate the model on standard video segmentation benchmarks, DAVIS, MoCA, SegTrack and FBMS-59, and achieve state-of-the-art unsupervised segmentation performance, even outperforming several supervised approaches. With test-time adaptation, we observe further performance boosts.
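To make the query-based, depth-ordered design concrete, the following is a minimal illustrative sketch, not the paper's actual model: learned per-object queries attend to optical-flow features of the whole clip through a transformer decoder, and the resulting masks are composited front-to-back to resolve occlusions. All module names, feature sizes, and the specific compositing rule below are assumptions for illustration only.

```python
# Hypothetical sketch of an object-centric, layered query decoder over flow features.
import torch
import torch.nn as nn

class LayeredObjectDecoder(nn.Module):
    def __init__(self, num_objects=3, dim=256):
        super().__init__()
        # One learned query per object; each query is also bound to a depth layer.
        self.queries = nn.Parameter(torch.randn(num_objects, dim))
        layer = nn.TransformerDecoderLayer(d_model=dim, nhead=8, batch_first=True)
        self.decoder = nn.TransformerDecoder(layer, num_layers=3)
        self.mask_head = nn.Linear(dim, dim)  # projects queries to mask embeddings

    def forward(self, flow_feats):
        # flow_feats: (B, T*H*W, dim) flattened per-frame optical-flow features.
        B = flow_feats.shape[0]
        q = self.queries.unsqueeze(0).expand(B, -1, -1)   # (B, N, dim)
        q = self.decoder(q, flow_feats)                    # queries attend to flow
        mask_emb = self.mask_head(q)                       # (B, N, dim)
        # Dot product between mask embeddings and flow features gives per-object logits.
        return torch.einsum("bnd,bpd->bnp", mask_emb, flow_feats)  # (B, N, T*H*W)

def composite_depth_ordered(logits):
    # Assumed compositing: queries are ordered front-to-back; each pixel goes to the
    # front-most object whose mask claims it, which resolves mutual occlusions.
    probs = logits.sigmoid()
    visible = torch.ones_like(probs[:, :1])  # unclaimed space so far
    layers = []
    for n in range(probs.shape[1]):
        layers.append(probs[:, n:n+1] * visible)
        visible = visible * (1 - probs[:, n:n+1])
    return torch.cat(layers, dim=1)
```

Because each query covers the entire video rather than a single frame, a mask can persist through frames where the object is fully occluded, which is one plausible way to read the object-permanence and amodal-mask claims above.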