Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers

Dec 01, 2023

Ioannis Kakogeorgiou, Spyros Gidaris, Konstantinos Karantzalos, Nikos Komodakis

Figure 1 for SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers

Figure 2 for SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers

Figure 3 for SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers

Figure 4 for SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers

Share this with someone who'll enjoy it:

Abstract:Unsupervised object-centric learning aims to decompose scenes into interpretable object entities, termed slots. Slot-based auto-encoders stand out as a prominent method for this task. Within them, crucial aspects include guiding the encoder to generate object-specific slots and ensuring the decoder utilizes them during reconstruction. This work introduces two novel techniques, (i) an attention-based self-training approach, which distills superior slot-based attention masks from the decoder to the encoder, enhancing object segmentation, and (ii) an innovative patch-order permutation strategy for autoregressive transformers that strengthens the role of slot vectors in reconstruction. The effectiveness of these strategies is showcased experimentally. The combined approach significantly surpasses prior slot-based autoencoder methods in unsupervised object segmentation, especially with complex real-world images. We provide the implementation code at https://github.com/gkakogeorgiou/spot .

View paper on

Share this with someone who'll enjoy it:

Title:SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers

Paper and Code