Title: Object-centric architectures enable efficient causal representation learning

Oct 29, 2023

Amin Mansouri, Jason Hartford, Yan Zhang, Yoshua Bengio

Abstract: Causal representation learning has shown a variety of settings in which we can disentangle latent variables with identifiability guarantees (up to some reasonable equivalence class). Common to all of these approaches is the assumption that (1) the latent variables are represented as $d$-dimensional vectors, and (2) the observations are the output of some injective generative function of these latent variables. While these assumptions appear benign, we show that when the observations are of multiple objects, the generative function is no longer injective and disentanglement fails in practice. We can address this failure by combining recent developments in object-centric learning and causal representation learning. By modifying the Slot Attention architecture (arXiv:2006.15055), we develop an object-centric architecture that leverages weak supervision from sparse perturbations to disentangle each object's properties. This approach is more data-efficient in the sense that it requires significantly fewer perturbations than a comparable approach that encodes to a Euclidean space. We show that it successfully disentangles the properties of a set of objects in a series of simple image-based disentanglement experiments.
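
The loss of injectivity with multiple objects can be illustrated with a toy example. The sketch below is not the paper's code: the `render` function, the 8x8 grid, and the (x, y, intensity) encoding of each object are assumptions made purely for illustration. It shows that when per-object latents are stacked into a single flat vector, swapping the order of two objects changes the vector but not the rendered image, so two distinct latents map to the same observation.

```python
import numpy as np

def render(z, size=8):
    """Hypothetical toy generative function (not from the paper).

    z is a flat latent vector of (x, y, intensity) triples, one per object.
    Each object adds its intensity to a single pixel of a size x size image.
    """
    img = np.zeros((size, size))
    for x, y, intensity in z.reshape(-1, 3):
        img[int(y), int(x)] += intensity
    return img

# Two objects, stacked in one order...
z_a = np.array([1, 2, 0.5, 5, 6, 1.0])
# ...and the same two objects stacked in the opposite order.
z_b = np.array([5, 6, 1.0, 1, 2, 0.5])

# The latent vectors differ, yet the images are identical:
# the map from flat latent vectors to images is not injective.
different_latents = not np.array_equal(z_a, z_b)
same_image = np.array_equal(render(z_a), render(z_b))
print(different_latents, same_image)
```

Representing the scene as an unordered set of per-object latents, as object-centric architectures such as Slot Attention do, removes this ambiguity by not assigning meaning to the order of objects.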
