Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Anson Lei

SPARTAN: A Sparse Transformer Learning Local Causation

Nov 12, 2024

Anson Lei, Bernhard Schölkopf, Ingmar Posner

Figure 1 for SPARTAN: A Sparse Transformer Learning Local Causation

Figure 2 for SPARTAN: A Sparse Transformer Learning Local Causation

Figure 3 for SPARTAN: A Sparse Transformer Learning Local Causation

Figure 4 for SPARTAN: A Sparse Transformer Learning Local Causation

Abstract:Causal structures play a central role in world models that flexibly adapt to changes in the environment. While recent works motivate the benefits of discovering local causal graphs for dynamics modelling, in this work we demonstrate that accurately capturing these relationships in complex settings remains challenging for the current state-of-the-art. To remedy this shortcoming, we postulate that sparsity is a critical ingredient for the discovery of such local causal structures. To this end we present the SPARse TrANsformer World model (SPARTAN), a Transformer-based world model that learns local causal structures between entities in a scene. By applying sparsity regularisation on the attention pattern between object-factored tokens, SPARTAN identifies sparse local causal models that accurately predict future object states. Furthermore, we extend our model to capture sparse interventions with unknown targets on the dynamics of the environment. This results in a highly interpretable world model that can efficiently adapt to changes. Empirically, we evaluate SPARTAN against the current state-of-the-art in object-centric world models on observation-based environments and demonstrate that our model can learn accurate local causal graphs and achieve significantly improved few-shot adaptation to changes in the dynamics of the environment as well as robustness against removing irrelevant distractors.

Via

Access Paper or Ask Questions

Compete and Compose: Learning Independent Mechanisms for Modular World Models

Apr 23, 2024

Anson Lei, Frederik Nolte, Bernhard Schölkopf, Ingmar Posner

Figure 1 for Compete and Compose: Learning Independent Mechanisms for Modular World Models

Figure 2 for Compete and Compose: Learning Independent Mechanisms for Modular World Models

Figure 3 for Compete and Compose: Learning Independent Mechanisms for Modular World Models

Figure 4 for Compete and Compose: Learning Independent Mechanisms for Modular World Models

Abstract:We present COmpetitive Mechanisms for Efficient Transfer (COMET), a modular world model which leverages reusable, independent mechanisms across different environments. COMET is trained on multiple environments with varying dynamics via a two-step process: competition and composition. This enables the model to recognise and learn transferable mechanisms. Specifically, in the competition phase, COMET is trained with a winner-takes-all gradient allocation, encouraging the emergence of independent mechanisms. These are then re-used in the composition phase, where COMET learns to re-compose learnt mechanisms in ways that capture the dynamics of intervened environments. In so doing, COMET explicitly reuses prior knowledge, enabling efficient and interpretable adaptation. We evaluate COMET on environments with image-based observations. In contrast to competitive baselines, we demonstrate that COMET captures recognisable mechanisms without supervision. Moreover, we show that COMET is able to adapt to new environments with varying numbers of objects with improved sample efficiency compared to more conventional finetuning approaches.

Via

Access Paper or Ask Questions

Variational Causal Dynamics: Discovering Modular World Models from Interventions

Jun 22, 2022

Anson Lei, Bernhard Schölkopf, Ingmar Posner

Figure 1 for Variational Causal Dynamics: Discovering Modular World Models from Interventions

Figure 2 for Variational Causal Dynamics: Discovering Modular World Models from Interventions

Figure 3 for Variational Causal Dynamics: Discovering Modular World Models from Interventions

Figure 4 for Variational Causal Dynamics: Discovering Modular World Models from Interventions

Abstract:Latent world models allow agents to reason about complex environments with high-dimensional observations. However, adapting to new environments and effectively leveraging previous knowledge remain significant challenges. We present variational causal dynamics (VCD), a structured world model that exploits the invariance of causal mechanisms across environments to achieve fast and modular adaptation. By causally factorising a transition model, VCD is able to identify reusable components across different environments. This is achieved by combining causal discovery and variational inference to learn a latent representation and transition model jointly in an unsupervised manner. Specifically, we optimise the evidence lower bound jointly over a representation model and a transition model structured as a causal graphical model. In evaluations on simulated environments with state and image observations, we show that VCD is able to successfully identify causal variables, and to discover consistent causal structures across different environments. Moreover, given a small number of observations in a previously unseen, intervened environment, VCD is able to identify the sparse changes in the dynamics and to adapt efficiently. In doing so, VCD significantly extends the capabilities of the current state-of-the-art in latent world models while also comparing favourably in terms of prediction accuracy.

Via

Access Paper or Ask Questions