Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Neurosymbolic Grounding for Compositional World Models

Oct 19, 2023

Atharva Sehgal, Arya Grayeli, Jennifer J. Sun, Swarat Chaudhuri

Figure 1 for Neurosymbolic Grounding for Compositional World Models

Figure 2 for Neurosymbolic Grounding for Compositional World Models

Figure 3 for Neurosymbolic Grounding for Compositional World Models

Figure 4 for Neurosymbolic Grounding for Compositional World Models

Share this with someone who'll enjoy it:

Abstract:We introduce Cosmos, a framework for object-centric world modeling that is designed for compositional generalization (CG), i.e., high performance on unseen input scenes obtained through the composition of known visual "atoms." The central insight behind Cosmos is the use of a novel form of neurosymbolic grounding. Specifically, the framework introduces two new tools: (i) neurosymbolic scene encodings, which represent each entity in a scene using a real vector computed using a neural encoder, as well as a vector of composable symbols describing attributes of the entity, and (ii) a neurosymbolic attention mechanism that binds these entities to learned rules of interaction. Cosmos is end-to-end differentiable; also, unlike traditional neurosymbolic methods that require representations to be manually mapped to symbols, it computes an entity's symbolic attributes using vision-language foundation models. Through an evaluation that considers two different forms of CG on an established blocks-pushing domain, we show that the framework establishes a new state-of-the-art for CG in world modeling.

View paper on

Share this with someone who'll enjoy it:

Title:Neurosymbolic Grounding for Compositional World Models

Paper and Code