Abstract: The emergence of neural representations has revolutionized the way we digitally view a wide range of 3D scenes, enabling the synthesis of photorealistic images rendered from novel views. Recently, several techniques have been proposed for connecting these low-level representations with the high-level semantic understanding embodied within the scene. These methods lift rich semantic understanding from 2D imagery to 3D representations, distilling high-dimensional spatial features into 3D space. In our work, we are interested in connecting language with a dynamic model of the world. We show how to lift spatiotemporal features to a 4D representation based on 3D Gaussian Splatting. This enables an interactive interface where the user can spatiotemporally localize events in the video via text prompts. We demonstrate our system on public 3D video datasets of people and animals performing various actions.
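To make the querying step concrete, below is a minimal sketch (not the paper's implementation) of how a text prompt might spatiotemporally localize an event in a language-embedded 4D Gaussian field. It assumes per-Gaussian, per-timestep features already distilled from a 2D vision-language model and a text embedding in the same feature space; all names here (`feats`, `text_emb`, `localize_event`) are hypothetical.

```python
import torch
import torch.nn.functional as F

def localize_event(feats: torch.Tensor, text_emb: torch.Tensor, top_k: int = 100):
    """Hypothetical query against a language-embedded 4D Gaussian field.

    feats:    (T, N, D) per-timestep, per-Gaussian language features,
              assumed distilled from a 2D vision-language model.
    text_emb: (D,) embedding of the user's text prompt from the same model.
    Returns the best-matching timestep and the indices of the top-k
    most relevant Gaussians at that timestep.
    """
    # Cosine similarity between every Gaussian's feature and the prompt -> (T, N)
    sim = F.cosine_similarity(feats, text_emb[None, None, :], dim=-1)
    # Timestep whose most relevant Gaussian scores highest (temporal localization)
    t_best = sim.max(dim=1).values.argmax().item()
    # Most relevant Gaussians at that timestep (spatial localization)
    gauss_idx = sim[t_best].topk(top_k).indices
    return t_best, gauss_idx
```

In an interactive interface, the returned timestep and Gaussian indices could drive playback and a relevance-based highlight rendering of the localized event.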