Picture for Mehdi S. M. Sajjadi

Mehdi S. M. Sajjadi

Scaling 4D Representations

Add code
Dec 19, 2024
Figure 1 for Scaling 4D Representations
Figure 2 for Scaling 4D Representations
Figure 3 for Scaling 4D Representations
Figure 4 for Scaling 4D Representations
Viaarxiv icon

TRecViT: A Recurrent Video Transformer

Add code
Dec 18, 2024
Viaarxiv icon

Moving Off-the-Grid: Scene-Grounded Video Representations

Add code
Nov 08, 2024
Figure 1 for Moving Off-the-Grid: Scene-Grounded Video Representations
Figure 2 for Moving Off-the-Grid: Scene-Grounded Video Representations
Figure 3 for Moving Off-the-Grid: Scene-Grounded Video Representations
Figure 4 for Moving Off-the-Grid: Scene-Grounded Video Representations
Viaarxiv icon

DyST: Towards Dynamic Neural Scene Representations on Real-World Videos

Add code
Oct 09, 2023
Figure 1 for DyST: Towards Dynamic Neural Scene Representations on Real-World Videos
Figure 2 for DyST: Towards Dynamic Neural Scene Representations on Real-World Videos
Figure 3 for DyST: Towards Dynamic Neural Scene Representations on Real-World Videos
Figure 4 for DyST: Towards Dynamic Neural Scene Representations on Real-World Videos
Viaarxiv icon

DORSal: Diffusion for Object-centric Representations of Scenes $\textit{et al.}$

Add code
Jun 13, 2023
Viaarxiv icon

Sensitivity of Slot-Based Object-Centric Models to their Number of Slots

Add code
May 30, 2023
Figure 1 for Sensitivity of Slot-Based Object-Centric Models to their Number of Slots
Figure 2 for Sensitivity of Slot-Based Object-Centric Models to their Number of Slots
Figure 3 for Sensitivity of Slot-Based Object-Centric Models to their Number of Slots
Figure 4 for Sensitivity of Slot-Based Object-Centric Models to their Number of Slots
Viaarxiv icon

RePAST: Relative Pose Attention Scene Representation Transformer

Add code
Apr 10, 2023
Viaarxiv icon

PaLM-E: An Embodied Multimodal Language Model

Add code
Mar 06, 2023
Figure 1 for PaLM-E: An Embodied Multimodal Language Model
Figure 2 for PaLM-E: An Embodied Multimodal Language Model
Figure 3 for PaLM-E: An Embodied Multimodal Language Model
Figure 4 for PaLM-E: An Embodied Multimodal Language Model
Viaarxiv icon

Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames

Add code
Feb 09, 2023
Figure 1 for Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames
Figure 2 for Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames
Figure 3 for Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames
Figure 4 for Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames
Viaarxiv icon

RUST: Latent Neural Scene Representations from Unposed Imagery

Add code
Nov 25, 2022
Figure 1 for RUST: Latent Neural Scene Representations from Unposed Imagery
Figure 2 for RUST: Latent Neural Scene Representations from Unposed Imagery
Figure 3 for RUST: Latent Neural Scene Representations from Unposed Imagery
Figure 4 for RUST: Latent Neural Scene Representations from Unposed Imagery
Viaarxiv icon