Picture for Sergio Arnaud

Sergio Arnaud

V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning

Add code
Jun 11, 2025
Viaarxiv icon

Locate 3D: Real-World Object Localization via Self-Supervised Learning in 3D

Add code
Apr 19, 2025
Viaarxiv icon

Unifying 2D and 3D Vision-Language Understanding

Add code
Mar 13, 2025
Viaarxiv icon

LIFT-GS: Cross-Scene Render-Supervised Distillation for 3D Language Grounding

Add code
Feb 27, 2025
Viaarxiv icon

What do we learn from a large-scale study of pre-trained visual representations in sim and real environments?

Add code
Oct 03, 2023
Figure 1 for What do we learn from a large-scale study of pre-trained visual representations in sim and real environments?
Figure 2 for What do we learn from a large-scale study of pre-trained visual representations in sim and real environments?
Figure 3 for What do we learn from a large-scale study of pre-trained visual representations in sim and real environments?
Figure 4 for What do we learn from a large-scale study of pre-trained visual representations in sim and real environments?
Viaarxiv icon

Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence?

Add code
Mar 31, 2023
Figure 1 for Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence?
Figure 2 for Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence?
Figure 3 for Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence?
Figure 4 for Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence?
Viaarxiv icon