Amir Bar

A Lightweight Library for Energy-Based Joint-Embedding Predictive Architectures

Feb 03, 2026

Grounding Generated Videos in Feasible Plans via World Models

Feb 02, 2026

Parallel Stochastic Gradient-Based Planning for World Models

Jan 31, 2026

DeFM: Learning Foundation Representations from Depth for Robotics

Jan 26, 2026

World Models Can Leverage Human Videos for Dexterous Manipulation

Dec 15, 2025

Whole-Body Conditioned Egocentric Video Prediction

Jun 26, 2025

Pixels Versus Priors: Controlling Knowledge Priors in Vision-Language Models through Visual Counterfacts

May 21, 2025

Scaling Language-Free Visual Representation Learning

Apr 01, 2025

Forgotten Polygons: Multimodal Large Language Models are Shape-Blind

Feb 21, 2025

Navigation World Models

Dec 04, 2024