Vitor Guizilini

Fiducial Exoskeletons: Image-Centric Robot State Estimation

Jan 12, 2026

AnchorDream: Repurposing Video Diffusion for Embodiment-Aware Robot Data Synthesis

Dec 12, 2025

Robot Learning from a Physical World Model

Nov 10, 2025

Do You Know Where Your Camera Is? View-Invariant Policy Learning with Camera Conditioning

Oct 02, 2025

ZeroGrasp: Zero-Shot Shape Reconstruction Enabled Robotic Grasping

Apr 15, 2025

SIRE: SE(3) Intrinsic Rigidity Embeddings

Mar 10, 2025

Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion

Jan 30, 2025

PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding

Jan 29, 2025

Learning from Massive Human Videos for Universal Humanoid Pose Control

Dec 18, 2024

$SE(3)$ Equivariant Ray Embeddings for Implicit Multi-View Depth Estimation

Nov 11, 2024