Picture for Vitor Guizilini

Vitor Guizilini

Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion

Add code
Jan 30, 2025
Figure 1 for Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion
Figure 2 for Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion
Figure 3 for Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion
Figure 4 for Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion
Viaarxiv icon

PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding

Add code
Jan 29, 2025
Viaarxiv icon

Learning from Massive Human Videos for Universal Humanoid Pose Control

Add code
Dec 18, 2024
Figure 1 for Learning from Massive Human Videos for Universal Humanoid Pose Control
Figure 2 for Learning from Massive Human Videos for Universal Humanoid Pose Control
Figure 3 for Learning from Massive Human Videos for Universal Humanoid Pose Control
Figure 4 for Learning from Massive Human Videos for Universal Humanoid Pose Control
Viaarxiv icon

$SE(3)$ Equivariant Ray Embeddings for Implicit Multi-View Depth Estimation

Add code
Nov 11, 2024
Viaarxiv icon

GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion

Add code
Sep 15, 2024
Figure 1 for GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion
Figure 2 for GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion
Figure 3 for GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion
Figure 4 for GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion
Viaarxiv icon

View-Invariant Policy Learning via Zero-Shot Novel View Synthesis

Add code
Sep 05, 2024
Viaarxiv icon

Incorporating dense metric depth into neural 3D representations for view synthesis and relighting

Add code
Sep 04, 2024
Figure 1 for Incorporating dense metric depth into neural 3D representations for view synthesis and relighting
Figure 2 for Incorporating dense metric depth into neural 3D representations for view synthesis and relighting
Figure 3 for Incorporating dense metric depth into neural 3D representations for view synthesis and relighting
Figure 4 for Incorporating dense metric depth into neural 3D representations for view synthesis and relighting
Viaarxiv icon

Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry

Add code
Jun 03, 2024
Viaarxiv icon

Transcrib3D: 3D Referring Expression Resolution through Large Language Models

Add code
Apr 30, 2024
Figure 1 for Transcrib3D: 3D Referring Expression Resolution through Large Language Models
Figure 2 for Transcrib3D: 3D Referring Expression Resolution through Large Language Models
Figure 3 for Transcrib3D: 3D Referring Expression Resolution through Large Language Models
Figure 4 for Transcrib3D: 3D Referring Expression Resolution through Large Language Models
Viaarxiv icon

NeRF-MAE : Masked AutoEncoders for Self Supervised 3D representation Learning for Neural Radiance Fields

Add code
Apr 01, 2024
Figure 1 for NeRF-MAE : Masked AutoEncoders for Self Supervised 3D representation Learning for Neural Radiance Fields
Figure 2 for NeRF-MAE : Masked AutoEncoders for Self Supervised 3D representation Learning for Neural Radiance Fields
Figure 3 for NeRF-MAE : Masked AutoEncoders for Self Supervised 3D representation Learning for Neural Radiance Fields
Figure 4 for NeRF-MAE : Masked AutoEncoders for Self Supervised 3D representation Learning for Neural Radiance Fields
Viaarxiv icon