Paolo Favaro

GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control

Dec 15, 2024

Blind Image Restoration via Fast Diffusion Inversion

May 29, 2024

Grounded Compositional and Diverse Text-to-3D with Pretrained Multi-View Diffusion Model

Apr 28, 2024

Masked and Shuffled Blind Spot Denoising for Real-World Images

Apr 15, 2024

Two Tricks to Improve Unsupervised Segmentation Learning

Apr 08, 2024

Enabling Visual Composition and Animation in Unsupervised Video Generation

Mar 21, 2024

A Quantitative Evaluation of Score Distillation Sampling Based Text-to-3D

Feb 29, 2024

Multi-View Unsupervised Image Generation with Cross Attention Guidance

Dec 07, 2023

SemiGPC: Distribution-Aware Label Refinement for Imbalanced Semi-Supervised Learning Using Gaussian Processes

Nov 03, 2023

Denoising and Selecting Pseudo-Heatmaps for Semi-Supervised Human Pose Estimation

Sep 29, 2023