Saurabh Saxena

RoMo: Robust Motion Segmentation Improves Structure from Motion

Nov 27, 2024

High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion

Oct 15, 2024

Controlling Space and Time with Diffusion Models

Jul 10, 2024

Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model

Dec 20, 2023

NeRFiller: Completing Scenes via Generative 3D Inpainting

Dec 07, 2023

The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation

Jun 02, 2023

Monocular Depth Estimation using Diffusion Models

Feb 28, 2023

A Generalist Framework for Panoptic Segmentation of Images and Videos

Oct 12, 2022

A Unified Sequence Interface for Vision Tasks

Jun 15, 2022

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

May 23, 2022