Picture for Saurabh Saxena

Saurabh Saxena

High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion

Add code
Oct 15, 2024
Viaarxiv icon

Controlling Space and Time with Diffusion Models

Add code
Jul 10, 2024
Viaarxiv icon

Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model

Add code
Dec 20, 2023
Viaarxiv icon

NeRFiller: Completing Scenes via Generative 3D Inpainting

Add code
Dec 07, 2023
Viaarxiv icon

The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation

Add code
Jun 02, 2023
Viaarxiv icon

Monocular Depth Estimation using Diffusion Models

Add code
Feb 28, 2023
Figure 1 for Monocular Depth Estimation using Diffusion Models
Figure 2 for Monocular Depth Estimation using Diffusion Models
Figure 3 for Monocular Depth Estimation using Diffusion Models
Figure 4 for Monocular Depth Estimation using Diffusion Models
Viaarxiv icon

A Generalist Framework for Panoptic Segmentation of Images and Videos

Add code
Oct 12, 2022
Figure 1 for A Generalist Framework for Panoptic Segmentation of Images and Videos
Figure 2 for A Generalist Framework for Panoptic Segmentation of Images and Videos
Figure 3 for A Generalist Framework for Panoptic Segmentation of Images and Videos
Figure 4 for A Generalist Framework for Panoptic Segmentation of Images and Videos
Viaarxiv icon

A Unified Sequence Interface for Vision Tasks

Add code
Jun 15, 2022
Figure 1 for A Unified Sequence Interface for Vision Tasks
Figure 2 for A Unified Sequence Interface for Vision Tasks
Figure 3 for A Unified Sequence Interface for Vision Tasks
Figure 4 for A Unified Sequence Interface for Vision Tasks
Viaarxiv icon

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Add code
May 23, 2022
Figure 1 for Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Figure 2 for Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Figure 3 for Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Figure 4 for Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Viaarxiv icon

Pix2seq: A Language Modeling Framework for Object Detection

Add code
Sep 22, 2021
Figure 1 for Pix2seq: A Language Modeling Framework for Object Detection
Figure 2 for Pix2seq: A Language Modeling Framework for Object Detection
Figure 3 for Pix2seq: A Language Modeling Framework for Object Detection
Figure 4 for Pix2seq: A Language Modeling Framework for Object Detection
Viaarxiv icon