Picture for Vighnesh Birodkar

Vighnesh Birodkar

Learning Complex Non-Rigid Image Edits from Multimodal Conditioning

Add code
Dec 13, 2024
Viaarxiv icon

Sample what you cant compress

Add code
Sep 04, 2024
Viaarxiv icon

VideoPoet: A Large Language Model for Zero-Shot Video Generation

Add code
Dec 21, 2023
Figure 1 for VideoPoet: A Large Language Model for Zero-Shot Video Generation
Figure 2 for VideoPoet: A Large Language Model for Zero-Shot Video Generation
Figure 3 for VideoPoet: A Large Language Model for Zero-Shot Video Generation
Figure 4 for VideoPoet: A Large Language Model for Zero-Shot Video Generation
Viaarxiv icon

Text and Click inputs for unambiguous open vocabulary instance segmentation

Add code
Nov 24, 2023
Figure 1 for Text and Click inputs for unambiguous open vocabulary instance segmentation
Figure 2 for Text and Click inputs for unambiguous open vocabulary instance segmentation
Figure 3 for Text and Click inputs for unambiguous open vocabulary instance segmentation
Figure 4 for Text and Click inputs for unambiguous open vocabulary instance segmentation
Viaarxiv icon

Scaling Vision Transformers to 22 Billion Parameters

Add code
Feb 10, 2023
Figure 1 for Scaling Vision Transformers to 22 Billion Parameters
Figure 2 for Scaling Vision Transformers to 22 Billion Parameters
Figure 3 for Scaling Vision Transformers to 22 Billion Parameters
Figure 4 for Scaling Vision Transformers to 22 Billion Parameters
Viaarxiv icon

Open-Vocabulary Temporal Action Detection with Off-the-Shelf Image-Text Features

Add code
Dec 20, 2022
Viaarxiv icon

Proper Reuse of Image Classification Features Improves Object Detection

Add code
Apr 01, 2022
Figure 1 for Proper Reuse of Image Classification Features Improves Object Detection
Figure 2 for Proper Reuse of Image Classification Features Improves Object Detection
Figure 3 for Proper Reuse of Image Classification Features Improves Object Detection
Figure 4 for Proper Reuse of Image Classification Features Improves Object Detection
Viaarxiv icon

Less is More: Generating Grounded Navigation Instructions from Landmarks

Add code
Nov 29, 2021
Figure 1 for Less is More: Generating Grounded Navigation Instructions from Landmarks
Figure 2 for Less is More: Generating Grounded Navigation Instructions from Landmarks
Figure 3 for Less is More: Generating Grounded Navigation Instructions from Landmarks
Figure 4 for Less is More: Generating Grounded Navigation Instructions from Landmarks
Viaarxiv icon

The iWildCam 2021 Competition Dataset

Add code
May 07, 2021
Figure 1 for The iWildCam 2021 Competition Dataset
Figure 2 for The iWildCam 2021 Competition Dataset
Figure 3 for The iWildCam 2021 Competition Dataset
Figure 4 for The iWildCam 2021 Competition Dataset
Viaarxiv icon

The surprising impact of mask-head architecture on novel class segmentation

Add code
Apr 01, 2021
Figure 1 for The surprising impact of mask-head architecture on novel class segmentation
Figure 2 for The surprising impact of mask-head architecture on novel class segmentation
Figure 3 for The surprising impact of mask-head architecture on novel class segmentation
Figure 4 for The surprising impact of mask-head architecture on novel class segmentation
Viaarxiv icon