Anurag Arnab

VoCap: Video Object Captioning and Segmentation from Any Prompt

Aug 29, 2025

Progressive Data Dropout: An Embarrassingly Simple Approach to Faster Training

May 28, 2025

What Are You Doing? A Closer Look at Controllable Human Video Generation

Mar 06, 2025

From Image to Video: An Empirical Study of Diffusion Representations

Feb 10, 2025

Principles of Visual Tokens for Efficient Video Understanding

Nov 20, 2024

Towards Optimal Adapter Placement for Efficient Transfer Learning

Oct 21, 2024

Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels

Sep 30, 2024

Mixture of Nested Experts: Adaptive Processing of Visual Tokens

Jul 29, 2024

Streaming Dense Video Captioning

Apr 01, 2024

Time-, Memory- and Parameter-Efficient Visual Adaptation

Feb 05, 2024