Abhinav Shrivastava

UPLiFT: Efficient Pixel-Dense Feature Upsampling with Local Attenders

Jan 25, 2026
Via arXiv

Towards Understanding Best Practices for Quantization of Vision-Language Models

Jan 21, 2026
Via arXiv

Characterizing Motion Encoding in Video Diffusion Timesteps

Dec 18, 2025
Via arXiv

Growing Visual Generative Capacity for Pre-Trained MLLMs

Oct 02, 2025
Via arXiv

Towards Multimodal Understanding via Stable Diffusion as a Task-Aware Feature Extractor

Jul 09, 2025
Via arXiv

Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model

Jun 18, 2025
Via arXiv

Imagine, Verify, Execute: Memory-Guided Agentic Exploration with Vision-Language Models

May 12, 2025
Via arXiv

TREND: Tri-teaching for Robust Preference-based Reinforcement Learning with Demonstrations

May 09, 2025
Via arXiv

CoLLM: A Large Language Model for Composed Image Retrieval

Mar 25, 2025
Via arXiv

Utilization of Neighbor Information for Image Classification with Different Levels of Supervision

Mar 18, 2025
Via arXiv