Picture for Liangtao Shi

Liangtao Shi

Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model

Add code
Nov 16, 2024
Viaarxiv icon

Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference

Add code
May 23, 2024
Viaarxiv icon

Autoregressive Queries for Adaptive Tracking with Spatio-TemporalTransformers

Add code
Mar 15, 2024
Viaarxiv icon

Explicit Visual Prompts for Visual Object Tracking

Add code
Jan 06, 2024
Viaarxiv icon