Picture for Shuo Xin

Shuo Xin

OmniVLM: A Token-Compressed, Sub-Billion-Parameter Vision-Language Model for Efficient On-Device Inference

Add code
Dec 16, 2024
Viaarxiv icon

Visual Object Tracking across Diverse Data Modalities: A Review

Add code
Dec 13, 2024
Figure 1 for Visual Object Tracking across Diverse Data Modalities: A Review
Figure 2 for Visual Object Tracking across Diverse Data Modalities: A Review
Figure 3 for Visual Object Tracking across Diverse Data Modalities: A Review
Figure 4 for Visual Object Tracking across Diverse Data Modalities: A Review
Viaarxiv icon

Squid: Long Context as a New Modality for Energy-Efficient On-Device Language Models

Add code
Sep 03, 2024
Figure 1 for Squid: Long Context as a New Modality for Energy-Efficient On-Device Language Models
Figure 2 for Squid: Long Context as a New Modality for Energy-Efficient On-Device Language Models
Figure 3 for Squid: Long Context as a New Modality for Energy-Efficient On-Device Language Models
Figure 4 for Squid: Long Context as a New Modality for Energy-Efficient On-Device Language Models
Viaarxiv icon

SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking

Add code
Mar 28, 2024
Figure 1 for SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking
Figure 2 for SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking
Figure 3 for SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking
Viaarxiv icon