Picture for Shilin Yan

Shilin Yan

General Compression Framework for Efficient Transformer Object Tracking

Add code
Sep 26, 2024
Viaarxiv icon

VISA: Reasoning Video Object Segmentation via Large Language Models

Add code
Jul 16, 2024
Figure 1 for VISA: Reasoning Video Object Segmentation via Large Language Models
Figure 2 for VISA: Reasoning Video Object Segmentation via Large Language Models
Figure 3 for VISA: Reasoning Video Object Segmentation via Large Language Models
Figure 4 for VISA: Reasoning Video Object Segmentation via Large Language Models
Viaarxiv icon

A Sanity Check for AI-generated Image Detection

Add code
Jun 27, 2024
Viaarxiv icon

Visual Perception by Large Language Model's Weights

Add code
May 30, 2024
Viaarxiv icon

Multi-Modal Generative Embedding Model

Add code
May 29, 2024
Viaarxiv icon

OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning

Add code
Mar 14, 2024
Viaarxiv icon

InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation

Add code
Nov 30, 2023
Viaarxiv icon

PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation

Add code
Sep 22, 2023
Viaarxiv icon

Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation

Add code
May 25, 2023
Viaarxiv icon

Personalize Segment Anything Model with One Shot

Add code
May 04, 2023
Viaarxiv icon