Shilin Yan

SwimBird: Eliciting Switchable Reasoning Mode in Hybrid Autoregressive MLLMs

Feb 05, 2026
Via arXiv

AdaptMMBench: Benchmarking Adaptive Multimodal Reasoning for Mode Selection and Reasoning Process

Feb 02, 2026
Via arXiv

Jarvis: Towards Personalized AI Assistant via Personal KV-Cache Retrieval

Oct 26, 2025
Via arXiv

Diffusion Language Models Know the Answer Before Decoding

Aug 27, 2025
Via arXiv

MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning

Jun 05, 2025
Via arXiv

Progressive Scaling Visual Object Tracking

May 26, 2025
Via arXiv

Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking

May 26, 2025
Via arXiv

CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms

May 22, 2025
Via arXiv

UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens

May 20, 2025
Via arXiv

T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT

May 01, 2025
Via arXiv