Picture for Zichen Wen

Zichen Wen

AudioMarathon: A Comprehensive Benchmark for Long-Context Audio Understanding and Efficiency in Audio LLMs

Add code
Oct 08, 2025
Viaarxiv icon

Are We Using the Right Benchmark: An Evaluation Framework for Visual Token Compression Methods

Add code
Oct 08, 2025
Viaarxiv icon

DTPA: Dynamic Token-level Prefix Augmentation for Controllable Text Generation

Add code
Aug 06, 2025
Viaarxiv icon

TACTIC: Translation Agents with Cognitive-Theoretic Interactive Collaboration

Add code
Jun 11, 2025
Viaarxiv icon

EfficientVLA: Training-Free Acceleration and Compression for Vision-Language-Action Models

Add code
Jun 11, 2025
Viaarxiv icon

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Add code
May 25, 2025
Viaarxiv icon

Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning

Add code
May 18, 2025
Viaarxiv icon

Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation

Add code
Mar 19, 2025
Viaarxiv icon

LEGION: Learning to Ground and Explain for Synthetic Image Detection

Add code
Mar 19, 2025
Viaarxiv icon

Token Pruning in Multimodal Large Language Models: Are We Solving the Right Problem?

Add code
Feb 17, 2025
Viaarxiv icon