Picture for Yao Hu

Yao Hu

Alibaba Group

CLIP-Map: Structured Matrix Mapping for Parameter-Efficient CLIP Compression

Add code
Feb 05, 2026
Viaarxiv icon

Weaver: End-to-End Agentic System Training for Video Interleaved Reasoning

Add code
Feb 05, 2026
Viaarxiv icon

Learning More from Less: Unlocking Internal Representations for Benchmark Compression

Add code
Feb 03, 2026
Viaarxiv icon

IVC-Prune: Revealing the Implicit Visual Coordinates in LVLMs for Vision Token Pruning

Add code
Feb 03, 2026
Viaarxiv icon

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Add code
Feb 02, 2026
Viaarxiv icon

Balancing Understanding and Generation in Discrete Diffusion Models

Add code
Feb 01, 2026
Viaarxiv icon

Decouple Searching from Training: Scaling Data Mixing via Model Merging for Large Language Model Pre-training

Add code
Jan 31, 2026
Viaarxiv icon

Benchmarking Machine Translation on Chinese Social Media Texts

Add code
Jan 30, 2026
Viaarxiv icon

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Add code
Jan 29, 2026
Viaarxiv icon

Self-Compression of Chain-of-Thought via Multi-Agent Reinforcement Learning

Add code
Jan 29, 2026
Viaarxiv icon