Picture for Zhou Yu

Zhou Yu

University of California, Davis

Serving Large Language Models on Huawei CloudMatrix384

Add code
Jun 15, 2025
Viaarxiv icon

Not All Tokens Are What You Need In Thinking

Add code
May 23, 2025
Viaarxiv icon

Self-Classification Enhancement and Correction for Weakly Supervised Object Detection

Add code
May 22, 2025
Viaarxiv icon

When Thinking Fails: The Pitfalls of Reasoning for Instruction-Following in LLMs

Add code
May 16, 2025
Viaarxiv icon

Bottom-Up Synthesis of Knowledge-Grounded Task-Oriented Dialogues with Iteratively Self-Refined Prompts

Add code
Apr 19, 2025
Viaarxiv icon

Strategize Globally, Adapt Locally: A Multi-Turn Red Teaming Agent with Dual-Level Learning

Add code
Apr 02, 2025
Viaarxiv icon

Injecting Adrenaline into LLM Serving: Boosting Resource Utilization and Throughput via Attention Disaggregation

Add code
Mar 26, 2025
Viaarxiv icon

Growing a Twig to Accelerate Large Vision-Language Models

Add code
Mar 18, 2025
Viaarxiv icon

Semantic Volume: Quantifying and Detecting both External and Internal Uncertainty in LLMs

Add code
Feb 28, 2025
Viaarxiv icon

Program Synthesis Dialog Agents for Interactive Decision-Making

Add code
Feb 26, 2025
Viaarxiv icon