Picture for Kai-Wei Chang

Kai-Wei Chang

Agent Q-Mix: Selecting the Right Action for LLM Multi-Agent Systems through Reinforcement Learning

Add code
Apr 01, 2026
Viaarxiv icon

Open-Domain Safety Policy Construction

Add code
Apr 01, 2026
Viaarxiv icon

TaigiSpeech: A Low-Resource Real-World Speech Intent Dataset and Preliminary Results with Scalable Data Mining In-the-Wild

Add code
Mar 23, 2026
Viaarxiv icon

TiCo: Time-Controllable Training for Spoken Dialogue Models

Add code
Mar 23, 2026
Viaarxiv icon

TaoBench: Do Automated Theorem Prover LLMs Generalize Beyond MathLib?

Add code
Mar 13, 2026
Viaarxiv icon

Learning Structured Reasoning via Tractable Trajectory Control

Add code
Mar 02, 2026
Viaarxiv icon

Scale Can't Overcome Pragmatics: The Impact of Reporting Bias on Vision-Language Reasoning

Add code
Feb 26, 2026
Viaarxiv icon

Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability

Add code
Feb 02, 2026
Viaarxiv icon

AQAScore: Evaluating Semantic Alignment in Text-to-Audio Generation via Audio Question Answering

Add code
Jan 21, 2026
Viaarxiv icon

On the Fallacy of Global Token Perplexity in Spoken Language Model Evaluation

Add code
Jan 09, 2026
Viaarxiv icon