Picture for Yiwei Wang

Yiwei Wang

University of California, Merced, USA

Harmonizing Dense and Sparse Signals in Multi-turn RL: Dual-Horizon Credit Assignment for Industrial Sales Agents

Add code
Mar 02, 2026
Viaarxiv icon

PromptCD: Test-Time Behavior Enhancement via Polarity-Prompt Contrastive Decoding

Add code
Feb 24, 2026
Viaarxiv icon

AuTAgent: A Reinforcement Learning Framework for Tool-Augmented Audio Reasoning

Add code
Feb 14, 2026
Viaarxiv icon

AudioRouter: Data Efficient Audio Understanding via RL based Dual Reasoning

Add code
Feb 11, 2026
Viaarxiv icon

Accordion-Thinking: Self-Regulated Step Summaries for Efficient and Readable LLM Reasoning

Add code
Feb 03, 2026
Viaarxiv icon

ProjDevBench: Benchmarking AI Coding Agents on End-to-End Project Development

Add code
Feb 02, 2026
Viaarxiv icon

CamReasoner: Reinforcing Camera Movement Understanding via Structured Spatial Reasoning

Add code
Jan 30, 2026
Viaarxiv icon

Self-Manager: Parallel Agent Loop for Long-form Deep Research

Add code
Jan 25, 2026
Viaarxiv icon

OptiSQL: Executable SQL Generation from Optical Tokens

Add code
Jan 21, 2026
Viaarxiv icon

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

Add code
Jan 20, 2026
Viaarxiv icon