Picture for Rui Zhang

Rui Zhang

Henry

CleanCodec: Efficient and Robust Speech Tokenization via Perceptually Guided Encoding

Add code
Jun 03, 2026
Viaarxiv icon

An Ensembled Latent Factor Model via Differential Evolution and Gradient Descent Optimization

Add code
Jun 03, 2026
Viaarxiv icon

Learning What to Learn: Stage-Specific Data Sets for SFT-then-RL in Small Language Model Reasoning

Add code
Jun 03, 2026
Viaarxiv icon

Microwave Linear Analog Computers Aided Multiuser Communication: General Impedance Matching and Precoding Optimization

Add code
Jun 03, 2026
Viaarxiv icon

GPU-Parallel Multi-Task Reinforcement Learning with Demonstration Guided Policy Optimization

Add code
Jun 02, 2026
Viaarxiv icon

CARE-RL: Capability-Aware Reinforcement Learning for Mitigating Cross-Domain Conflicts

Add code
May 30, 2026
Viaarxiv icon

LithoGRPO: Fast Inverse Lithography via GRPO Reinforced Flow Matching

Add code
May 29, 2026
Viaarxiv icon

Representation Collapse in Sequential Post-Training of Large Language Models

Add code
May 28, 2026
Viaarxiv icon

Tabero: Learning Gentle Manipulation with Closed-Loop Force Feedback from Vision, Touch, and Language

Add code
May 27, 2026
Viaarxiv icon

A Policy-Driven Runtime Layer for Agentic LLM Serving

Add code
May 26, 2026
Viaarxiv icon