Picture for Huan Zhang

Huan Zhang

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning

Add code
Oct 14, 2025
Viaarxiv icon

EmoHeal: An End-to-End System for Personalized Therapeutic Music Retrieval from Fine-grained Emotions

Add code
Sep 19, 2025
Viaarxiv icon

Emotion-Aware Speech Generation with Character-Specific Voices for Comics

Add code
Sep 18, 2025
Viaarxiv icon

What to Ask Next? Probing the Imaginative Reasoning of LLMs with TurtleSoup Puzzles

Add code
Aug 14, 2025
Viaarxiv icon

DR-SAC: Distributionally Robust Soft Actor-Critic for Reinforcement Learning under Uncertainty

Add code
Jun 14, 2025
Viaarxiv icon

GUARD: Guided Unlearning and Retention via Data Attribution for Large Language Models

Add code
Jun 12, 2025
Viaarxiv icon

SDP-CROWN: Efficient Bound Propagation for Neural Network Verification with Tightness of Semidefinite Programming

Add code
Jun 07, 2025
Viaarxiv icon

Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay

Add code
Jun 05, 2025
Viaarxiv icon

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Add code
May 30, 2025
Viaarxiv icon

Beyond Freezing: Sparse Tuning Enhances Plasticity in Continual Learning with Pre-Trained Models

Add code
May 26, 2025
Viaarxiv icon