
Yu Luo

and Other Contributors

Uncertainty-Aware Concept and Motion Segmentation for Semi-Supervised Angiography Videos

Mar 01, 2026
Via arXiv

RLAR: An Agentic Reward System for Multi-task Reinforcement Learning on Large Language Models

Feb 28, 2026
Via arXiv

RAVEL: Reasoning Agents for Validating and Evaluating LLM Text Synthesis

Feb 28, 2026
Via arXiv

FLAC: Maximum Entropy RL via Kinetic Energy Regularized Bridge Matching

Feb 13, 2026
Via arXiv

Contextual Rollout Bandits for Reinforcement Learning with Verifiable Rewards

Feb 09, 2026
Via arXiv

Ratio-Variance Regularized Policy Optimization for Efficient LLM Fine-tuning

Jan 06, 2026
Via arXiv

A Forget-and-Grow Strategy for Deep Reinforcement Learning Scaling in Continuous Control

Jul 03, 2025
Via arXiv

Flow-Based Policy for Online Reinforcement Learning

Jun 15, 2025
Via arXiv

Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition

May 29, 2025
Via arXiv

Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs

May 07, 2025
Via arXiv