Yang Yue

Shenzhen University

CVE-Factory: Scaling Expert-Level Agentic Tasks for Code Security Vulnerability

Feb 03, 2026

Kimi K2.5: Visual Agentic Intelligence

Feb 02, 2026

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models

Jan 21, 2026

Co-GRPO: Co-Optimized Group Relative Policy Optimization for Masked Diffusion Model

Dec 25, 2025

SpatialActor: Exploring Disentangled Spatial Representations for Robust Robotic Manipulation

Nov 12, 2025

Klear-AgentForge: Forging Agentic Intelligence through Posttraining Scaling

Nov 08, 2025

Emulating Human-like Adaptive Vision for Efficient and Flexible Machine Visual Perception

Sep 18, 2025

AR-GRPO: Training Autoregressive Image Generation Models via Reinforcement Learning

Aug 09, 2025

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

May 07, 2025

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Apr 18, 2025