Picture for Yufeng Yuan

Yufeng Yuan

Truncated Proximal Policy Optimization

Add code
Jun 18, 2025
Viaarxiv icon

PAG: Multi-Turn Reinforced LLM Self-Correction with Policy as Generative Verifier

Add code
Jun 12, 2025
Viaarxiv icon

Seed1.5-VL Technical Report

Add code
May 11, 2025
Viaarxiv icon

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

Add code
Apr 08, 2025
Viaarxiv icon

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Add code
Mar 18, 2025
Viaarxiv icon

What's Behind PPO's Collapse in Long-CoT? Value Optimization Holds the Secret

Add code
Mar 03, 2025
Viaarxiv icon

Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots

Add code
Mar 31, 2022
Figure 1 for Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots
Figure 2 for Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots
Figure 3 for Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots
Figure 4 for Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots
Viaarxiv icon

Receptive Multi-granularity Representation for Person Re-Identification

Add code
Aug 31, 2020
Figure 1 for Receptive Multi-granularity Representation for Person Re-Identification
Figure 2 for Receptive Multi-granularity Representation for Person Re-Identification
Figure 3 for Receptive Multi-granularity Representation for Person Re-Identification
Figure 4 for Receptive Multi-granularity Representation for Person Re-Identification
Viaarxiv icon

Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network

Add code
Mar 09, 2020
Figure 1 for Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network
Figure 2 for Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network
Figure 3 for Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network
Figure 4 for Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network
Viaarxiv icon

Relation-Aware Pyramid Network (RapNet) for temporal action proposal

Add code
Aug 09, 2019
Figure 1 for Relation-Aware Pyramid Network (RapNet) for temporal action proposal
Figure 2 for Relation-Aware Pyramid Network (RapNet) for temporal action proposal
Figure 3 for Relation-Aware Pyramid Network (RapNet) for temporal action proposal
Viaarxiv icon