Shutong Ding

FlowCritic: Bridging Value Estimation with Flow Matching in Reinforcement Learning

Oct 26, 2025
Via arXiv

One Policy but Many Worlds: A Scalable Unified Policy for Versatile Humanoid Locomotion

May 24, 2025
Via arXiv

GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning

May 24, 2025
Via arXiv

Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization

May 25, 2024
Via arXiv

Guidance with Spherical Gaussian Constraint for Conditional Diffusion

Feb 05, 2024
Via arXiv

Reduced Policy Optimization for Continuous Control with Hard Constraints

Oct 14, 2023
Via arXiv

Two Sides of The Same Coin: Bridging Deep Equilibrium Models and Neural ODEs via Homotopy Continuation

Oct 14, 2023
Via arXiv