Picture for Huayu Chen

Huayu Chen

Exploratory Diffusion Policy for Unsupervised Reinforcement Learning

Add code
Feb 11, 2025
Viaarxiv icon

Process Reinforcement through Implicit Rewards

Add code
Feb 03, 2025
Viaarxiv icon

Visual Generation Without Guidance

Add code
Jan 26, 2025
Viaarxiv icon

Free Process Rewards without Process Labels

Add code
Dec 02, 2024
Figure 1 for Free Process Rewards without Process Labels
Figure 2 for Free Process Rewards without Process Labels
Figure 3 for Free Process Rewards without Process Labels
Figure 4 for Free Process Rewards without Process Labels
Viaarxiv icon

Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment

Add code
Oct 12, 2024
Figure 1 for Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment
Figure 2 for Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment
Figure 3 for Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment
Figure 4 for Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment
Viaarxiv icon

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Add code
Oct 10, 2024
Figure 1 for RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Figure 2 for RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Figure 3 for RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Figure 4 for RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Viaarxiv icon

Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control

Add code
Jul 12, 2024
Figure 1 for Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control
Figure 2 for Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control
Figure 3 for Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control
Figure 4 for Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control
Viaarxiv icon

C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory

Add code
Feb 26, 2024
Viaarxiv icon

Noise Contrastive Alignment of Language Models with Explicit Rewards

Add code
Feb 08, 2024
Figure 1 for Noise Contrastive Alignment of Language Models with Explicit Rewards
Figure 2 for Noise Contrastive Alignment of Language Models with Explicit Rewards
Figure 3 for Noise Contrastive Alignment of Language Models with Explicit Rewards
Figure 4 for Noise Contrastive Alignment of Language Models with Explicit Rewards
Viaarxiv icon

Score Regularized Policy Optimization through Diffusion Behavior

Add code
Oct 12, 2023
Viaarxiv icon