Picture for Zifeng Zhuang

Zifeng Zhuang

Q-WSL:Leveraging Dynamic Programming for Weighted Supervised Learning in Goal-conditioned RL

Add code
Oct 10, 2024
Viaarxiv icon

ADR-BC: Adversarial Density Weighted Regression Behavior Cloning

Add code
May 28, 2024
Figure 1 for ADR-BC: Adversarial Density Weighted Regression Behavior Cloning
Figure 2 for ADR-BC: Adversarial Density Weighted Regression Behavior Cloning
Figure 3 for ADR-BC: Adversarial Density Weighted Regression Behavior Cloning
Figure 4 for ADR-BC: Adversarial Density Weighted Regression Behavior Cloning
Viaarxiv icon

DIDI: Diffusion-Guided Diversity for Offline Behavioral Generation

Add code
May 23, 2024
Viaarxiv icon

Reinformer: Max-Return Sequence Modeling for offline RL

Add code
May 14, 2024
Viaarxiv icon

Context-Former: Stitching via Latent Conditioned Sequence Modeling

Add code
Feb 03, 2024
Viaarxiv icon

Adaptive Proximal Policy Optimization with Upper Confidence Bound

Add code
Dec 12, 2023
Viaarxiv icon

RSG: Fast Learning Adaptive Skills for Quadruped Robots by Skill Graph

Add code
Nov 10, 2023
Viaarxiv icon

STRAPPER: Preference-based Reinforcement Learning via Self-training Augmentation and Peer Regularization

Add code
Jul 19, 2023
Viaarxiv icon

CEIL: Generalized Contextual Imitation Learning

Add code
Jun 26, 2023
Figure 1 for CEIL: Generalized Contextual Imitation Learning
Figure 2 for CEIL: Generalized Contextual Imitation Learning
Figure 3 for CEIL: Generalized Contextual Imitation Learning
Figure 4 for CEIL: Generalized Contextual Imitation Learning
Viaarxiv icon

Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization

Add code
Jun 26, 2023
Figure 1 for Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization
Figure 2 for Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization
Figure 3 for Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization
Figure 4 for Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization
Viaarxiv icon