Picture for Zongzhang Zhang

Zongzhang Zhang

Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning

Add code
Feb 07, 2025
Viaarxiv icon

Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay

Add code
Nov 16, 2024
Figure 1 for Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay
Figure 2 for Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay
Figure 3 for Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay
Figure 4 for Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay
Viaarxiv icon

ODRL: A Benchmark for Off-Dynamics Reinforcement Learning

Add code
Oct 28, 2024
Figure 1 for ODRL: A Benchmark for Off-Dynamics Reinforcement Learning
Figure 2 for ODRL: A Benchmark for Off-Dynamics Reinforcement Learning
Figure 3 for ODRL: A Benchmark for Off-Dynamics Reinforcement Learning
Figure 4 for ODRL: A Benchmark for Off-Dynamics Reinforcement Learning
Viaarxiv icon

Hindsight Preference Learning for Offline Preference-based Reinforcement Learning

Add code
Jul 05, 2024
Figure 1 for Hindsight Preference Learning for Offline Preference-based Reinforcement Learning
Figure 2 for Hindsight Preference Learning for Offline Preference-based Reinforcement Learning
Figure 3 for Hindsight Preference Learning for Offline Preference-based Reinforcement Learning
Figure 4 for Hindsight Preference Learning for Offline Preference-based Reinforcement Learning
Viaarxiv icon

Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models

Add code
Jul 04, 2024
Viaarxiv icon

Q-Adapter: Training Your LLM Adapter as a Residual Q-Function

Add code
Jul 04, 2024
Viaarxiv icon

$\text{Alpha}^2$: Discovering Logical Formulaic Alphas using Deep Reinforcement Learning

Add code
Jun 26, 2024
Figure 1 for $\text{Alpha}^2$: Discovering Logical Formulaic Alphas using Deep Reinforcement Learning
Figure 2 for $\text{Alpha}^2$: Discovering Logical Formulaic Alphas using Deep Reinforcement Learning
Figure 3 for $\text{Alpha}^2$: Discovering Logical Formulaic Alphas using Deep Reinforcement Learning
Figure 4 for $\text{Alpha}^2$: Discovering Logical Formulaic Alphas using Deep Reinforcement Learning
Viaarxiv icon

Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation

Add code
Mar 12, 2024
Figure 1 for Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Figure 2 for Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Figure 3 for Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Figure 4 for Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Viaarxiv icon

Reinforced In-Context Black-Box Optimization

Add code
Feb 27, 2024
Viaarxiv icon

Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics

Add code
Feb 17, 2024
Figure 1 for Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
Figure 2 for Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
Figure 3 for Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
Figure 4 for Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
Viaarxiv icon