Picture for Zongzhang Zhang

Zongzhang Zhang

ODRL: A Benchmark for Off-Dynamics Reinforcement Learning

Add code
Oct 28, 2024
Viaarxiv icon

Hindsight Preference Learning for Offline Preference-based Reinforcement Learning

Add code
Jul 05, 2024
Viaarxiv icon

Q-Adapter: Training Your LLM Adapter as a Residual Q-Function

Add code
Jul 04, 2024
Viaarxiv icon

Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models

Add code
Jul 04, 2024
Viaarxiv icon

$\text{Alpha}^2$: Discovering Logical Formulaic Alphas using Deep Reinforcement Learning

Add code
Jun 26, 2024
Viaarxiv icon

Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation

Add code
Mar 12, 2024
Figure 1 for Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Figure 2 for Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Figure 3 for Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Figure 4 for Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Viaarxiv icon

Reinforced In-Context Black-Box Optimization

Add code
Feb 27, 2024
Viaarxiv icon

Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics

Add code
Feb 17, 2024
Figure 1 for Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
Figure 2 for Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
Figure 3 for Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
Figure 4 for Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
Viaarxiv icon

Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations

Add code
Dec 26, 2023
Viaarxiv icon

Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments

Add code
Oct 09, 2023
Viaarxiv icon