Picture for Jiafei Lyu

Jiafei Lyu

ODRL: A Benchmark for Off-Dynamics Reinforcement Learning

Add code
Oct 28, 2024
Figure 1 for ODRL: A Benchmark for Off-Dynamics Reinforcement Learning
Figure 2 for ODRL: A Benchmark for Off-Dynamics Reinforcement Learning
Figure 3 for ODRL: A Benchmark for Off-Dynamics Reinforcement Learning
Figure 4 for ODRL: A Benchmark for Off-Dynamics Reinforcement Learning
Viaarxiv icon

A Large Language Model-Driven Reward Design Framework via Dynamic Feedback for Reinforcement Learning

Add code
Oct 18, 2024
Viaarxiv icon

SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning

Add code
Aug 23, 2024
Viaarxiv icon

A Two-stage Reinforcement Learning-based Approach for Multi-entity Task Allocation

Add code
Jun 29, 2024
Figure 1 for A Two-stage Reinforcement Learning-based Approach for Multi-entity Task Allocation
Figure 2 for A Two-stage Reinforcement Learning-based Approach for Multi-entity Task Allocation
Figure 3 for A Two-stage Reinforcement Learning-based Approach for Multi-entity Task Allocation
Figure 4 for A Two-stage Reinforcement Learning-based Approach for Multi-entity Task Allocation
Viaarxiv icon

World Models with Hints of Large Language Models for Goal Achieving

Add code
Jun 11, 2024
Figure 1 for World Models with Hints of Large Language Models for Goal Achieving
Figure 2 for World Models with Hints of Large Language Models for Goal Achieving
Figure 3 for World Models with Hints of Large Language Models for Goal Achieving
Figure 4 for World Models with Hints of Large Language Models for Goal Achieving
Viaarxiv icon

Cross-Domain Policy Adaptation by Capturing Representation Mismatch

Add code
May 24, 2024
Viaarxiv icon

SEABO: A Simple Search-Based Method for Offline Imitation Learning

Add code
Feb 06, 2024
Figure 1 for SEABO: A Simple Search-Based Method for Offline Imitation Learning
Figure 2 for SEABO: A Simple Search-Based Method for Offline Imitation Learning
Figure 3 for SEABO: A Simple Search-Based Method for Offline Imitation Learning
Figure 4 for SEABO: A Simple Search-Based Method for Offline Imitation Learning
Viaarxiv icon

Understanding What Affects Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence

Add code
Feb 05, 2024
Viaarxiv icon

Exploration and Anti-Exploration with Distributional Random Network Distillation

Add code
Jan 25, 2024
Viaarxiv icon

Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model

Add code
Nov 23, 2023
Figure 1 for Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model
Figure 2 for Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model
Figure 3 for Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model
Figure 4 for Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model
Viaarxiv icon