Picture for Yiqin Yang

Yiqin Yang

Episodic Novelty Through Temporal Distance

Add code
Jan 26, 2025
Viaarxiv icon

S-EPOA: Overcoming the Indivisibility of Annotations with Skill-Driven Preference-Based Reinforcement Learning

Add code
Aug 22, 2024
Viaarxiv icon

Bayesian Design Principles for Offline-to-Online Reinforcement Learning

Add code
May 31, 2024
Viaarxiv icon

No Prior Mask: Eliminate Redundant Action for Deep Reinforcement Learning

Add code
Dec 11, 2023
Viaarxiv icon

Unsupervised Behavior Extraction via Random Intent Priors

Add code
Oct 28, 2023
Viaarxiv icon

Learning Diverse Risk Preferences in Population-based Self-play

Add code
May 19, 2023
Viaarxiv icon

The Provable Benefits of Unsupervised Data Sharing for Offline Reinforcement Learning

Add code
Feb 27, 2023
Viaarxiv icon

Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery

Add code
Dec 02, 2022
Figure 1 for Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery
Figure 2 for Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery
Figure 3 for Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery
Figure 4 for Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery
Viaarxiv icon

On the Role of Discount Factor in Offline Reinforcement Learning

Add code
Jun 15, 2022
Figure 1 for On the Role of Discount Factor in Offline Reinforcement Learning
Figure 2 for On the Role of Discount Factor in Offline Reinforcement Learning
Figure 3 for On the Role of Discount Factor in Offline Reinforcement Learning
Figure 4 for On the Role of Discount Factor in Offline Reinforcement Learning
Viaarxiv icon

Offline Reinforcement Learning with Value-based Episodic Memory

Add code
Oct 19, 2021
Figure 1 for Offline Reinforcement Learning with Value-based Episodic Memory
Figure 2 for Offline Reinforcement Learning with Value-based Episodic Memory
Figure 3 for Offline Reinforcement Learning with Value-based Episodic Memory
Figure 4 for Offline Reinforcement Learning with Value-based Episodic Memory
Viaarxiv icon