Stephen Chung

Thinker: Learning to Think Fast and Slow

May 27, 2025

Learning from Peers in Reasoning Models

May 12, 2025

Interpreting Emergent Planning in Model-Free Reinforcement Learning

Apr 02, 2025

Handling Delay in Real-Time Reinforcement Learning

Mar 30, 2025

Learning from Failures in Multi-Attempt Reinforcement Learning

Mar 04, 2025

Predicting Future Actions of Reinforcement Learning Agents

Oct 29, 2024

Thinker: Learning to Plan and Act

Jul 27, 2023

Structural Credit Assignment with Coordinated Exploration

Jul 25, 2023

Unbiased Weight Maximization

Jul 25, 2023

Domain Generalization for Robust Model-Based Offline Reinforcement Learning

Nov 27, 2022