Picture for Lili Wu

Lili Wu

Rich-Observation Reinforcement Learning with Continuous Latent Dynamics

Add code
May 29, 2024
Viaarxiv icon

Generalizing Multi-Step Inverse Models for Representation Learning to Finite-Memory POMDPs

Add code
Apr 22, 2024
Viaarxiv icon

PcLast: Discovering Plannable Continuous Latent States

Add code
Nov 06, 2023
Viaarxiv icon

Anytime-valid off-policy inference for contextual bandits

Add code
Oct 19, 2022
Figure 1 for Anytime-valid off-policy inference for contextual bandits
Figure 2 for Anytime-valid off-policy inference for contextual bandits
Figure 3 for Anytime-valid off-policy inference for contextual bandits
Figure 4 for Anytime-valid off-policy inference for contextual bandits
Viaarxiv icon

Parameterized Exploration

Add code
Jul 13, 2019
Figure 1 for Parameterized Exploration
Figure 2 for Parameterized Exploration
Figure 3 for Parameterized Exploration
Figure 4 for Parameterized Exploration
Viaarxiv icon