Picture for Jeongyeol Kwon

Jeongyeol Kwon

Two-Timescale Linear Stochastic Approximation: Constant Stepsizes Go a Long Way

Add code
Oct 16, 2024
Viaarxiv icon

RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation

Add code
Jun 03, 2024
Viaarxiv icon

On the Complexity of First-Order Methods in Stochastic Bilevel Optimization

Add code
Feb 11, 2024
Viaarxiv icon

Future Prediction Can be a Strong Evidence of Good History Representation in Partially Observable Environments

Add code
Feb 11, 2024
Viaarxiv icon

Prospective Side Information for Latent MDPs

Add code
Oct 11, 2023
Figure 1 for Prospective Side Information for Latent MDPs
Figure 2 for Prospective Side Information for Latent MDPs
Viaarxiv icon

On Penalty Methods for Nonconvex Bilevel Optimization and First-Order Stochastic Approximation

Add code
Sep 04, 2023
Viaarxiv icon

Feed Two Birds with One Scone: Exploiting Wild Data for Both Out-of-Distribution Generalization and Detection

Add code
Jun 15, 2023
Viaarxiv icon

A Fully First-Order Method for Stochastic Bilevel Optimization

Add code
Jan 26, 2023
Viaarxiv icon

Reward-Mixing MDPs with a Few Latent Contexts are Learnable

Add code
Oct 05, 2022
Viaarxiv icon

Tractable Optimality in Episodic Latent MABs

Add code
Oct 05, 2022
Figure 1 for Tractable Optimality in Episodic Latent MABs
Figure 2 for Tractable Optimality in Episodic Latent MABs
Figure 3 for Tractable Optimality in Episodic Latent MABs
Viaarxiv icon