Picture for Byung-Jun Lee

Byung-Jun Lee

Adaptive Non-Uniform Timestep Sampling for Diffusion Model Training

Add code
Nov 15, 2024
Figure 1 for Adaptive Non-Uniform Timestep Sampling for Diffusion Model Training
Figure 2 for Adaptive Non-Uniform Timestep Sampling for Diffusion Model Training
Figure 3 for Adaptive Non-Uniform Timestep Sampling for Diffusion Model Training
Figure 4 for Adaptive Non-Uniform Timestep Sampling for Diffusion Model Training
Viaarxiv icon

VPO: Leveraging the Number of Votes in Preference Optimization

Add code
Oct 30, 2024
Viaarxiv icon

Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task

Add code
Oct 15, 2024
Viaarxiv icon

DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation

Add code
Oct 15, 2024
Viaarxiv icon

ARCLE: The Abstraction and Reasoning Corpus Learning Environment for Reinforcement Learning

Add code
Jul 30, 2024
Viaarxiv icon

Offline Imitation Learning by Controlling the Effective Planning Horizon

Add code
Jan 18, 2024
Viaarxiv icon

Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions

Add code
Oct 25, 2022
Viaarxiv icon

OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation

Add code
Jun 21, 2021
Figure 1 for OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
Figure 2 for OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
Figure 3 for OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
Figure 4 for OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
Viaarxiv icon