Kee-Eung Kim

GDPO: Learning to Directly Align Language Models with Diversity Using GFlowNets

Oct 19, 2024

Zero-Shot Multi-Hop Question Answering via Monte-Carlo Tree Search with Large Language Models

Sep 28, 2024

Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL

Jul 20, 2024

SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization

Jun 18, 2024

Kernel Metric Learning for In-Sample Off-Policy Evaluation of Deterministic RL Policies

May 29, 2024

Bayesian Multi-Task Transfer Learning for Soft Prompt Tuning

Feb 13, 2024

Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL

Feb 11, 2024

AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation

Nov 03, 2023

Adapting Text-based Dialogue State Tracker for Spoken Dialogues

Aug 30, 2023

Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions

Oct 25, 2022