Picture for Kee-Eung Kim

Kee-Eung Kim

Monet: Mixture of Monosemantic Experts for Transformers

Add code
Dec 05, 2024
Viaarxiv icon

GDPO: Learning to Directly Align Language Models with Diversity Using GFlowNets

Add code
Oct 19, 2024
Figure 1 for GDPO: Learning to Directly Align Language Models with Diversity Using GFlowNets
Figure 2 for GDPO: Learning to Directly Align Language Models with Diversity Using GFlowNets
Figure 3 for GDPO: Learning to Directly Align Language Models with Diversity Using GFlowNets
Figure 4 for GDPO: Learning to Directly Align Language Models with Diversity Using GFlowNets
Viaarxiv icon

Zero-Shot Multi-Hop Question Answering via Monte-Carlo Tree Search with Large Language Models

Add code
Sep 28, 2024
Viaarxiv icon

Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL

Add code
Jul 20, 2024
Viaarxiv icon

SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization

Add code
Jun 18, 2024
Viaarxiv icon

Kernel Metric Learning for In-Sample Off-Policy Evaluation of Deterministic RL Policies

Add code
May 29, 2024
Viaarxiv icon

Bayesian Multi-Task Transfer Learning for Soft Prompt Tuning

Add code
Feb 13, 2024
Figure 1 for Bayesian Multi-Task Transfer Learning for Soft Prompt Tuning
Figure 2 for Bayesian Multi-Task Transfer Learning for Soft Prompt Tuning
Figure 3 for Bayesian Multi-Task Transfer Learning for Soft Prompt Tuning
Figure 4 for Bayesian Multi-Task Transfer Learning for Soft Prompt Tuning
Viaarxiv icon

Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL

Add code
Feb 11, 2024
Viaarxiv icon

AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation

Add code
Nov 03, 2023
Viaarxiv icon

Adapting Text-based Dialogue State Tracker for Spoken Dialogues

Add code
Aug 30, 2023
Viaarxiv icon