Picture for Kyoung-Woon On

Kyoung-Woon On

TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback

Add code
Jul 23, 2024
Viaarxiv icon

General Item Representation Learning for Cold-start Content Recommendations

Add code
Apr 22, 2024
Viaarxiv icon

Binary Classifier Optimization for Large Language Model Alignment

Add code
Apr 06, 2024
Viaarxiv icon

Semiparametric Token-Sequence Co-Supervision

Add code
Mar 14, 2024
Viaarxiv icon

How Well Do Large Language Models Truly Ground?

Add code
Nov 15, 2023
Viaarxiv icon

Hexa: Self-Improving for Knowledge-Grounded Dialogue System

Add code
Oct 22, 2023
Viaarxiv icon

Exploiting the Potential of Seq2Seq Models as Robust Few-Shot Learners

Add code
Jul 27, 2023
Viaarxiv icon

Effortless Integration of Memory Management into Open-Domain Conversation Systems

Add code
May 23, 2023
Viaarxiv icon

MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models

Add code
Mar 23, 2023
Viaarxiv icon

Video-Text Representation Learning via Differentiable Weak Temporal Alignment

Add code
Mar 31, 2022
Figure 1 for Video-Text Representation Learning via Differentiable Weak Temporal Alignment
Figure 2 for Video-Text Representation Learning via Differentiable Weak Temporal Alignment
Figure 3 for Video-Text Representation Learning via Differentiable Weak Temporal Alignment
Figure 4 for Video-Text Representation Learning via Differentiable Weak Temporal Alignment
Viaarxiv icon