Picture for Jongjin Park

Jongjin Park

SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs

Add code
Apr 17, 2024
Viaarxiv icon

Preference Transformer: Modeling Human Preferences using Transformers for RL

Add code
Mar 02, 2023
Viaarxiv icon

Meta-Learning with Self-Improving Momentum Target

Add code
Oct 11, 2022
Figure 1 for Meta-Learning with Self-Improving Momentum Target
Figure 2 for Meta-Learning with Self-Improving Momentum Target
Figure 3 for Meta-Learning with Self-Improving Momentum Target
Figure 4 for Meta-Learning with Self-Improving Momentum Target
Viaarxiv icon

SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning

Add code
Mar 18, 2022
Figure 1 for SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning
Figure 2 for SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning
Figure 3 for SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning
Figure 4 for SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning
Viaarxiv icon

Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning

Add code
Oct 27, 2021
Figure 1 for Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning
Figure 2 for Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning
Figure 3 for Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning
Figure 4 for Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning
Viaarxiv icon

OpenCoS: Contrastive Semi-supervised Learning for Handling Open-set Unlabeled Data

Add code
Jun 29, 2021
Figure 1 for OpenCoS: Contrastive Semi-supervised Learning for Handling Open-set Unlabeled Data
Figure 2 for OpenCoS: Contrastive Semi-supervised Learning for Handling Open-set Unlabeled Data
Figure 3 for OpenCoS: Contrastive Semi-supervised Learning for Handling Open-set Unlabeled Data
Figure 4 for OpenCoS: Contrastive Semi-supervised Learning for Handling Open-set Unlabeled Data
Viaarxiv icon

Regularizing Class-wise Predictions via Self-knowledge Distillation

Add code
Apr 07, 2020
Figure 1 for Regularizing Class-wise Predictions via Self-knowledge Distillation
Figure 2 for Regularizing Class-wise Predictions via Self-knowledge Distillation
Figure 3 for Regularizing Class-wise Predictions via Self-knowledge Distillation
Figure 4 for Regularizing Class-wise Predictions via Self-knowledge Distillation
Viaarxiv icon