Doyoung Kim

References Indeed Matter? Reference-Free Preference Optimization for Conversational Query Reformulation

May 10, 2025
Via arXiv

Cognitive Map for Language Models: Optimal Planning via Verbally Representing the World Model

Jun 21, 2024
Via arXiv

Mélange: Cost Efficient Large Language Model Serving by Exploiting GPU Heterogeneity

Apr 22, 2024
Via arXiv

Self-Explore to Avoid the Pit: Improving the Reasoning Capabilities of Language Models with Fine-grained Rewards

Apr 16, 2024
Via arXiv

Semiparametric Token-Sequence Co-Supervision

Mar 14, 2024
Via arXiv

Joint Mechanical and Electrical Adjustment of IRS-aided LEO Satellite MIMO Communications

Jan 12, 2024
Via arXiv

Adaptive Shortcut Debiasing for Online Continual Learning

Dec 14, 2023
Via arXiv

One Size Fits All for Semantic Shifts: Adaptive Prompt Tuning for Continual Learning

Nov 18, 2023
Via arXiv

How Well Do Large Language Models Truly Ground?

Nov 15, 2023
Via arXiv

Robust Data Pruning under Label Noise via Maximizing Re-labeling Accuracy

Nov 02, 2023
Via arXiv