Yeong-Dae Kwon

Less for More: Enhancing Preference Learning in Generative Language Models with Automated Self-Curation of Training Corpora

Aug 23, 2024

CAAP: Context-Aware Action Planning Prompting to Solve Computer Tasks with Front-End UI Only

Jun 11, 2024

Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation

May 10, 2024

Simulation-guided Beam Search for Neural Combinatorial Optimization

Jul 13, 2022

Matrix Encoding Networks for Neural Combinatorial Optimization

Jun 21, 2021

Efficient Active Search for Combinatorial Optimization Problems

Jun 09, 2021

SelfMatch: Combining Contrastive Self-Supervision and Consistency for Semi-Supervised Learning

Jan 16, 2021

POMO: Policy Optimization with Multiple Optima for Reinforcement Learning

Oct 30, 2020