Picture for Yige Yuan

Yige Yuan

On a Connection Between Imitation Learning and RLHF

Add code
Mar 07, 2025
Viaarxiv icon

MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing

Add code
Feb 28, 2025
Viaarxiv icon

SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters

Add code
Feb 04, 2025
Figure 1 for SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters
Figure 2 for SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters
Figure 3 for SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters
Figure 4 for SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters
Viaarxiv icon

Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment

Add code
Dec 19, 2024
Viaarxiv icon

Fact-Level Confidence Calibration and Self-Correction

Add code
Nov 20, 2024
Figure 1 for Fact-Level Confidence Calibration and Self-Correction
Figure 2 for Fact-Level Confidence Calibration and Self-Correction
Figure 3 for Fact-Level Confidence Calibration and Self-Correction
Figure 4 for Fact-Level Confidence Calibration and Self-Correction
Viaarxiv icon

How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective

Add code
Oct 14, 2024
Viaarxiv icon

MITA: Bridging the Gap between Model and Data for Test-time Adaptation

Add code
Oct 12, 2024
Viaarxiv icon

Negative as Positive: Enhancing Out-of-distribution Generalization for Graph Contrastive Learning

Add code
May 25, 2024
Viaarxiv icon

TEA: Test-time Energy Adaptation

Add code
Nov 24, 2023
Viaarxiv icon

PDE+: Enhancing Generalization via PDE with Adaptive Distributional Diffusion

Add code
May 25, 2023
Viaarxiv icon