Hanshan Zhang

Energy-Based Preference Model Offers Better Offline Alignment than the Bradley-Terry Preference Model

Dec 18, 2024
Via arXiv

Uncertainty Sentence Sampling by Virtual Adversarial Perturbation

Oct 27, 2022
Via arXiv