Hanshan Zhang

Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining

Mar 06, 2025
Via arXiv

Energy-Based Preference Model Offers Better Offline Alignment than the Bradley-Terry Preference Model

Dec 18, 2024
Via arXiv

Uncertainty Sentence Sampling by Virtual Adversarial Perturbation

Oct 27, 2022
Via arXiv