Picture for Chunshan Li

Chunshan Li

Checkpoint Merging via Bayesian Optimization in LLM Pretraining

Add code
Mar 28, 2024
Viaarxiv icon