Picture for Jiawei Gu

Jiawei Gu

CMR Scaling Law: Predicting Critical Mixture Ratios for Continual Pre-training of Language Models

Add code
Jul 24, 2024
Viaarxiv icon