Picture for Chenggang Li

Chenggang Li

Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective

Add code
Feb 24, 2025
Viaarxiv icon

MAGA: MAssive Genre-Audience Reformulation to Pretraining Corpus Expansion

Add code
Feb 06, 2025
Viaarxiv icon