Kazuki Yano

STEP: Staged Parameter-Efficient Pre-training for Large Language Models

Apr 05, 2025
Via arXiv

Efficient Construction of Model Family through Progressive Training Using Model Expansion

Apr 01, 2025
Via arXiv