Raghu Kiran Ganti

Flexible and Effective Mixing of Large Language Models into a Mixture of Domain Experts

Aug 30, 2024
Via arXiv

Enhancing Training Efficiency Using Packing with Flash Attention

Jul 12, 2024
Via arXiv