Achintya Kundu

IBM Research

Enhancing Training Efficiency Using Packing with Flash Attention

Jul 12, 2024
Via arXiv

Efficiently Distilling LLMs for Edge Applications

Apr 01, 2024
Via arXiv

TOFA: Transfer-Once-for-All

Mar 27, 2023
Via arXiv