Picture for Philip Zmushko

Philip Zmushko

FRUGAL: Memory-Efficient Optimization by Reducing State Overhead for Scalable Training

Add code
Nov 12, 2024
Viaarxiv icon