Jan-Jan Wu

GPU Memory Usage Optimization for Backward Propagation in Deep Network Training

Feb 18, 2025
Via arXiv