Kiran Kumar Matam

High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models

Apr 15, 2021
Via arXiv

Check-N-Run: A Checkpointing System for Training Recommendation Models

Oct 17, 2020
Via arXiv