Picture for Darshan Gandhi

Darshan Gandhi

Kernel Looping: Eliminating Synchronization Boundaries for Peak Inference Performance

Add code
Oct 31, 2024
Figure 1 for Kernel Looping: Eliminating Synchronization Boundaries for Peak Inference Performance
Figure 2 for Kernel Looping: Eliminating Synchronization Boundaries for Peak Inference Performance
Figure 3 for Kernel Looping: Eliminating Synchronization Boundaries for Peak Inference Performance
Figure 4 for Kernel Looping: Eliminating Synchronization Boundaries for Peak Inference Performance
Viaarxiv icon

SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts

Add code
May 13, 2024
Figure 1 for SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts
Figure 2 for SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts
Figure 3 for SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts
Figure 4 for SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts
Viaarxiv icon

Training Large Language Models Efficiently with Sparsity and Dataflow

Add code
Apr 11, 2023
Viaarxiv icon