Picture for Niladrish Chatterjee

Niladrish Chatterjee

GPU Domain Specialization via Composable On-Package Architecture

Add code
Apr 05, 2021
Figure 1 for GPU Domain Specialization via Composable On-Package Architecture
Figure 2 for GPU Domain Specialization via Composable On-Package Architecture
Figure 3 for GPU Domain Specialization via Composable On-Package Architecture
Figure 4 for GPU Domain Specialization via Composable On-Package Architecture
Viaarxiv icon

DeLTA: GPU Performance Model for Deep Learning Applications with In-depth Memory System Traffic Analysis

Add code
Apr 02, 2019
Figure 1 for DeLTA: GPU Performance Model for Deep Learning Applications with In-depth Memory System Traffic Analysis
Figure 2 for DeLTA: GPU Performance Model for Deep Learning Applications with In-depth Memory System Traffic Analysis
Figure 3 for DeLTA: GPU Performance Model for Deep Learning Applications with In-depth Memory System Traffic Analysis
Figure 4 for DeLTA: GPU Performance Model for Deep Learning Applications with In-depth Memory System Traffic Analysis
Viaarxiv icon

Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks

Add code
May 03, 2017
Figure 1 for Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks
Figure 2 for Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks
Figure 3 for Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks
Figure 4 for Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks
Viaarxiv icon