Michael Garland

Optimal Software Pipelining and Warp Specialization for Tensor Core GPUs

Dec 19, 2025

Understanding the Effect of the Long Tail on Neural Network Compression

Jun 27, 2023

ArctyrEX: Accelerated Encrypted Execution of General-Purpose Applications

Jun 19, 2023

Efficient Sparsely Activated Transformers

Aug 31, 2022

Reliable Model Compression via Label-Preservation-Aware Loss Functions

Dec 03, 2020

A Programmable Approach to Model Compression

Nov 06, 2019

GPU-Accelerated Atari Emulation for Reinforcement Learning

Jul 19, 2019

AdaBatch: Adaptive Batch Sizes for Training Deep Neural Networks

Feb 14, 2018