Animesh Jain

Automatic Attention Pruning: Improving and Automating Model Pruning using Attentions

Mar 14, 2023
Via arXiv

Iterative Activation-based Structured Pruning

Jan 22, 2022
Via arXiv

Adaptive Activation-based Structured Pruning

Jan 21, 2022
Via arXiv

Automated Backend-Aware Post-Training Quantization

Mar 27, 2021
Via arXiv

UNIT: Unifying Tensorized Instruction Compilation

Jan 21, 2021
Via arXiv

Efficient Execution of Quantized Deep Learning Models: A Compiler Approach

Jun 18, 2020
Via arXiv

Optimizing Memory-Access Patterns for Deep Learning Accelerators

Feb 27, 2020
Via arXiv