An Zou

Chain of Compression: A Systematic Approach to Combinationally Compress Convolutional Neural Networks

Mar 26, 2024
Via arXiv

ONE-SA: Enabling Nonlinear Operations in Systolic Arrays for Efficient and Flexible Neural Network Inference

Feb 01, 2024
Via arXiv

NeuralMatrix: Moving Entire Neural Networks to General Matrix Multiplication for Efficient Inference

May 23, 2023
Via arXiv

Predictive Exit: Prediction of Fine-Grained Early Exits for Computation- and Energy-Efficient Inference

Jun 09, 2022
Via arXiv

RTGPU: Real-Time GPU Scheduling of Hard Deadline Parallel Tasks with Fine-Grain Utilization

Jan 27, 2021
Via arXiv