Naifeng Jing

LUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference Acceleration

Aug 12, 2024
Via arXiv

DNN Training Acceleration via Exploring GPGPU Friendly Sparsity

Mar 11, 2022
Via arXiv

CP-ViT: Cascade Vision Transformer Pruning via Progressive Sparsity Prediction

Mar 09, 2022
Via arXiv

SME: ReRAM-based Sparse-Multiplication-Engine to Squeeze-Out Bit Sparsity of Neural Network

Mar 02, 2021
Via arXiv

Invocation-driven Neural Approximate Computing with a Multiclass-Classifier and Multiple Approximators

Oct 19, 2018
Via arXiv

AXNet: ApproXimate computing using an end-to-end trainable neural network

Jul 27, 2018
Via arXiv