Picture for Grace Li Zhang

Grace Li Zhang

Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression

Add code
Oct 02, 2024
Viaarxiv icon

BasisN: Reprogramming-Free RRAM-Based In-Memory-Computing by Basis Combination for Deep Neural Networks

Add code
Jul 04, 2024
Viaarxiv icon

LiveMind: Low-latency Large Language Models with Simultaneous Inference

Add code
Jun 20, 2024
Viaarxiv icon

EncodingNet: A Novel Encoding-based MAC Design for Efficient Neural Network Acceleration

Add code
Feb 25, 2024
Viaarxiv icon

Class-Aware Pruning for Efficient Neural Networks

Add code
Dec 10, 2023
Figure 1 for Class-Aware Pruning for Efficient Neural Networks
Figure 2 for Class-Aware Pruning for Efficient Neural Networks
Figure 3 for Class-Aware Pruning for Efficient Neural Networks
Figure 4 for Class-Aware Pruning for Efficient Neural Networks
Viaarxiv icon

Early Classification for Dynamic Inference of Neural Networks

Add code
Sep 23, 2023
Viaarxiv icon

Logic Design of Neural Networks for High-Throughput and Low-Power Applications

Add code
Sep 19, 2023
Viaarxiv icon

Expressivity Enhancement with Efficient Quadratic Neurons for Convolutional Neural Networks

Add code
Jun 10, 2023
Viaarxiv icon

PowerPruning: Selecting Weights and Activations for Power-Efficient Neural Network Acceleration

Add code
Mar 24, 2023
Viaarxiv icon

CorrectNet: Robustness Enhancement of Analog In-Memory Computing for Neural Networks by Error Suppression and Compensation

Add code
Nov 27, 2022
Viaarxiv icon