Picture for Grace Li Zhang

Grace Li Zhang

Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression

Add code
Oct 02, 2024
Viaarxiv icon

BasisN: Reprogramming-Free RRAM-Based In-Memory-Computing by Basis Combination for Deep Neural Networks

Add code
Jul 04, 2024
Viaarxiv icon

LiveMind: Low-latency Large Language Models with Simultaneous Inference

Add code
Jun 20, 2024
Viaarxiv icon

EncodingNet: A Novel Encoding-based MAC Design for Efficient Neural Network Acceleration

Add code
Feb 25, 2024
Viaarxiv icon

Class-Aware Pruning for Efficient Neural Networks

Add code
Dec 10, 2023
Viaarxiv icon

Early Classification for Dynamic Inference of Neural Networks

Add code
Sep 23, 2023
Viaarxiv icon

Logic Design of Neural Networks for High-Throughput and Low-Power Applications

Add code
Sep 19, 2023
Viaarxiv icon

Expressivity Enhancement with Efficient Quadratic Neurons for Convolutional Neural Networks

Add code
Jun 10, 2023
Viaarxiv icon

PowerPruning: Selecting Weights and Activations for Power-Efficient Neural Network Acceleration

Add code
Mar 24, 2023
Viaarxiv icon

CorrectNet: Robustness Enhancement of Analog In-Memory Computing for Neural Networks by Error Suppression and Compensation

Add code
Nov 27, 2022
Viaarxiv icon