Runsheng Wang

MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers

Oct 23, 2024
Via arXiv

PrivQuant: Communication-Efficient Private Inference with Quantized Network/Protocol Co-Optimization

Oct 12, 2024
Via arXiv

AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference

Aug 19, 2024
Via arXiv

ChatEMG: Synthetic Data Generation to Control a Robotic Hand Orthosis for Stroke

Jun 17, 2024
Via arXiv

FastQuery: Communication-efficient Embedding Table Query for Private LLM Inference

May 25, 2024
Via arXiv

PrivCirNet: Efficient Private Inference via Block Circulant Transformation

May 23, 2024
Via arXiv

EasyACIM: An End-to-End Automated Analog CIM with Synthesizable Architecture and Agile Design Space Exploration

Apr 12, 2024
Via arXiv

PDNNet: PDN-Aware GNN-CNN Heterogeneous Network for Dynamic IR Drop Prediction

Mar 27, 2024
Via arXiv

ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding

Feb 21, 2024
Via arXiv

HEQuant: Marrying Homomorphic Encryption and Quantization for Communication-Efficient Private Inference

Jan 31, 2024
Via arXiv