Hao Mark Chen

Progressive Mixed-Precision Decoding for Efficient LLM Inference

Oct 17, 2024
Via arXiv

Enhancing Dropout-based Bayesian Neural Networks with Multi-Exit on FPGA

Jun 24, 2024
Via arXiv

Hardware-Aware Neural Dropout Search for Reliable Uncertainty Prediction on FPGA

Jun 23, 2024
Via arXiv