Picture for Junhui He

Junhui He

CHESS: Optimizing LLM Inference via Channel-Wise Thresholding and Selective Sparsification

Add code
Sep 02, 2024
Figure 1 for CHESS: Optimizing LLM Inference via Channel-Wise Thresholding and Selective Sparsification
Figure 2 for CHESS: Optimizing LLM Inference via Channel-Wise Thresholding and Selective Sparsification
Figure 3 for CHESS: Optimizing LLM Inference via Channel-Wise Thresholding and Selective Sparsification
Figure 4 for CHESS: Optimizing LLM Inference via Channel-Wise Thresholding and Selective Sparsification
Viaarxiv icon

Adaptive Bayesian Multivariate Spline Knot Inference with Prior Specifications on Model Complexity

Add code
May 22, 2024
Viaarxiv icon