Qizheng Zhang

LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs under 2 Bits

Feb 12, 2025
Via arXiv

CacheBlend: Fast Large Language Model Serving with Cached Knowledge Fusion

May 26, 2024
Via arXiv

OneAdapt: Fast Adaptation for Deep Learning Applications via Backpropagation

Oct 03, 2023
Via arXiv

Grace++: Loss-Resilient Real-Time Video Communication under High Network Latency

May 21, 2023
Via arXiv

AccMPEG: Optimizing Video Encoding for Video Analytics

Apr 26, 2022
Via arXiv