Yuechi Zhou

LongFlow: Efficient KV Cache Compression for Reasoning M

Mar 12, 2026
Via arXiv

Accurate KV Cache Quantization with Outlier Tokens Tracing

May 16, 2025
Via arXiv

OpenBA-V2: Reaching 77.3% High Compression Ratio with Fast Multi-Stage Pruning

May 09, 2024
Via arXiv

Chinese grammatical error correction based on knowledge distillation

Aug 05, 2022
Via arXiv