Picture for Hailiang Yang

Hailiang Yang

Cerberus: Efficient Inference with Adaptive Parallel Decoding and Sequential Knowledge Enhancement

Add code
Oct 17, 2024
Figure 1 for Cerberus: Efficient Inference with Adaptive Parallel Decoding and Sequential Knowledge Enhancement
Figure 2 for Cerberus: Efficient Inference with Adaptive Parallel Decoding and Sequential Knowledge Enhancement
Figure 3 for Cerberus: Efficient Inference with Adaptive Parallel Decoding and Sequential Knowledge Enhancement
Figure 4 for Cerberus: Efficient Inference with Adaptive Parallel Decoding and Sequential Knowledge Enhancement
Viaarxiv icon