Chunan Shi

Clover: Regressive Lightweight Speculative Decoding with Sequential Knowledge

May 01, 2024
Via arXiv

SpotServe: Serving Generative Large Language Models on Preemptible Instances

Nov 27, 2023
Via arXiv

Galvatron: Efficient Transformer Training over Multiple GPUs Using Automatic Parallelism

Nov 25, 2022
Via arXiv