Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Lefan Zhang

Harmonized Speculative Sampling

Aug 28, 2024

Lefan Zhang, Xiaodan Wang, Yanhua Huang, Ruiwen Xu

Figure 1 for Harmonized Speculative Sampling

Figure 2 for Harmonized Speculative Sampling

Figure 3 for Harmonized Speculative Sampling

Figure 4 for Harmonized Speculative Sampling

Abstract:Speculative sampling has proven to be an effective solution to accelerate decoding from large language models, where the acceptance rate significantly determines the performance. Most previous works on improving the acceptance rate focus on aligned training and efficient decoding, implicitly paying less attention to the linkage of training and decoding. In this work, we first investigate the linkage of training and decoding for speculative sampling and then propose a solution named HArmonized Speculative Sampling (HASS). HASS improves the acceptance rate without extra inference overhead by harmonizing training and decoding on their objectives and contexts. Experiments on three LLaMA models demonstrate that HASS achieves 2.81x-3.65x wall-clock time speedup ratio averaging across three datasets, which is 8%-15% faster than EAGLE-2.

Via

Access Paper or Ask Questions

Learning from Long-Tailed Noisy Data with Sample Selection and Balanced Loss

Nov 20, 2022

Lefan Zhang, Zhang-Hao Tian, Wei Wang

Abstract:The success of deep learning depends on large-scale and well-curated training data, while data in real-world applications are commonly long-tailed and noisy. Many methods have been proposed to deal with long-tailed data or noisy data, while a few methods are developed to tackle long-tailed noisy data. To solve this, we propose a robust method for learning from long-tailed noisy data with sample selection and balanced loss. Specifically, we separate the noisy training data into clean labeled set and unlabeled set with sample selection, and train the deep neural network in a semi-supervised manner with a novel balanced loss based on model bias. Experiments on benchmarks demonstrate that our method outperforms existing state-of-the-art methods.

Via

Access Paper or Ask Questions