Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Lianshang Cai

PILE: Pairwise Iterative Logits Ensemble for Multi-Teacher Labeled Distillation

Nov 11, 2022

Lianshang Cai, Linhao Zhang, Dehong Ma, Jun Fan, Daiting Shi, Yi Wu, Zhicong Cheng, Simiu Gu, Dawei Yin

Figure 1 for PILE: Pairwise Iterative Logits Ensemble for Multi-Teacher Labeled Distillation

Figure 2 for PILE: Pairwise Iterative Logits Ensemble for Multi-Teacher Labeled Distillation

Figure 3 for PILE: Pairwise Iterative Logits Ensemble for Multi-Teacher Labeled Distillation

Figure 4 for PILE: Pairwise Iterative Logits Ensemble for Multi-Teacher Labeled Distillation

Abstract:Pre-trained language models have become a crucial part of ranking systems and achieved very impressive effects recently. To maintain high performance while keeping efficient computations, knowledge distillation is widely used. In this paper, we focus on two key questions in knowledge distillation for ranking models: 1) how to ensemble knowledge from multi-teacher; 2) how to utilize the label information of data in the distillation process. We propose a unified algorithm called Pairwise Iterative Logits Ensemble (PILE) to tackle these two questions simultaneously. PILE ensembles multi-teacher logits supervised by label information in an iterative way and achieved competitive performance in both offline and online experiments. The proposed method has been deployed in a real-world commercial search system.

Via

Access Paper or Ask Questions