Picture for Xiaohui Song

Xiaohui Song

Align Attention Heads Before Merging Them: An Effective Way for Converting MHA to GQA

Add code
Dec 30, 2024
Viaarxiv icon

BiLD: Bi-directional Logits Difference Loss for Large Language Model Distillation

Add code
Jun 19, 2024
Viaarxiv icon

Supervised Prototypical Contrastive Learning for Emotion Recognition in Conversation

Add code
Oct 19, 2022
Figure 1 for Supervised Prototypical Contrastive Learning for Emotion Recognition in Conversation
Figure 2 for Supervised Prototypical Contrastive Learning for Emotion Recognition in Conversation
Figure 3 for Supervised Prototypical Contrastive Learning for Emotion Recognition in Conversation
Figure 4 for Supervised Prototypical Contrastive Learning for Emotion Recognition in Conversation
Viaarxiv icon

Data Augmentation for Copy-Mechanism in Dialogue State Tracking

Add code
Feb 22, 2020
Figure 1 for Data Augmentation for Copy-Mechanism in Dialogue State Tracking
Figure 2 for Data Augmentation for Copy-Mechanism in Dialogue State Tracking
Figure 3 for Data Augmentation for Copy-Mechanism in Dialogue State Tracking
Figure 4 for Data Augmentation for Copy-Mechanism in Dialogue State Tracking
Viaarxiv icon