Picture for Jemin Park

Jemin Park

Align-to-Distill: Trainable Attention Alignment for Knowledge Distillation in Neural Machine Translation

Add code
Mar 03, 2024
Viaarxiv icon