Title: Continually Learning from Existing Models: Knowledge Accumulation for Neural Machine Translation

Dec 18, 2022

Yuanchi Zhang, Peng Li, Maosong Sun, Yang Liu



Abstract: Although continually extending an existing NMT model to new domains or languages has attracted intensive interest in recent years, the equally valuable problem of continually improving a given NMT model in its own domain by leveraging knowledge from an unlimited number of existing NMT models has not yet been explored. To facilitate its study, we propose a formal definition of the problem, named knowledge accumulation for NMT (KA-NMT), together with corresponding datasets and evaluation metrics, and develop a novel method for KA-NMT. We investigate a novel knowledge detection algorithm that identifies beneficial knowledge from existing models at the token level, and propose to learn from beneficial knowledge and learn against other knowledge simultaneously to improve learning efficiency. To alleviate catastrophic forgetting, we further propose to transfer knowledge from the previous version of the given model to the current one. Extensive experiments show that our proposed method significantly and consistently outperforms representative baselines under homogeneous, heterogeneous, and malicious model settings for different language pairs.

* 18 pages, 3 figures
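
The abstract describes detecting beneficial knowledge at the token level, then learning from it while learning against the rest. The sketch below is one possible reading of that idea, not the paper's algorithm: the abstract does not give the detection criterion or the loss. Here a token is assumed to be "beneficial" when the existing model assigns more probability to the reference token than the current model does. The names `is_beneficial`, `token_level_loss`, and `against_weight`, and the clipping value, are all hypothetical.

```python
import math


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions given as equal-length lists."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def is_beneficial(teacher_dist, student_dist, gold_index):
    # ASSUMPTION (not from the paper): the existing ("teacher") model's
    # prediction counts as beneficial for this token if it puts more
    # probability on the reference token than the current ("student") model.
    return teacher_dist[gold_index] > student_dist[gold_index]


def token_level_loss(teacher_dists, student_dists, gold_indices,
                     against_weight=0.1, clip=1.0):
    """Return (mean loss, per-token beneficial flags).

    Beneficial tokens add KL(teacher || student), pulling the student toward
    the teacher. Other tokens subtract a clipped, down-weighted KL, pushing
    the student away. Both choices are illustrative assumptions.
    """
    total = 0.0
    flags = []
    for t, s, g in zip(teacher_dists, student_dists, gold_indices):
        good = is_beneficial(t, s, g)
        flags.append(good)
        kl = kl_divergence(t, s)
        if good:
            total += kl                               # learn from
        else:
            total -= against_weight * min(kl, clip)   # learn against
    return total / len(gold_indices), flags


# Toy example: two target positions over a three-word vocabulary.
teacher = [[0.7, 0.2, 0.1], [0.1, 0.8, 0.1]]
student = [[0.4, 0.4, 0.2], [0.3, 0.3, 0.4]]
gold = [0, 2]
loss, flags = token_level_loss(teacher, student, gold)
print(flags, loss)
```

In a real NMT system the distributions would come from the decoder softmax of each model, and the loss would be combined with the usual translation objective; how the paper does this is not stated in the abstract.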
