Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs

Oct 06, 2024

Divya Jyoti Bajpai, Manjesh Kumar Hanawal

Figure 1 for DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs

Figure 2 for DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs

Figure 3 for DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs

Figure 4 for DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs

Share this with someone who'll enjoy it:

Abstract:Pre-trained Language Models (PLMs) exhibit good accuracy and generalization ability across various tasks using self-supervision, but their large size results in high inference latency. Early Exit (EE) strategies handle the issue by allowing the samples to exit from classifiers attached to the intermediary layers, but they do not generalize well, as exit classifiers can be sensitive to domain changes. To address this, we propose Unsupervised Domain Adaptation in EE framework (DADEE) that employs multi-level adaptation using knowledge distillation. DADEE utilizes GAN-based adversarial adaptation at each layer to achieve domain-invariant representations, reducing the domain gap between the source and target domain across all layers. The attached exits not only speed up inference but also enhance domain adaptation by reducing catastrophic forgetting and mode collapse, making it more suitable for real-world scenarios. Experiments on tasks such as sentiment analysis, entailment classification, and natural language inference demonstrate that DADEE consistently outperforms not only early exit methods but also various domain adaptation methods under domain shift scenarios. The anonymized source code is available at https://github.com/Div290/DAdEE.

* To appear in EMNLP (findings) 2024

View paper on

Share this with someone who'll enjoy it:

Title:DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs

Paper and Code