Zhaorui Yang

Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning

Feb 21, 2024

Zhaorui Yang, Qian Liu, Tianyu Pang, Han Wang, Haozhe Feng, Minfeng Zhu, Wei Chen

Abstract: The surge in Large Language Models (LLMs) has revolutionized natural language processing, but fine-tuning them for specific tasks often encounters challenges in balancing performance and preserving general instruction-following abilities. In this paper, we posit that the distribution gap between task datasets and the LLMs serves as the primary underlying cause. To address the problem, we introduce Self-Distillation Fine-Tuning (SDFT), a novel approach that bridges the distribution gap by guiding fine-tuning with a distilled dataset generated by the model itself to match its original distribution. Experimental results on the Llama-2-chat model across various benchmarks demonstrate that SDFT effectively mitigates catastrophic forgetting while achieving comparable or superior performance on downstream tasks compared to vanilla fine-tuning. Moreover, SDFT demonstrates the potential to maintain the helpfulness and safety alignment of LLMs. Our code is available at https://github.com/sail-sg/sdft.
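
As a rough illustration of the "distilled dataset generated by the model itself" idea in the abstract, the sketch below builds such a dataset from a task dataset. It is not the authors' implementation (see the linked repository for that). The names `build_distilled_dataset`, `rewrite`, and `is_acceptable` are hypothetical, and the fallback to the original response when a rewrite is rejected is an assumption of this sketch.

```python
from typing import Callable, Dict, List

# One training example: {"instruction": ..., "response": ...}
Example = Dict[str, str]


def build_distilled_dataset(
    task_data: List[Example],
    rewrite: Callable[[str, str], str],
    is_acceptable: Callable[[Example, str], bool],
) -> List[Example]:
    """Replace each reference response with the model's own rewrite of it.

    rewrite(instruction, reference) stands in for a call to the model being
    fine-tuned (hypothetical); is_acceptable(example, candidate) stands in
    for any quality check (hypothetical). Both are assumptions of this sketch.
    """
    distilled: List[Example] = []
    for example in task_data:
        candidate = rewrite(example["instruction"], example["response"])
        # Assumption: keep the original response if the rewrite is rejected.
        if is_acceptable(example, candidate):
            response = candidate
        else:
            response = example["response"]
        distilled.append(
            {"instruction": example["instruction"], "response": response}
        )
    return distilled
```

Fine-tuning would then run on the returned list in place of the original task data, so the targets stay closer to what the model already tends to produce.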

CoSDA: Continual Source-Free Domain Adaptation

Apr 13, 2023

Haozhe Feng, Zhaorui Yang, Hesun Chen, Tianyu Pang, Chao Du, Minfeng Zhu, Wei Chen, Shuicheng Yan

Abstract: Without access to the source data, source-free domain adaptation (SFDA) transfers knowledge from a source-domain trained model to target domains. Recently, SFDA has gained popularity due to the need to protect the data privacy of the source domain, but it suffers from catastrophic forgetting on the source domain due to the lack of data. To systematically investigate the mechanism of catastrophic forgetting, we first reimplement previous SFDA approaches within a unified framework and evaluate them on four benchmarks. We observe that there is a trade-off between adaptation gain and forgetting loss, which motivates us to design a consistency regularization to mitigate forgetting. In particular, we propose a continual source-free domain adaptation approach named CoSDA, which employs a dual-speed optimized teacher-student model pair and is equipped with consistency learning capability. Our experiments demonstrate that CoSDA outperforms state-of-the-art approaches in continuous adaptation. Notably, our CoSDA can also be integrated with other SFDA methods to alleviate forgetting.

* 15 pages, 6 figures
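
One common way to realize a "dual-speed" teacher-student pair like the one named in the CoSDA abstract is to train the student by gradient descent and move the teacher slowly toward it with an exponential moving average. The sketch below shows that pattern on plain lists of floats. The abstract does not specify the update rule or the loss, so the EMA update, the `momentum` value, and the squared-error consistency term are assumptions of this sketch, and `ema_update` and `consistency_loss` are hypothetical names.

```python
from typing import List


def ema_update(
    teacher: List[float], student: List[float], momentum: float = 0.99
) -> List[float]:
    """Return slowly updated teacher weights (assumed EMA rule).

    With momentum close to 1, the teacher changes much more slowly than the
    student, which is what makes the pair "dual-speed" in this sketch.
    """
    return [momentum * t + (1.0 - momentum) * s for t, s in zip(teacher, student)]


def consistency_loss(teacher_probs: List[float], student_probs: List[float]) -> float:
    """Mean squared difference between teacher and student predictions.

    A stand-in consistency term; the paper's actual loss may differ.
    """
    diffs = [(t - s) ** 2 for t, s in zip(teacher_probs, student_probs)]
    return sum(diffs) / len(diffs)
```

In a training loop, the student would minimize the consistency term against the teacher's predictions on target-domain data, and `ema_update` would be applied to the teacher after each student step or epoch.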
