Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:GIFT: Generative Interpretable Fine-Tuning Transformers

Dec 01, 2023

Chinmay Savadikar, Xi Song, Tianfu Wu

Figure 1 for GIFT: Generative Interpretable Fine-Tuning Transformers

Figure 2 for GIFT: Generative Interpretable Fine-Tuning Transformers

Figure 3 for GIFT: Generative Interpretable Fine-Tuning Transformers

Figure 4 for GIFT: Generative Interpretable Fine-Tuning Transformers

Share this with someone who'll enjoy it:

Abstract:We present GIFT (Generative Interpretable Fine-tuning Transformers) for fine-tuning pretrained (often large) Transformer models at downstream tasks in a parameter-efficient way with built-in interpretability. Our GIFT is a deep parameter-residual learning method, which addresses two problems in fine-tuning a pretrained Transformer model: Where to apply the parameter-efficient fine-tuning (PEFT) to be extremely lightweight yet sufficiently expressive, and How to learn the PEFT to better exploit the knowledge of the pretrained model in a direct way? For the former, we select the final projection (linear) layer in the multi-head self-attention of a Transformer model, and verify its effectiveness. For the latter, in contrast to the prior art that directly introduce new model parameters (often in low-rank approximation form) to be learned in fine-tuning with downstream data, we propose a method for learning to generate the fine-tuning parameters. Our GIFT is a hyper-Transformer which take as input the pretrained parameters of the projection layer to generate its fine-tuning parameters using a proposed Parameter-to-Cluster Attention (PaCa). The PaCa results in a simple clustering-based forward explainer that plays the role of semantic segmentation in testing. In experiments, our proposed GIFT is tested on the VTAB benchmark and the fine-grained visual classification (FGVC) benchmark. It obtains significantly better performance than the prior art. Our code is available at https://github.com/savadikarc/gift

* 18 pages, 12 figures

View paper on

Share this with someone who'll enjoy it:

Title:GIFT: Generative Interpretable Fine-Tuning Transformers

Paper and Code