Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:GRAM: Fast Fine-tuning of Pre-trained Language Models for Content-based Collaborative Filtering

Apr 08, 2022

Yoonseok Yang, Kyu Seok Kim, Minsam Kim, Juneyoung Park

Figure 1 for GRAM: Fast Fine-tuning of Pre-trained Language Models for Content-based Collaborative Filtering

Figure 2 for GRAM: Fast Fine-tuning of Pre-trained Language Models for Content-based Collaborative Filtering

Figure 3 for GRAM: Fast Fine-tuning of Pre-trained Language Models for Content-based Collaborative Filtering

Figure 4 for GRAM: Fast Fine-tuning of Pre-trained Language Models for Content-based Collaborative Filtering

Share this with someone who'll enjoy it:

Abstract:Content-based collaborative filtering (CCF) provides personalized item recommendations based on both users' interaction history and items' content information. Recently, pre-trained language models (PLM) have been used to extract high-quality item encodings for CCF. However, it is resource-intensive to finetune PLM in an end-to-end (E2E) manner in CCF due to its multi-modal nature: optimization involves redundant content encoding for interactions from users. For this, we propose GRAM (GRadient Accumulation for Multi-modality): (1) Single-step GRAM which aggregates gradients for each item while maintaining theoretical equivalence with E2E, and (2) Multi-step GRAM which further accumulates gradients across multiple training steps, with less than 40\% GPU memory footprint of E2E. We empirically confirm that GRAM achieves a remarkable boost in training efficiency based on five datasets from two task domains of Knowledge Tracing and News Recommendation, where single-step and multi-step GRAM achieve 4x and 45x training speedup on average, respectively.

* NAACL 2022 Main Conference

View paper on

Share this with someone who'll enjoy it:

Title:GRAM: Fast Fine-tuning of Pre-trained Language Models for Content-based Collaborative Filtering

Paper and Code