Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Induced Model Matching: How Restricted Models Can Help Larger Ones

Feb 19, 2024

Usama Muneeb, Mesrob I. Ohannessian

Figure 1 for Induced Model Matching: How Restricted Models Can Help Larger Ones

Figure 2 for Induced Model Matching: How Restricted Models Can Help Larger Ones

Figure 3 for Induced Model Matching: How Restricted Models Can Help Larger Ones

Figure 4 for Induced Model Matching: How Restricted Models Can Help Larger Ones

Share this with someone who'll enjoy it:

Abstract:We consider scenarios where a very accurate predictive model using restricted features is available at the time of training of a larger, full-featured, model. This restricted model may be thought of as "side-information", derived either from an auxiliary exhaustive dataset or on the same dataset, by forcing the restriction. How can the restricted model be useful to the full model? We propose an approach for transferring the knowledge of the restricted model to the full model, by aligning the full model's context-restricted performance with that of the restricted model's. We call this methodology Induced Model Matching (IMM) and first illustrate its general applicability by using logistic regression as a toy example. We then explore IMM's use in language modeling, the application that initially inspired it, and where it offers an explicit foundation in contrast to the implicit use of restricted models in techniques such as noising. We demonstrate the methodology on both LSTM and transformer full models, using $N$-grams as restricted models. To further illustrate the potential of the principle whenever it is much cheaper to collect restricted rather than full information, we conclude with a simple RL example where POMDP policies can improve learned MDP policies via IMM.

View paper on

Share this with someone who'll enjoy it:

Title:Induced Model Matching: How Restricted Models Can Help Larger Ones

Paper and Code