Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Bad Students Make Great Teachers: Active Learning Accelerates Large-Scale Visual Understanding

Dec 12, 2023

Talfan Evans, Shreya Pathak, Hamza Merzic, Jonathan Schwarz, Ryutaro Tanno, Olivier J. Henaff

Figure 1 for Bad Students Make Great Teachers: Active Learning Accelerates Large-Scale Visual Understanding

Figure 2 for Bad Students Make Great Teachers: Active Learning Accelerates Large-Scale Visual Understanding

Figure 3 for Bad Students Make Great Teachers: Active Learning Accelerates Large-Scale Visual Understanding

Figure 4 for Bad Students Make Great Teachers: Active Learning Accelerates Large-Scale Visual Understanding

Share this with someone who'll enjoy it:

Abstract:We propose a method for accelerating large-scale pre-training with online data selection policies. For the first time, we demonstrate that model-based data selection can reduce the total computation needed to reach the performance of models trained with uniform sampling. The key insight which enables this "compute-positive" regime is that small models provide good proxies for the loss of much larger models, such that computation spent on scoring data can be drastically scaled down but still significantly accelerate training of the learner.. These data selection policies also strongly generalize across datasets and tasks, opening an avenue for further amortizing the overhead of data scoring by re-using off-the-shelf models and training sequences. Our methods, ClassAct and ActiveCLIP, require 46% and 51% fewer training updates and up to 25% less total computation when training visual classifiers on JFT and multimodal models on ALIGN, respectively. Finally, our paradigm seamlessly applies to the curation of large-scale image-text datasets, yielding a new state-of-the-art in several multimodal transfer tasks and pre-training regimes.

* Technical report

View paper on

Share this with someone who'll enjoy it:

Title:Bad Students Make Great Teachers: Active Learning Accelerates Large-Scale Visual Understanding

Paper and Code