Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:CLIP-KD: An Empirical Study of Distilling CLIP Models

Jul 24, 2023

Chuanguang Yang, Zhulin An, Libo Huang, Junyu Bi, Xinqiang Yu, Han Yang, Yongjun Xu

Figure 1 for CLIP-KD: An Empirical Study of Distilling CLIP Models

Figure 2 for CLIP-KD: An Empirical Study of Distilling CLIP Models

Figure 3 for CLIP-KD: An Empirical Study of Distilling CLIP Models

Figure 4 for CLIP-KD: An Empirical Study of Distilling CLIP Models

Share this with someone who'll enjoy it:

Abstract:CLIP has become a promising language-supervised visual pre-training framework and achieves excellent performance over a wide range of tasks. This paper aims to distill small CLIP models supervised by a large teacher CLIP model. We propose several distillation strategies, including relation, feature, gradient and contrastive paradigm, to examine the impact on CLIP distillation. We show that the simplest feature mimicry with MSE loss performs best. Moreover, interactive contrastive learning and relation-based distillation are also critical in performance improvement. We apply the unified method to distill several student networks trained on 15 million (image, text) pairs. Distillation improves the student CLIP models consistently over zero-shot ImageNet classification and cross-modal retrieval benchmarks. We hope our empirical study will become an important baseline for future CLIP distillation research. The code is available at \url{https://github.com/winycg/CLIP-KD}.

View paper on

Share this with someone who'll enjoy it:

Title:CLIP-KD: An Empirical Study of Distilling CLIP Models

Paper and Code