Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:ParameterNet: Parameters Are All You Need for Large-scale Visual Pretraining of Mobile Networks

Jun 26, 2023

Kai Han, Yunhe Wang, Jianyuan Guo, Enhua Wu

Figure 1 for ParameterNet: Parameters Are All You Need for Large-scale Visual Pretraining of Mobile Networks

Figure 2 for ParameterNet: Parameters Are All You Need for Large-scale Visual Pretraining of Mobile Networks

Figure 3 for ParameterNet: Parameters Are All You Need for Large-scale Visual Pretraining of Mobile Networks

Figure 4 for ParameterNet: Parameters Are All You Need for Large-scale Visual Pretraining of Mobile Networks

Share this with someone who'll enjoy it:

Abstract:The large-scale visual pretraining has significantly improve the performance of large vision models. However, we observe the \emph{low FLOPs pitfall} that the existing low-FLOPs models cannot benefit from large-scale pretraining. In this paper, we propose a general design principle of adding more parameters while maintaining low FLOPs for large-scale visual pretraining, named as ParameterNet. Dynamic convolutions are used for instance to equip the networks with more parameters and only slightly increase the FLOPs. The proposed ParameterNet scheme enables low-FLOPs networks to benefit from large-scale visual pretraining. Experiments on the large-scale ImageNet-22K have shown the superiority of our ParameterNet scheme. For example, ParameterNet-600M can achieve higher accuracy than the widely-used Swin Transformer (81.6\% \emph{vs.} 80.9\%) and has much lower FLOPs (0.6G \emph{vs.} 4.5G). The code will be released as soon (MindSpore: https://gitee.com/mindspore/models, PyTorch: https://github.com/huawei-noah/Efficient-AI-Backbones).

View paper on

Share this with someone who'll enjoy it:

Title:ParameterNet: Parameters Are All You Need for Large-scale Visual Pretraining of Mobile Networks

Paper and Code