Title: Scaling Up Your Kernels: Large Kernel Design in ConvNets towards Universal Representations

Oct 10, 2024

Yiyuan Zhang, Xiaohan Ding, Xiangyu Yue

Abstract: This paper proposes the paradigm of large convolutional kernels in designing modern Convolutional Neural Networks (ConvNets). We establish that employing a few large kernels, instead of stacking multiple smaller ones, can be a superior design strategy. Our work introduces a set of architecture design guidelines for large-kernel ConvNets that optimize their efficiency and performance. We propose the UniRepLKNet architecture, which offers systematic architecture design principles specifically crafted for large-kernel ConvNets, emphasizing their unique ability to capture extensive spatial information without deep layer stacking. The resulting model not only surpasses its predecessors, with an ImageNet accuracy of 88.0%, an ADE20K mIoU of 55.6%, and a COCO box AP of 56.4%, but also demonstrates impressive scalability and performance on various modalities such as time-series forecasting, audio, point cloud, and video recognition. These results indicate the universal modeling abilities of large-kernel ConvNets, with faster inference speed than vision transformers. Our findings reveal that large-kernel ConvNets possess larger effective receptive fields and a higher shape bias, moving away from the texture bias typical of smaller-kernel CNNs. All code and models are publicly available at https://github.com/AILab-CVC/UniRepLKNet, promoting further research and development in the community.
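
As a rough illustration of the "few large kernels versus many stacked small ones" trade-off mentioned in the abstract, the sketch below computes the theoretical receptive field of a stack of convolution layers. It is not taken from the paper or its repository: the function name `theoretical_receptive_field` and the kernel sizes used (six 3x3 layers, one 13x13 layer) are assumptions chosen for illustration. The abstract's claim concerns the effective receptive field, which is an empirical quantity and is not what this arithmetic measures.

```python
def theoretical_receptive_field(kernel_sizes, strides=None):
    """Theoretical receptive field, per spatial axis, of a stack of conv layers.

    kernel_sizes: kernel size of each layer, in order from input to output.
    strides: stride of each layer (defaults to 1 everywhere).
    Dilation is assumed to be 1. Hypothetical helper, not from the paper.
    """
    if strides is None:
        strides = [1] * len(kernel_sizes)
    if len(strides) != len(kernel_sizes):
        raise ValueError("kernel_sizes and strides must have the same length")

    rf = 1    # a single output unit sees one input pixel before any layer
    jump = 1  # distance in input pixels between adjacent units at this depth
    for k, s in zip(kernel_sizes, strides):
        rf += (k - 1) * jump  # each layer widens the field by (k - 1) jumps
        jump *= s             # striding spreads later units further apart
    return rf


# Illustrative sizes only (assumptions, not the paper's configuration).
stacked_small = [3] * 6   # six stacked 3x3 layers
single_large = [13]       # one 13x13 layer

print(theoretical_receptive_field(stacked_small))
print(theoretical_receptive_field(single_large))
```

Under these assumptions both configurations cover the same theoretical field, but the stacked variant needs six layers to get there while the large-kernel variant needs one, which is the sense in which large kernels capture wide spatial context "without deep layer stacking".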

* This is the journal version of arXiv:2203.06717 and arXiv:2311.15599
