Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:SeaFormer: Squeeze-enhanced Axial Transformer for Mobile Semantic Segmentation

Feb 09, 2023

Qiang Wan, Zilong Huang, Jiachen Lu, Gang Yu, Li Zhang

Figure 1 for SeaFormer: Squeeze-enhanced Axial Transformer for Mobile Semantic Segmentation

Figure 2 for SeaFormer: Squeeze-enhanced Axial Transformer for Mobile Semantic Segmentation

Figure 3 for SeaFormer: Squeeze-enhanced Axial Transformer for Mobile Semantic Segmentation

Figure 4 for SeaFormer: Squeeze-enhanced Axial Transformer for Mobile Semantic Segmentation

Share this with someone who'll enjoy it:

Abstract:Since the introduction of Vision Transformers, the landscape of many computer vision tasks (e.g., semantic segmentation), which has been overwhelmingly dominated by CNNs, recently has significantly revolutionized. However, the computational cost and memory requirement render these methods unsuitable on the mobile device, especially for the high-resolution per-pixel semantic segmentation task. In this paper, we introduce a new method squeeze-enhanced Axial TransFormer (SeaFormer) for mobile semantic segmentation. Specifically, we design a generic attention block characterized by the formulation of squeeze Axial and detail enhancement. It can be further used to create a family of backbone architectures with superior cost-effectiveness. Coupled with a light segmentation head, we achieve the best trade-off between segmentation accuracy and latency on the ARM-based mobile devices on the ADE20K and Cityscapes datasets. Critically, we beat both the mobile-friendly rivals and Transformer-based counterparts with better performance and lower latency without bells and whistles. Beyond semantic segmentation, we further apply the proposed SeaFormer architecture to image classification problem, demonstrating the potentials of serving as a versatile mobile-friendly backbone.

* ICLR 2023

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:SeaFormer: Squeeze-enhanced Axial Transformer for Mobile Semantic Segmentation

Paper and Code