Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:AutoTaskFormer: Searching Vision Transformers for Multi-task Learning

Apr 20, 2023

Yang Liu, Shen Yan, Yuge Zhang, Kan Ren, Quanlu Zhang, Zebin Ren, Deng Cai, Mi Zhang

Figure 1 for AutoTaskFormer: Searching Vision Transformers for Multi-task Learning

Figure 2 for AutoTaskFormer: Searching Vision Transformers for Multi-task Learning

Figure 3 for AutoTaskFormer: Searching Vision Transformers for Multi-task Learning

Figure 4 for AutoTaskFormer: Searching Vision Transformers for Multi-task Learning

Share this with someone who'll enjoy it:

Abstract:Vision Transformers have shown great performance in single tasks such as classification and segmentation. However, real-world problems are not isolated, which calls for vision transformers that can perform multiple tasks concurrently. Existing multi-task vision transformers are handcrafted and heavily rely on human expertise. In this work, we propose a novel one-shot neural architecture search framework, dubbed AutoTaskFormer (Automated Multi-Task Vision TransFormer), to automate this process. AutoTaskFormer not only identifies the weights to share across multiple tasks automatically, but also provides thousands of well-trained vision transformers with a wide range of parameters (e.g., number of heads and network depth) for deployment under various resource constraints. Experiments on both small-scale (2-task Cityscapes and 3-task NYUv2) and large-scale (16-task Taskonomy) datasets show that AutoTaskFormer outperforms state-of-the-art handcrafted vision transformers in multi-task learning. The entire code and models will be open-sourced.

* 15 pages

View paper on

Share this with someone who'll enjoy it:

Title:AutoTaskFormer: Searching Vision Transformers for Multi-task Learning

Paper and Code