Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis

Jun 08, 2024

Zanlin Ni, Yulin Wang, Renping Zhou, Jiayi Guo, Jinyi Hu, Zhiyuan Liu, Shiji Song, Yuan Yao, Gao Huang

Figure 1 for Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis

Figure 2 for Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis

Figure 3 for Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis

Figure 4 for Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis

Share this with someone who'll enjoy it:

Abstract:The field of image synthesis is currently flourishing due to the advancements in diffusion models. While diffusion models have been successful, their computational intensity has prompted the pursuit of more efficient alternatives. As a representative work, non-autoregressive Transformers (NATs) have been recognized for their rapid generation. However, a major drawback of these models is their inferior performance compared to diffusion models. In this paper, we aim to re-evaluate the full potential of NATs by revisiting the design of their training and inference strategies. Specifically, we identify the complexities in properly configuring these strategies and indicate the possible sub-optimality in existing heuristic-driven designs. Recognizing this, we propose to go beyond existing methods by directly solving the optimal strategies in an automatic framework. The resulting method, named AutoNAT, advances the performance boundaries of NATs notably, and is able to perform comparably with the latest diffusion models at a significantly reduced inference cost. The effectiveness of AutoNAT is validated on four benchmark datasets, i.e., ImageNet-256 & 512, MS-COCO, and CC3M. Our code is available at https://github.com/LeapLabTHU/ImprovedNAT.

* Accepted by CVPR2024

View paper on

Share this with someone who'll enjoy it:

Title:Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis

Paper and Code