Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Linnan Jiang

PQD: Post-training Quantization for Efficient Diffusion Models

Dec 30, 2024

Jiaojiao Ye, Zhen Wang, Linnan Jiang

Figure 1 for PQD: Post-training Quantization for Efficient Diffusion Models

Figure 2 for PQD: Post-training Quantization for Efficient Diffusion Models

Figure 3 for PQD: Post-training Quantization for Efficient Diffusion Models

Figure 4 for PQD: Post-training Quantization for Efficient Diffusion Models

Abstract:Diffusionmodels(DMs)havedemonstratedremarkableachievements in synthesizing images of high fidelity and diversity. However, the extensive computational requirements and slow generative speed of diffusion models have limited their widespread adoption. In this paper, we propose a novel post-training quantization for diffusion models (PQD), which is a time-aware optimization framework for diffusion models based on post-training quantization. The proposed framework optimizes the inference process by selecting representative samples and conducting time-aware calibration. Experimental results show that our proposed method is able to directly quantize full-precision diffusion models into 8-bit or 4-bit models while maintaining comparable performance in a training-free manner, achieving a few FID change on ImageNet for unconditional image generation. Our approach demonstrates compatibility and can also be applied to 512x512 text-guided image generation for the first time.

* 7 pages, 3 figures, uses wacv.sty

Via

Access Paper or Ask Questions

Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies

Oct 24, 2024

Luping Wang, Sheng Chen, Linnan Jiang, Shu Pan, Runze Cai, Sen Yang, Fei Yang

Figure 1 for Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies

Figure 2 for Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies

Figure 3 for Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies

Figure 4 for Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies

Abstract:The large models, as predicted by scaling raw forecasts, have made groundbreaking progress in many fields, particularly in natural language generation tasks, where they have approached or even surpassed human levels. However, the unprecedented scale of their parameters brings significant computational and storage costs. These large models require substantial computational resources and GPU memory to operate. When adapting large models to specific downstream tasks, their massive parameter scale poses a significant challenge in fine-tuning on hardware platforms with limited computational power and GPU memory. To address this issue, Parameter-Efficient Fine-Tuning (PEFT) offers a practical solution by efficiently adjusting the parameters of large pre-trained models to suit various downstream tasks. Specifically, PEFT adjusts the parameters of pre-trained large models to adapt to specific tasks or domains, minimizing the introduction of additional parameters and the computational resources required. This review mainly introduces the preliminary knowledge of PEFT, the core ideas and principles of various PEFT algorithms, the applications of PEFT, and potential future research directions. By reading this review, we believe that interested parties can quickly grasp the PEFT methodology, thereby accelerating its development and innovation.

Via

Access Paper or Ask Questions