Title: SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression

Mar 16, 2025

Xin Wang, Samiul Alam, Zhongwei Wan, Hui Shen, Mi Zhang

Abstract: Despite significant advancements, the practical deployment of Large Language Models (LLMs) is often hampered by their immense size, highlighting the need for effective compression techniques. Singular Value Decomposition (SVD) is a promising LLM compression technique. However, existing SVD-based compression methods fall short in reducing truncation loss, leading to less competitive performance in compressed models. In this work, we introduce SVD-LLM V2, an SVD-based LLM compression method that optimizes singular value truncation with two techniques. First, SVD-LLM V2 uses the theoretical truncation loss of weight matrices to assign a unique compression ratio to each weight matrix at different layers, accommodating the heterogeneity of weight redundancy. Second, SVD-LLM V2 applies loss-optimized weight truncation to ensure that truncating singular values results in a lower and more stable truncation loss in practice. We evaluate SVD-LLM V2 on ten datasets and five LLMs at various scales. Our results show that SVD-LLM V2 outperforms state-of-the-art SVD-based LLM compression methods. Our code is available at https://github.com/AIoT-MLSys-Lab/SVD-LLM

* NAACL 2025; Code available at https://github.com/AIoT-MLSys-Lab/SVD-LLM
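
For readers unfamiliar with the terms in the abstract, the following is a minimal sketch of generic truncated-SVD compression of a single weight matrix and of the resulting truncation loss. It is not the SVD-LLM V2 algorithm. The helper names `truncate_svd` and `rank_for_ratio`, the random stand-in matrix, and the definition of the compression ratio as the fraction of parameters removed are all assumptions made for illustration.

```python
import numpy as np

def truncate_svd(W, rank):
    """Factor W into A (m x rank) and B (rank x n) by keeping the top singular values.

    Hypothetical helper for illustration; plain SVD on the weights only.
    """
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :rank] * s[:rank]   # fold the kept singular values into the left factor
    B = Vt[:rank, :]
    return A, B, s

def rank_for_ratio(m, n, ratio):
    """Largest rank whose two factors hold at most (1 - ratio) * m * n parameters.

    Assumes `ratio` means the fraction of parameters removed.
    """
    return max(1, int((1.0 - ratio) * m * n / (m + n)))

rng = np.random.default_rng(0)
W = rng.standard_normal((64, 48))            # stand-in for one weight matrix

rank = rank_for_ratio(64, 48, ratio=0.5)
A, B, s = truncate_svd(W, rank)

# Truncation loss in Frobenius norm, measured directly on the weights.
actual_loss = np.linalg.norm(W - A @ B)
# The same loss predicted from the discarded singular values alone.
predicted_loss = np.sqrt(np.sum(s[rank:] ** 2))

print(rank, A.size + B.size, W.size)
print(np.isclose(actual_loss, predicted_loss))
```

In this plain setting the loss can be predicted from the discarded singular values before any truncation is applied, which gives a rough sense of how a theoretical truncation loss could inform a per-matrix choice of compression ratio. How SVD-LLM V2 defines and uses that loss is described in the paper and the linked repository, not here.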
