Title: Gradient-Free Adaptive Global Pruning for Pre-trained Language Models

Feb 28, 2024

Guangji Bai, Yijiang Li, Chen Ling, Kibaek Kim, Liang Zhao

Abstract: The transformative impact of large language models (LLMs) like LLaMA and GPT on natural language processing is countered by their prohibitive computational demands. Pruning has emerged as a pivotal compression strategy, introducing sparsity to enhance both memory and computational efficiency. Yet, traditional global pruning is impractical for LLMs due to scalability issues, while local pruning, despite its efficiency, leads to suboptimal solutions. Addressing these challenges, we propose Adaptive Global Pruning (AdaGP), a novel framework that redefines the global pruning process into manageable, coordinated subproblems, allowing for resource-efficient optimization with global optimality. AdaGP's approach, which conceptualizes LLMs as a chain of modular functions and leverages auxiliary variables for problem decomposition, not only facilitates a pragmatic application on LLMs but also demonstrates significant performance improvements, particularly in high-sparsity regimes where it surpasses current state-of-the-art methods.
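
The contrast the abstract draws between local and global pruning can be illustrated with a toy example. The sketch below is not AdaGP and is not taken from the paper: it assumes a small chain of linear + ReLU layers standing in for the "chain of modular functions", uses plain magnitude pruning as a stand-in criterion, and the names `magnitude_mask` and `forward` are hypothetical. It only shows that a per-layer (local) error is measured against dense inputs, while the global error is measured after all pruned layers are chained, so errors from earlier layers can accumulate.

```python
import numpy as np

def magnitude_mask(weight, sparsity):
    """Return a 0/1 mask zeroing the `sparsity` fraction of smallest-magnitude entries."""
    k = int(round(sparsity * weight.size))
    if k == 0:
        return np.ones_like(weight)
    # Flattened indices of the k smallest-magnitude entries.
    drop = np.argsort(np.abs(weight), axis=None)[:k]
    mask = np.ones(weight.size)
    mask[drop] = 0.0
    return mask.reshape(weight.shape)

def forward(weights, x):
    """Run the toy chain f_L(... f_1(x)) and return every intermediate activation."""
    activations = [x]
    for w in weights:
        activations.append(np.maximum(w @ activations[-1], 0.0))  # linear + ReLU
    return activations

rng = np.random.default_rng(0)
weights = [rng.standard_normal((8, 8)) for _ in range(3)]  # toy 3-layer model (assumption)
x = rng.standard_normal((8, 16))                           # 16 calibration samples as columns

pruned = [w * magnitude_mask(w, 0.5) for w in weights]
dense_acts = forward(weights, x)
pruned_acts = forward(pruned, x)

# Local view: each pruned layer receives the *dense* input of that layer,
# so it never sees the errors introduced by earlier pruned layers.
local_errors = [
    np.linalg.norm(np.maximum(pw @ a, 0.0) - out)
    for pw, a, out in zip(pruned, dense_acts[:-1], dense_acts[1:])
]

# Global view: error of the final output when all pruned layers are chained.
global_error = np.linalg.norm(pruned_acts[-1] - dense_acts[-1])

print("per-layer (local) errors:", local_errors)
print("end-to-end (global) error:", global_error)
```

The intermediate activations returned by `forward` play a role loosely analogous to the auxiliary variables mentioned in the abstract: they are the quantities that connect one layer's subproblem to the next. How AdaGP actually coordinates those subproblems is described in the paper itself.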

* Preprint
