Weigang Feng

Automating Energy-Efficient GPU Kernel Generation: A Fast Search-Based Compilation Approach

Nov 28, 2024

Yijia Zhang, Zhihong Gou, Shijie Cao, Weigang Feng, Sicheng Zhang, Guohao Dai, Ningyi Xu

Abstract: Deep Neural Networks (DNNs) have revolutionized various fields, but their deployment on GPUs often leads to significant energy consumption. Unlike existing methods for reducing GPU energy consumption, which are either hardware-inflexible or limited by workload constraints, this paper addresses the problem at the GPU kernel level. We propose a novel search-based compilation method to generate energy-efficient GPU kernels by incorporating energy efficiency into the search process. To accelerate the energy evaluation process, we develop an accurate energy cost model based on high-level kernel features. Furthermore, we introduce a dynamic updating strategy for the energy cost model, reducing the need for on-device energy measurements and accelerating the search process. Our evaluation demonstrates that the proposed approach can generate GPU kernels with up to 21.69% reduced energy consumption while maintaining low latency.
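The general idea of a cost-model-guided search can be illustrated with a toy sketch. This is not the paper's implementation: the configuration space (tile size, unroll factor), the synthetic `measure_energy` function, the nearest-neighbour `EnergyCostModel`, and all parameter names are hypothetical stand-ins chosen only to show how a model that is updated during the search can limit the number of on-device measurements.

```python
import random


def measure_energy(config):
    """Hypothetical stand-in for an on-device energy measurement.

    Uses a synthetic formula, not real hardware readings.
    """
    tile, unroll = config
    return (tile - 32) ** 2 / 64.0 + (unroll - 4) ** 2 + 10.0


class EnergyCostModel:
    """Toy cost model: predicts the energy of the nearest measured config."""

    def __init__(self):
        self.samples = {}  # config -> measured energy

    def update(self, config, energy):
        # Fold a new measurement into the model during the search.
        self.samples[config] = energy

    def predict(self, config):
        if not self.samples:
            return 0.0  # no data yet: every candidate looks equally good
        nearest = min(
            self.samples,
            key=lambda c: abs(c[0] - config[0]) + abs(c[1] - config[1]),
        )
        return self.samples[nearest]


def search(candidates, rounds=5, batch=8, measure_top=2, seed=0):
    """Rank sampled candidates with the model; measure only the top few."""
    rng = random.Random(seed)
    model = EnergyCostModel()
    best_config, best_energy = None, float("inf")
    measurements = 0
    for _ in range(rounds):
        sampled = rng.sample(candidates, batch)
        ranked = sorted(sampled, key=model.predict)
        for config in ranked[:measure_top]:
            energy = measure_energy(config)  # the expensive step
            measurements += 1
            model.update(config, energy)
            if energy < best_energy:
                best_config, best_energy = config, energy
    return best_config, best_energy, measurements


candidates = [(t, u) for t in (8, 16, 32, 64, 128) for u in (1, 2, 4, 8)]
best_config, best_energy, measurements = search(candidates)
print(best_config, best_energy, measurements)
```

In this sketch only `rounds * measure_top` candidates are ever measured, while the remaining candidates are screened by the model's predictions. A real system would also account for latency and would use a learned model over kernel features.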
