Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Understanding Long Programming Languages with Structure-Aware Sparse Attention

May 27, 2022

Tingting Liu, Chengyu Wang, Cen Chen, Ming Gao, Aoying Zhou

Figure 1 for Understanding Long Programming Languages with Structure-Aware Sparse Attention

Figure 2 for Understanding Long Programming Languages with Structure-Aware Sparse Attention

Figure 3 for Understanding Long Programming Languages with Structure-Aware Sparse Attention

Figure 4 for Understanding Long Programming Languages with Structure-Aware Sparse Attention

Share this with someone who'll enjoy it:

Abstract:Programming-based Pre-trained Language Models (PPLMs) such as CodeBERT have achieved great success in many downstream code-related tasks. Since the memory and computational complexity of self-attention in the Transformer grow quadratically with the sequence length, PPLMs typically limit the code length to 512. However, codes in real-world applications are generally long, such as code searches, which cannot be processed efficiently by existing PPLMs. To solve this problem, in this paper, we present SASA, a Structure-Aware Sparse Attention mechanism, which reduces the complexity and improves performance for long code understanding tasks. The key components in SASA are top-$k$ sparse attention and Abstract Syntax Tree (AST)-based structure-aware attention. With top-$k$ sparse attention, the most crucial attention relation can be obtained with a lower computational cost. As the code structure represents the logic of the code statements, which is a complement to the code sequence characteristics, we further introduce AST structures into attention. Extensive experiments on CodeXGLUE tasks show that SASA achieves better performance than the competing baselines.

* sigir 2022 accepted, code will be available at https://github.com/alibaba/EasyNLP

View paper on

Share this with someone who'll enjoy it:

Title:Understanding Long Programming Languages with Structure-Aware Sparse Attention

Paper and Code