Abstract: Binary code similarity detection (BCSD) measures the similarity between two pieces of binary executable code. Recently, learning-based BCSD methods have achieved great success, outperforming traditional BCSD methods in detection accuracy and efficiency. However, existing studies on the adversarial vulnerability of learning-based BCSD methods are rather sparse, which poses hazards in security-related applications. To evaluate adversarial robustness, this paper designs an efficient, black-box adversarial code generation algorithm, named FuncFooler. FuncFooler constrains the adversarial code to 1) keep the program's control flow graph (CFG) unchanged and 2) preserve the program's semantic meaning. Specifically, FuncFooler iteratively 1) determines the vulnerable candidates in the malicious code, 2) chooses and inserts adversarial instructions from benign code, and 3) corrects the semantic side effects of the adversarial code to meet the constraints. Empirically, FuncFooler successfully attacks three learning-based BCSD models, including SAFE, Asm2Vec, and jTrans, which calls into question whether learning-based BCSD is desirable.
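
To make the three-step attack loop concrete, the following is a minimal Python sketch of a greedy, query-only instruction-insertion attack under the stated constraints. It is an illustrative assumption, not the paper's actual implementation: the names `funcfooler_sketch` and `similarity`, the candidate-ranking heuristic, and the omitted semantics-repair step are all hypothetical stand-ins for FuncFooler's real procedures.

```python
import random

def funcfooler_sketch(malicious_fn, benign_pool, similarity, threshold,
                      max_insertions=20, samples_per_pos=8):
    """Greedy black-box sketch of an instruction-insertion attack.

    malicious_fn : list of instruction strings of the target function
    benign_pool  : instructions harvested from benign functions
    similarity   : black-box oracle, similarity(instruction_list) -> float
    threshold    : attack succeeds once the similarity score drops below it
    """
    adversarial = list(malicious_fn)
    best_score = similarity(adversarial)

    for _ in range(max_insertions):
        best = None
        # Step 1: probe insertion positions to find "vulnerable" candidates,
        # i.e. positions where an inserted instruction lowers the score most.
        for pos in range(1, len(adversarial)):  # skip the function entry
            for ins in random.sample(benign_pool,
                                     k=min(samples_per_pos, len(benign_pool))):
                trial = adversarial[:pos] + [ins] + adversarial[pos:]
                score = similarity(trial)
                if best is None or score < best[0]:
                    best = (score, pos, ins)

        score, pos, ins = best
        if score >= best_score:
            break  # no insertion improves the attack any further

        # Step 2: commit the best benign instruction at the best position.
        adversarial = adversarial[:pos] + [ins] + adversarial[pos:]
        best_score = score

        # Step 3 (omitted here): repair register/flag side effects so the
        # original CFG and semantics are preserved.

        if best_score < threshold:
            break  # attack succeeded
    return adversarial, best_score
```

In this sketch the only information the attacker uses is the model's similarity score, matching the black-box setting described above; the semantics-correction step is left as a comment because it depends on the target instruction set.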