Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Farn Wang

Automated Vulnerability Detection Using Deep Learning Technique

Oct 29, 2024

Guan-Yan Yang, Yi-Heng Ko, Farn Wang, Kuo-Hui Yeh, Haw-Shiang Chang, Hsueh-Yi Chen

Abstract:Our work explores the utilization of deep learning, specifically leveraging the CodeBERT model, to enhance code security testing for Python applications by detecting SQL injection vulnerabilities. Unlike traditional security testing methods that may be slow and error-prone, our approach transforms source code into vector representations and trains a Long Short-Term Memory (LSTM) model to identify vulnerable patterns. When compared with existing static application security testing (SAST) tools, our model displays superior performance, achieving higher precision, recall, and F1-score. The study demonstrates that deep learning techniques, particularly with CodeBERT's advanced contextual understanding, can significantly improve vulnerability detection, presenting a scalable methodology applicable to various programming languages and vulnerability types.

* 4 pages, 1 figures; Presented at The 30st International Conference on Computational & Experimental Engineering and Sciences (ICCES2024)

Via

Access Paper or Ask Questions

Using Semantic Similarity for Input Topic Identification in Crawling-based Web Application Testing

Aug 23, 2016

Jun-Wei Lin, Farn Wang

Figure 1 for Using Semantic Similarity for Input Topic Identification in Crawling-based Web Application Testing

Figure 2 for Using Semantic Similarity for Input Topic Identification in Crawling-based Web Application Testing

Figure 3 for Using Semantic Similarity for Input Topic Identification in Crawling-based Web Application Testing

Figure 4 for Using Semantic Similarity for Input Topic Identification in Crawling-based Web Application Testing

Abstract:To automatically test web applications, crawling-based techniques are usually adopted to mine the behavior models, explore the state spaces or detect the violated invariants of the applications. However, in existing crawlers, rules for identifying the topics of input text fields, such as login ids, passwords, emails, dates and phone numbers, have to be manually configured. Moreover, the rules for one application are very often not suitable for another. In addition, when several rules conflict and match an input text field to more than one topics, it can be difficult to determine which rule suggests a better match. This paper presents a natural-language approach to automatically identify the topics of encountered input fields during crawling by semantically comparing their similarities with the input fields in labeled corpus. In our evaluation with 100 real-world forms, the proposed approach demonstrated comparable performance to the rule-based one. Our experiments also show that the accuracy of the rule-based approach can be improved by up to 19% when integrated with our approach.

* 11 pages

Via

Access Paper or Ask Questions

PAC Learning-Based Verification and Model Synthesis

Nov 03, 2015

Yu-Fang Chen, Chiao Hsieh, Ondřej Lengál, Tsung-Ju Lii, Ming-Hsien Tsai, Bow-Yaw Wang, Farn Wang

Figure 1 for PAC Learning-Based Verification and Model Synthesis

Figure 2 for PAC Learning-Based Verification and Model Synthesis

Figure 3 for PAC Learning-Based Verification and Model Synthesis

Figure 4 for PAC Learning-Based Verification and Model Synthesis

Abstract:We introduce a novel technique for verification and model synthesis of sequential programs. Our technique is based on learning a regular model of the set of feasible paths in a program, and testing whether this model contains an incorrect behavior. Exact learning algorithms require checking equivalence between the model and the program, which is a difficult problem, in general undecidable. Our learning procedure is therefore based on the framework of probably approximately correct (PAC) learning, which uses sampling instead and provides correctness guarantees expressed using the terms error probability and confidence. Besides the verification result, our procedure also outputs the model with the said correctness guarantees. Obtained preliminary experiments show encouraging results, in some cases even outperforming mature software verifiers.

* 11 pages

Via

Access Paper or Ask Questions