Abstract:Neural networks are increasingly applied in safety-critical domains; their verification is thus gaining importance. A large class of recent algorithms for proving input-output relations of feed-forward neural networks is based on linear relaxations and symbolic interval propagation. However, due to variable dependencies, the approximations deteriorate with increasing network depth. In this paper, we present DPNeurifyFV, a novel branch-and-bound solver for ReLU networks with low-dimensional input space that is based on symbolic interval propagation with fresh variables and input splitting. A new heuristic for choosing the fresh variables ameliorates the dependency problem, while our novel splitting heuristic, combined with several other improvements, speeds up the branch-and-bound procedure. We evaluate our approach on the ACAS Xu airborne collision avoidance networks and demonstrate runtime improvements over state-of-the-art tools.
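The sketch below illustrates the general idea of symbolic interval propagation with fresh variables for a single affine + ReLU layer. It is not the DPNeurifyFV implementation: the data layout, the linear relaxation, and the fresh-variable selection heuristic (widest pre-activation interval first) are simplifying assumptions made for illustration.

```python
import numpy as np

def concrete_bounds(eqs, lo, hi):
    """Concrete interval of affine expressions eqs = [coefficients | constant]
    over the variable box [lo, hi]."""
    A, c = eqs[:, :-1], eqs[:, -1]
    lb = c + np.maximum(A, 0) @ lo + np.minimum(A, 0) @ hi
    ub = c + np.maximum(A, 0) @ hi + np.minimum(A, 0) @ lo
    return lb, ub

def affine_layer(low, up, W, b):
    """Propagate symbolic lower/upper equations through y = W x + b."""
    Wp, Wn = np.maximum(W, 0), np.minimum(W, 0)
    new_low = Wp @ low + Wn @ up
    new_up = Wp @ up + Wn @ low
    new_low[:, -1] += b
    new_up[:, -1] += b
    return new_low, new_up

def relu_layer(low, up, lo, hi, fresh_budget=1):
    """ReLU relaxation; up to fresh_budget unstable neurons are replaced by a
    fresh variable ranging over the neuron's concrete output interval."""
    ll, _ = concrete_bounds(low, lo, hi)
    _, uu = concrete_bounds(up, lo, hi)
    unstable = np.where((ll < 0) & (uu > 0))[0]
    # illustrative heuristic: widest pre-activation interval first
    fresh = set(unstable[np.argsort(ll[unstable] - uu[unstable])][:fresh_budget])
    n_old = low.shape[1] - 1
    pad = np.zeros((low.shape[0], len(fresh)))
    low = np.hstack([low[:, :-1], pad, low[:, -1:]])
    up = np.hstack([up[:, :-1], pad, up[:, -1:]])
    lo = np.concatenate([lo, np.zeros(len(fresh))])
    hi = np.concatenate([hi, uu[sorted(fresh)]])
    for i in range(low.shape[0]):
        if uu[i] <= 0:                       # provably inactive: output is 0
            low[i], up[i] = 0.0, 0.0
        elif ll[i] >= 0:                     # provably active: keep equations
            continue
        elif i in fresh:                     # replace neuron by fresh variable x_k
            k = n_old + sorted(fresh).index(i)
            low[i], up[i] = 0.0, 0.0
            low[i, k] = up[i, k] = 1.0       # x_k ranges over [0, uu[i]]
        else:                                # sound linear relaxation
            s = uu[i] / (uu[i] - ll[i])
            up[i] = s * up[i]
            up[i, -1] -= s * ll[i]
            low[i] = 0.0                     # 0 <= ReLU(x) is always sound
    return low, up, lo, hi
```

Introducing a fresh variable cuts the symbolic dependency chain at an unstable neuron: later layers then track exact dependencies on that variable instead of an increasingly loose relaxation, at the cost of one additional input dimension.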
Abstract:As neural networks (NNs) are increasingly introduced into safety-critical domains, there is a growing need to formally verify them before deployment. In this work, we focus on the formal verification problem of NN equivalence, which aims to prove that two NNs (e.g., an original and a compressed version) show equivalent behavior. Two approaches have been proposed for this problem: mixed-integer linear programming and interval propagation. While the first lacks scalability, the latter is only suitable for structurally similar NNs with small weight changes. Our paper makes four contributions. First, we prove that the epsilon-equivalence problem is coNP-complete. Second, we extend Tran et al.'s single-NN geometric path enumeration algorithm to a setting with multiple NNs. Third, we implement the extended algorithm for equivalence verification and evaluate the optimizations necessary for its practical use. Finally, we perform a comparative evaluation showing use cases where our approach outperforms the previous state of the art, both for equivalence verification and for counterexample finding.
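For intuition, the following sketch states the epsilon-equivalence property the abstract refers to and checks it by random sampling, i.e. as a counterexample search only. It is not the paper's path-enumeration procedure; the network representation, input box, and the use of the maximum norm are assumptions for illustration, and a run without a violation proves nothing.

```python
import numpy as np

def relu_net(weights, biases, x):
    """Evaluate a fully connected ReLU network on input x."""
    for W, b in zip(weights[:-1], biases[:-1]):
        x = np.maximum(W @ x + b, 0.0)
    return weights[-1] @ x + biases[-1]

def sample_counterexample(net1, net2, in_lo, in_hi, eps, n_samples=10000, rng=None):
    """Search for an input where the two networks' outputs differ by more
    than eps (max norm). Returns a counterexample or None; None does NOT
    prove epsilon-equivalence, it only means sampling found no violation."""
    rng = rng or np.random.default_rng(0)
    for _ in range(n_samples):
        x = rng.uniform(in_lo, in_hi)
        y1 = relu_net(*net1, x)   # net1 = (weights, biases)
        y2 = relu_net(*net2, x)
        if np.max(np.abs(y1 - y2)) > eps:
            return x
    return None
```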
Abstract:In recent years, machine learning techniques have been applied to more and more application domains, including software engineering and, especially, software quality assurance. Important application domains include, for example, software defect prediction and test case selection and prioritization. The ability to predict which components of a large software system are most likely to contain the largest numbers of faults in the next release helps to better manage projects, including early estimation of possible release delays, and to affordably guide corrective actions that improve the quality of the software. However, developing robust fault prediction models is a challenging task, and many techniques have been proposed in the literature. Closely related to estimating the defect-prone parts of a software system is the question of how to select and prioritize test cases; indeed, test case prioritization has been extensively researched as a means of reducing the time taken to discover regressions in software. In this survey, we discuss various approaches to both fault prediction and test case prioritization, and explain how recent studies use deep learning algorithms for fault prediction to bridge the gap between programs' semantics and fault prediction features. We also review recently proposed machine learning methods for test case prioritization (TCP) and their ability to reduce the cost of regression testing without negatively affecting fault detection capability.
Abstract:Experimental evaluation is an integral part of the design process of algorithms. Publicly available benchmark instances are widely used to evaluate methods in SAT solving. For the interpretation of results and the design of algorithm portfolios, the attributes of these instances are crucial. Our specification of a benchmark instance identifier considerably simplifies capturing the interrelation between benchmark instances and their attributes. Our tool thus increases the availability of both by providing means to manage and retrieve benchmark instances by their attributes and vice versa. In this way, it facilitates the design and analysis of SAT experiments and the exchange of results.
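The sketch below shows one plausible way such an identifier and attribute lookup could work; it is not the tool's actual implementation. The normalization rules (dropping comment lines and collapsing whitespace in a DIMACS CNF file), the choice of hash, and the in-memory attribute store are assumptions made for illustration.

```python
import hashlib

def instance_id(path):
    """Identifier of a CNF benchmark: hash of the file with comment lines
    removed and whitespace normalized, so trivially reformatted copies of
    the same instance map to the same identifier."""
    h = hashlib.md5()
    with open(path) as f:
        for line in f:
            line = line.strip()
            if not line or line.startswith("c"):
                continue                       # drop comments and blank lines
            h.update((" ".join(line.split()) + "\n").encode())
    return h.hexdigest()

def matching_instances(attributes, **filters):
    """Ids of instances whose stored attributes match all key=value filters,
    e.g. matching_instances(attributes, family="planning", sat="unsat")."""
    return [iid for iid, attrs in attributes.items()
            if all(attrs.get(k) == v for k, v in filters.items())]

# illustrative usage: attributes is a dict keyed by instance identifier,
#   attributes[instance_id("some_instance.cnf")] = {"family": "planning"}
# after which instances can be retrieved by attribute and vice versa.
```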