Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Daniel Kienitz

ANTONIO: Towards a Systematic Method of Generating NLP Benchmarks for Verification

May 06, 2023

Marco Casadio, Luca Arnaboldi, Matthew L. Daggitt, Omri Isac, Tanvi Dinkar, Daniel Kienitz, Verena Rieser, Ekaterina Komendantskaya

Figure 1 for ANTONIO: Towards a Systematic Method of Generating NLP Benchmarks for Verification

Figure 2 for ANTONIO: Towards a Systematic Method of Generating NLP Benchmarks for Verification

Figure 3 for ANTONIO: Towards a Systematic Method of Generating NLP Benchmarks for Verification

Figure 4 for ANTONIO: Towards a Systematic Method of Generating NLP Benchmarks for Verification

Abstract:Verification of machine learning models used in Natural Language Processing (NLP) is known to be a hard problem. In particular, many known neural network verification methods that work for computer vision and other numeric datasets do not work for NLP. Here, we study technical reasons that underlie this problem. Based on this analysis, we propose practical methods and heuristics for preparing NLP datasets and models in a way that renders them amenable to known verification methods based on abstract interpretation. We implement these methods as a Python library called ANTONIO that links to the neural network verifiers ERAN and Marabou. We perform evaluation of the tool using an NLP dataset R-U-A-Robot suggested as a benchmark for verifying legally critical NLP applications. We hope that, thanks to its general applicability, this work will open novel possibilities for including NLP verification problems into neural network verification competitions, and will popularise NLP problems within this community.

Via

Access Paper or Ask Questions

Why Robust Natural Language Understanding is a Challenge

Jun 21, 2022

Marco Casadio, Ekaterina Komendantskaya, Verena Rieser, Matthew L. Daggitt, Daniel Kienitz, Luca Arnaboldi, Wen Kokke

Figure 1 for Why Robust Natural Language Understanding is a Challenge

Figure 2 for Why Robust Natural Language Understanding is a Challenge

Figure 3 for Why Robust Natural Language Understanding is a Challenge

Abstract:With the proliferation of Deep Machine Learning into real-life applications, a particular property of this technology has been brought to attention: Neural Networks notoriously present low robustness and can be highly sensitive to small input perturbations. Recently, many methods for verifying networks' general properties of robustness have been proposed, but they are mostly applied in Computer Vision. In this paper we propose a Verification method for Natural Language Understanding classification based on larger regions of interest, and we discuss the challenges of such task. We observe that, although the data is almost linearly separable, the verifier does not output positive results and we explain the problems and implications.

Via

Access Paper or Ask Questions

Property-driven Training: All You Ever Wanted to Know About

Apr 03, 2021

Marco Casadio, Matthew Daggitt, Ekaterina Komendantskaya, Wen Kokke, Daniel Kienitz, Rob Stewart

Figure 1 for Property-driven Training: All You Ever Wanted to Know About

Figure 2 for Property-driven Training: All You Ever Wanted to Know About

Figure 3 for Property-driven Training: All You Ever Wanted to Know About

Figure 4 for Property-driven Training: All You Ever Wanted to Know About

Abstract:Neural networks are known for their ability to detect general patterns in noisy data. This makes them a popular tool for perception components in complex AI systems. Paradoxically, they are also known for being vulnerable to adversarial attacks. In response, various methods such as adversarial training, data-augmentation and Lipschitz robustness training have been proposed as means of improving their robustness. However, as this paper explores, these training methods each optimise for a different definition of robustness. We perform an in-depth comparison of these different definitions, including their relationship, assumptions, interpretability and verifiability after training. We also look at constraint-driven training, a general approach designed to encode arbitrary constraints, and show that not all of these definitions are directly encodable. Finally we perform experiments to compare the applicability and efficacy of the training methods at ensuring the network obeys these different definitions. These results highlight that even the encoding of such a simple piece of knowledge such as robustness in neural network training is fraught with difficult choices and pitfalls.

* 10 pages, under review

Via

Access Paper or Ask Questions

Neural Network Verification for the Masses (of AI graduates)

Jul 02, 2019

Ekaterina Komendantskaya, Rob Stewart, Kirsy Duncan, Daniel Kienitz, Pierre Le Hen, Pascal Bacchus

Figure 1 for Neural Network Verification for the Masses (of AI graduates)

Figure 2 for Neural Network Verification for the Masses (of AI graduates)

Figure 3 for Neural Network Verification for the Masses (of AI graduates)

Figure 4 for Neural Network Verification for the Masses (of AI graduates)

Abstract:Rapid development of AI applications has stimulated demand for, and has given rise to, the rapidly growing number and diversity of AI MSc degrees. AI and Robotics research communities, industries and students are becoming increasingly aware of the problems caused by unsafe or insecure AI applications. Among them, perhaps the most famous example is vulnerability of deep neural networks to ``adversarial attacks''. Owing to wide-spread use of neural networks in all areas of AI, this problem is seen as particularly acute and pervasive. Despite of the growing number of research papers about safety and security vulnerabilities of AI applications, there is a noticeable shortage of accessible tools, methods and teaching materials for incorporating verification into AI programs. LAIV -- the Lab for AI and Verification -- is a newly opened research lab at Heriot-Watt university that engages AI and Robotics MSc students in verification projects, as part of their MSc dissertation work. In this paper, we will report on successes and unexpected difficulties LAIV faces, many of which arise from limitations of existing programming languages used for verification. We will discuss future directions for incorporating verification into AI degrees.

Via

Access Paper or Ask Questions