Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Edoardo Manino

Encrypted Neural Networks without Overflows

May 21, 2026

Philipp Kern, Lorenzo Rovida, Samuel Teuber, Edoardo Manino, Carsten Sinz, Alberto Leporati

Abstract:Fully homomorphic encryption (FHE) enables private inference by evaluating neural networks on encrypted data. In this way, we can delegate the computation to a third party server without ever revealing the user's data. Currently, the CKKS scheme is the backbone of most efficient FHE implementations, but it only supports addition, multiplication, and array rotation operations, thus requiring all activation functions of the neural network to be approximated by polynomials within a certain interval, imposing strict design tolerances. In this paper, we demonstrate for the first time that this scheme is vulnerable to overflow attacks, i.e., seemingly benign inputs that can exceed such tolerances of the FHE circuit, thereby causing corrupt and unusable outputs. To avoid them, we propose a formal verification technique that computes certified bounds on the ranges of all neurons in the network. By construction, our method eliminates overflows and, in our experiments, removed observed overflows on all benchmarks, reducing failure rates from up to 47% to 0%. Moreover, our overflow-free solution is compatible with most CKKS-based frameworks, as it allows to simply substitute standard polynomials by polynomials with rigorously designed ranges.

* Preprint

Via

Access Paper or Ask Questions

The 6th International Verification of Neural Networks Competition (VNN-COMP 2025): Summary and Results

Dec 22, 2025

Konstantin Kaulen, Tobias Ladner, Stanley Bak, Christopher Brix, Hai Duong, Thomas Flinkow, Taylor T. Johnson, Lukas Koller, Edoardo Manino, ThanhVu H Nguyen(+1 more)

Abstract:This report summarizes the 6th International Verification of Neural Networks Competition (VNN-COMP 2025), held as a part of the 8th International Symposium on AI Verification (SAIV), that was collocated with the 37th International Conference on Computer-Aided Verification (CAV). VNN-COMP is held annually to facilitate the fair and objective comparison of state-of-the-art neural network verification tools, encourage the standardization of tool interfaces, and bring together the neural network verification community. To this end, standardized formats for networks (ONNX) and specification (VNN-LIB) were defined, tools were evaluated on equal-cost hardware (using an automatic evaluation pipeline based on AWS instances), and tool parameters were chosen by the participants before the final test sets were made public. In the 2025 iteration, 8 teams participated on a diverse set of 16 regular and 9 extended benchmarks. This report summarizes the rules, benchmarks, participating tools, results, and lessons learned from this iteration of this competition.

* Report on the results of VNN-COMP 2025. arXiv admin note: substantial text overlap with arXiv:2412.19985, arXiv:2312.16760, arXiv:2212.10376

Via

Access Paper or Ask Questions

GPT, But Backwards: Exactly Inverting Language Model Outputs

Jul 02, 2025

Adrians Skapars, Edoardo Manino, Youcheng Sun, Lucas C. Cordeiro

Abstract:While existing auditing techniques attempt to identify potential unwanted behaviours in large language models (LLMs), we address the complementary forensic problem of reconstructing the exact input that led to an existing LLM output - enabling post-incident analysis and potentially the detection of fake output reports. We formalize exact input reconstruction as a discrete optimisation problem with a unique global minimum and introduce SODA, an efficient gradient-based algorithm that operates on a continuous relaxation of the input search space with periodic restarts and parameter decay. Through comprehensive experiments on LLMs ranging in size from 33M to 3B parameters, we demonstrate that SODA significantly outperforms existing approaches. We succeed in fully recovering 79.5% of shorter out-of-distribution inputs from next-token logits, without a single false positive, but struggle to extract private information from the outputs of longer (15+ token) input sequences. This suggests that standard deployment practices may currently provide adequate protection against malicious use of our method. Our code is available at https://doi.org/10.5281/zenodo.15539879.

* 9 pages, ICML 2025 Workshop on Reliable and Responsible Foundation Models

Via

Access Paper or Ask Questions

Neural Network Verification is a Programming Language Challenge

Jan 10, 2025

Lucas C. Cordeiro, Matthew L. Daggitt, Julien Girard-Satabin, Omri Isac, Taylor T. Johnson, Guy Katz, Ekaterina Komendantskaya, Augustin Lemesle, Edoardo Manino, Artjoms Šinkarovs(+1 more)

Figure 1 for Neural Network Verification is a Programming Language Challenge

Figure 2 for Neural Network Verification is a Programming Language Challenge

Figure 3 for Neural Network Verification is a Programming Language Challenge

Figure 4 for Neural Network Verification is a Programming Language Challenge

Abstract:Neural network verification is a new and rapidly developing field of research. So far, the main priority has been establishing efficient verification algorithms and tools, while proper support from the programming language perspective has been considered secondary or unimportant. Yet, there is mounting evidence that insights from the programming language community may make a difference in the future development of this domain. In this paper, we formulate neural network verification challenges as programming language challenges and suggest possible future solutions.

* ESOP 2025
* Accepted at ESOP 2025, European Symposium on Programming Languages

Via

Access Paper or Ask Questions

Automated Repair of AI Code with Large Language Models and Formal Verification

May 14, 2024

Yiannis Charalambous, Edoardo Manino, Lucas C. Cordeiro

Figure 1 for Automated Repair of AI Code with Large Language Models and Formal Verification

Figure 2 for Automated Repair of AI Code with Large Language Models and Formal Verification

Figure 3 for Automated Repair of AI Code with Large Language Models and Formal Verification

Figure 4 for Automated Repair of AI Code with Large Language Models and Formal Verification

Abstract:The next generation of AI systems requires strong safety guarantees. This report looks at the software implementation of neural networks and related memory safety properties, including NULL pointer deference, out-of-bound access, double-free, and memory leaks. Our goal is to detect these vulnerabilities, and automatically repair them with the help of large language models. To this end, we first expand the size of NeuroCodeBench, an existing dataset of neural network code, to about 81k programs via an automated process of program mutation. Then, we verify the memory safety of the mutated neural network implementations with ESBMC, a state-of-the-art software verifier. Whenever ESBMC spots a vulnerability, we invoke a large language model to repair the source code. For the latest task, we compare the performance of various state-of-the-art prompt engineering techniques, and an iterative approach that repeatedly calls the large language model.

Via

Access Paper or Ask Questions

NeuroCodeBench: a plain C neural network benchmark for software verification

Sep 07, 2023

Edoardo Manino, Rafael Sá Menezes, Fedor Shmarov, Lucas C. Cordeiro

Figure 1 for NeuroCodeBench: a plain C neural network benchmark for software verification

Figure 2 for NeuroCodeBench: a plain C neural network benchmark for software verification

Figure 3 for NeuroCodeBench: a plain C neural network benchmark for software verification

Abstract:Safety-critical systems with neural network components require strong guarantees. While existing neural network verification techniques have shown great progress towards this goal, they cannot prove the absence of software faults in the network implementation. This paper presents NeuroCodeBench - a verification benchmark for neural network code written in plain C. It contains 32 neural networks with 607 safety properties divided into 6 categories: maths library, activation functions, error-correcting networks, transfer function approximation, probability density estimation and reinforcement learning. Our preliminary evaluation shows that state-of-the-art software verifiers struggle to provide correct verdicts, due to their incomplete support of the standard C mathematical library and the complexity of larger neural networks.

* Submitted to the 2023 AFRiTS workshop

Via

Access Paper or Ask Questions

LF-checker: Machine Learning Acceleration of Bounded Model Checking for Concurrency Verification (Competition Contribution)

Jan 22, 2023

Tong Wu, Edoardo Manino, Fatimah Aljaafari, Pavlos Petoumenos, Lucas C. Cordeiro

Figure 1 for LF-checker: Machine Learning Acceleration of Bounded Model Checking for Concurrency Verification (Competition Contribution)

Figure 2 for LF-checker: Machine Learning Acceleration of Bounded Model Checking for Concurrency Verification (Competition Contribution)

Figure 3 for LF-checker: Machine Learning Acceleration of Bounded Model Checking for Concurrency Verification (Competition Contribution)

Abstract:We describe and evaluate LF-checker, a metaverifier tool based on machine learning. It extracts multiple features of the program under test and predicts the optimal configuration (flags) of a bounded model checker with a decision tree. Our current work is specialised in concurrency verification and employs ESBMC as a back-end verification engine. In the paper, we demonstrate that LF-checker achieves better results than the default configuration of the underlying verification engine.

Via

Access Paper or Ask Questions

CEG4N: Counter-Example Guided Neural Network Quantization Refinement

Jul 09, 2022

João Batista P. Matos Jr., Iury Bessa, Edoardo Manino, Xidan Song, Lucas C. Cordeiro

Figure 1 for CEG4N: Counter-Example Guided Neural Network Quantization Refinement

Figure 2 for CEG4N: Counter-Example Guided Neural Network Quantization Refinement

Figure 3 for CEG4N: Counter-Example Guided Neural Network Quantization Refinement

Figure 4 for CEG4N: Counter-Example Guided Neural Network Quantization Refinement

Abstract:Neural networks are essential components of learning-based software systems. However, their high compute, memory, and power requirements make using them in low resources domains challenging. For this reason, neural networks are often quantized before deployment. Existing quantization techniques tend to degrade the network accuracy. We propose Counter-Example Guided Neural Network Quantization Refinement (CEG4N). This technique combines search-based quantization and equivalence verification: the former minimizes the computational requirements, while the latter guarantees that the network's output does not change after quantization. We evaluate CEG4N~on a diverse set of benchmarks, including large and small networks. Our technique successfully quantizes the networks in our evaluation while producing models with up to 72% better accuracy than state-of-the-art techniques.

Via

Access Paper or Ask Questions

Systematicity, Compositionality and Transitivity of Deep NLP Models: a Metamorphic Testing Perspective

Apr 26, 2022

Edoardo Manino, Julia Rozanova, Danilo Carvalho, Andre Freitas, Lucas Cordeiro

Figure 1 for Systematicity, Compositionality and Transitivity of Deep NLP Models: a Metamorphic Testing Perspective

Figure 2 for Systematicity, Compositionality and Transitivity of Deep NLP Models: a Metamorphic Testing Perspective

Figure 3 for Systematicity, Compositionality and Transitivity of Deep NLP Models: a Metamorphic Testing Perspective

Figure 4 for Systematicity, Compositionality and Transitivity of Deep NLP Models: a Metamorphic Testing Perspective

Abstract:Metamorphic testing has recently been used to check the safety of neural NLP models. Its main advantage is that it does not rely on a ground truth to generate test cases. However, existing studies are mostly concerned with robustness-like metamorphic relations, limiting the scope of linguistic properties they can test. We propose three new classes of metamorphic relations, which address the properties of systematicity, compositionality and transitivity. Unlike robustness, our relations are defined over multiple source inputs, thus increasing the number of test cases that we can produce by a polynomial factor. With them, we test the internal consistency of state-of-the-art NLP models, and show that they do not always behave according to their expected linguistic properties. Lastly, we introduce a novel graphical notation that efficiently summarises the inner structure of metamorphic relations.

* Findings of the Association for Computational Linguistics 2022

Via

Access Paper or Ask Questions

QNNVerifier: A Tool for Verifying Neural Networks using SMT-Based Model Checking

Nov 25, 2021

Xidan Song, Edoardo Manino, Luiz Sena, Erickson Alves, Eddie de Lima Filho, Iury Bessa, Mikel Lujan, Lucas Cordeiro

Figure 1 for QNNVerifier: A Tool for Verifying Neural Networks using SMT-Based Model Checking

Figure 2 for QNNVerifier: A Tool for Verifying Neural Networks using SMT-Based Model Checking

Figure 3 for QNNVerifier: A Tool for Verifying Neural Networks using SMT-Based Model Checking

Figure 4 for QNNVerifier: A Tool for Verifying Neural Networks using SMT-Based Model Checking

Abstract:QNNVerifier is the first open-source tool for verifying implementations of neural networks that takes into account the finite word-length (i.e. quantization) of their operands. The novel support for quantization is achieved by employing state-of-the-art software model checking (SMC) techniques. It translates the implementation of neural networks to a decidable fragment of first-order logic based on satisfiability modulo theories (SMT). The effects of fixed- and floating-point operations are represented through direct implementations given a hardware-determined precision. Furthermore, QNNVerifier allows to specify bespoke safety properties and verify the resulting model with different verification strategies (incremental and k-induction) and SMT solvers. Finally, QNNVerifier is the first tool that combines invariant inference via interval analysis and discretization of non-linear activation functions to speed up the verification of neural networks by orders of magnitude. A video presentation of QNNVerifier is available at https://youtu.be/7jMgOL41zTY

* Submitted to the Demo track of the ICSE 2022 conference

Via

Access Paper or Ask Questions