Abstract: Despite their unprecedented success, deep neural networks (DNNs) are notoriously fragile to small shifts in data distribution, demanding effective testing techniques that can assess their dependability. Yet, despite recent advances in DNN testing, systematic approaches that assess a DNN's capability to generalise and operate comparably beyond the data in its training distribution are still lacking. We address this gap with DeepKnowledge, a systematic testing methodology for DNN-based systems founded on the theory of knowledge generalisation, which aims to enhance DNN robustness and reduce the residual risk of 'black box' models. Conforming to this theory, DeepKnowledge posits that core computational DNN units, termed Transfer Knowledge neurons, can generalise under domain shift. DeepKnowledge provides an objective confidence measurement of DNN testing activities under data distribution shift and uses this information to instrument a generalisation-informed test adequacy criterion that checks the transfer knowledge capacity of a test set. Our empirical evaluation of several DNNs, across multiple datasets and state-of-the-art adversarial generation techniques, demonstrates the usefulness and effectiveness of DeepKnowledge and its ability to support the engineering of more dependable DNNs. We report improvements of up to 10 percentage points over state-of-the-art coverage criteria for detecting adversarial attacks on several benchmarks, including MNIST, SVHN, and CIFAR.
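To make the notion of a test adequacy criterion over designated neurons concrete, the sketch below illustrates, in simplified form, how coverage of a pre-identified set of "transfer knowledge" neurons by a test set might be computed. It is a minimal illustration only: the function name, the threshold-based activation check, and the way neuron indices are supplied are assumptions for exposition and do not reproduce DeepKnowledge's actual identification procedure or adequacy formula.

```python
# Illustrative sketch, not DeepKnowledge's method: measure what fraction of a
# hypothetical set of transfer-knowledge neurons is exercised by a test set.
import numpy as np


def tk_coverage(activations: np.ndarray,
                tk_neurons: list[int],
                threshold: float = 0.5) -> float:
    """Fraction of designated transfer-knowledge neurons activated above a
    threshold by at least one test input.

    activations: array of shape (n_inputs, n_neurons), per-neuron outputs of a
                 chosen layer for each test input.
    tk_neurons:  indices of neurons assumed (hypothetically) to generalise
                 under domain shift.
    """
    covered = sum(
        1 for idx in tk_neurons if np.any(activations[:, idx] > threshold)
    )
    return covered / len(tk_neurons)


# Example with random data standing in for a real DNN's layer activations.
rng = np.random.default_rng(0)
acts = rng.random((100, 32))  # 100 test inputs, 32 neurons in the layer
print(tk_coverage(acts, tk_neurons=[1, 5, 9, 20], threshold=0.8))
```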