Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Lina Marsso

Assessing Visually-Continuous Corruption Robustness of Neural Networks Relative to Human Performance

Feb 29, 2024

Huakun Shen, Boyue Caroline Hu, Krzysztof Czarnecki, Lina Marsso, Marsha Chechik

Abstract:While Neural Networks (NNs) have surpassed human accuracy in image classification on ImageNet, they often lack robustness against image corruption, i.e., corruption robustness. Yet such robustness is seemingly effortless for human perception. In this paper, we propose visually-continuous corruption robustness (VCR) -- an extension of corruption robustness to allow assessing it over the wide and continuous range of changes that correspond to the human perceptive quality (i.e., from the original image to the full distortion of all perceived visual information), along with two novel human-aware metrics for NN evaluation. To compare VCR of NNs with human perception, we conducted extensive experiments on 14 commonly used image corruptions with 7,718 human participants and state-of-the-art robust NN models with different training objectives (e.g., standard, adversarial, corruption robustness), different architectures (e.g., convolution NNs, vision transformers), and different amounts of training data augmentation. Our study showed that: 1) assessing robustness against continuous corruption can reveal insufficient robustness undetected by existing benchmarks; as a result, 2) the gap between NN and human robustness is larger than previously known; and finally, 3) some image corruptions have a similar impact on human perception, offering opportunities for more cost-effective robustness assessments. Our validation set with 14 image corruptions, human robustness data, and the evaluation code is provided as a toolbox and a benchmark.

Via

Access Paper or Ask Questions

Formally Modeling Autonomous Vehicles in LNT for Simulation and Testing

Mar 18, 2022

Lina Marsso, Radu Mateescu, Lucie Muller, Wendelin Serwe

Figure 1 for Formally Modeling Autonomous Vehicles in LNT for Simulation and Testing

Figure 2 for Formally Modeling Autonomous Vehicles in LNT for Simulation and Testing

Figure 3 for Formally Modeling Autonomous Vehicles in LNT for Simulation and Testing

Figure 4 for Formally Modeling Autonomous Vehicles in LNT for Simulation and Testing

Abstract:We present two behavioral models of an autonomous vehicle and its interaction with the environment. Both models use the formal modeling language LNT provided by the CADP toolbox. This paper discusses the modeling choices and the challenges of our autonomous vehicle models, and also illustrates how formal validation tools can be applied to a single component or the overall vehicle.

* EPTCS 355, 2022, pp. 60-117
* In Proceedings MARS 2022, arXiv:2203.09299

Via

Access Paper or Ask Questions

If a Human Can See It, So Should Your System: Reliability Requirements for Machine Vision Components

Feb 08, 2022

Boyue Caroline Hu, Lina Marsso, Krzysztof Czarnecki, Rick Salay, Huakun Shen, Marsha Chechik

Figure 1 for If a Human Can See It, So Should Your System: Reliability Requirements for Machine Vision Components

Figure 2 for If a Human Can See It, So Should Your System: Reliability Requirements for Machine Vision Components

Figure 3 for If a Human Can See It, So Should Your System: Reliability Requirements for Machine Vision Components

Figure 4 for If a Human Can See It, So Should Your System: Reliability Requirements for Machine Vision Components

Abstract:Machine Vision Components (MVC) are becoming safety-critical. Assuring their quality, including safety, is essential for their successful deployment. Assurance relies on the availability of precisely specified and, ideally, machine-verifiable requirements. MVCs with state-of-the-art performance rely on machine learning (ML) and training data but largely lack such requirements. In this paper, we address the need for defining machine-verifiable reliability requirements for MVCs against transformations that simulate the full range of realistic and safety-critical changes in the environment. Using human performance as a baseline, we define reliability requirements as: 'if the changes in an image do not affect a human's decision, neither should they affect the MVC's.' To this end, we provide: (1) a class of safety-related image transformations; (2) reliability requirement classes to specify correctness-preservation and prediction-preservation for MVCs; (3) a method to instantiate machine-verifiable requirements from these requirements classes using human performance experiment data; (4) human performance experiment data for image recognition involving eight commonly used transformations, from about 2000 human participants; and (5) a method for automatically checking whether an MVC satisfies our requirements. Further, we show that our reliability requirements are feasible and reusable by evaluating our methods on 13 state-of-the-art pre-trained image classification models. Finally, we demonstrate that our approach detects reliability gaps in MVCs that other existing methods are unable to detect.

Via

Access Paper or Ask Questions