Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Emmanouil Seferis

Randomized Smoothing Meets Vision-Language Models

Sep 19, 2025

Emmanouil Seferis, Changshun Wu, Stefanos Kollias, Saddek Bensalem, Chih-Hong Cheng

Abstract:Randomized smoothing (RS) is one of the prominent techniques to ensure the correctness of machine learning models, where point-wise robustness certificates can be derived analytically. While RS is well understood for classification, its application to generative models is unclear, since their outputs are sequences rather than labels. We resolve this by connecting generative outputs to an oracle classification task and showing that RS can still be enabled: the final response can be classified as a discrete action (e.g., service-robot commands in VLAs), as harmful vs. harmless (content moderation or toxicity detection in VLMs), or even applying oracles to cluster answers into semantically equivalent ones. Provided that the error rate for the oracle classifier comparison is bounded, we develop the theory that associates the number of samples with the corresponding robustness radius. We further derive improved scaling laws analytically relating the certified radius and accuracy to the number of samples, showing that the earlier result of 2 to 3 orders of magnitude fewer samples sufficing with minimal loss remains valid even under weaker assumptions. Together, these advances make robustness certification both well-defined and computationally feasible for state-of-the-art VLMs, as validated against recent jailbreak-style adversarial attacks.

* EMNLP'25 full version, including appendix (proofs, additional experiments)

Via

Access Paper or Ask Questions

Estimating the Robustness Radius for Randomized Smoothing with 100$\times$ Sample Efficiency

Apr 26, 2024

Emmanouil Seferis, Stefanos Kollias, Chih-Hong Cheng

Abstract:Randomized smoothing (RS) has successfully been used to improve the robustness of predictions for deep neural networks (DNNs) by adding random noise to create multiple variations of an input, followed by deciding the consensus. To understand if an RS-enabled DNN is effective in the sampled input domains, it is mandatory to sample data points within the operational design domain, acquire the point-wise certificate regarding robustness radius, and compare it with pre-defined acceptance criteria. Consequently, ensuring that a point-wise robustness certificate for any given data point is obtained relatively cost-effectively is crucial. This work demonstrates that reducing the number of samples by one or two orders of magnitude can still enable the computation of a slightly smaller robustness radius (commonly ~20% radius reduction) with the same confidence. We provide the mathematical foundation for explaining the phenomenon while experimentally showing promising results on the standard CIFAR-10 and ImageNet datasets.

Via

Access Paper or Ask Questions

Prioritizing Corners in OoD Detectors via Symbolic String Manipulation

May 16, 2022

Chih-Hong Cheng, Changshun Wu, Emmanouil Seferis, Saddek Bensalem

Figure 1 for Prioritizing Corners in OoD Detectors via Symbolic String Manipulation

Figure 2 for Prioritizing Corners in OoD Detectors via Symbolic String Manipulation

Figure 3 for Prioritizing Corners in OoD Detectors via Symbolic String Manipulation

Figure 4 for Prioritizing Corners in OoD Detectors via Symbolic String Manipulation

Abstract:For safety assurance of deep neural networks (DNNs), out-of-distribution (OoD) monitoring techniques are essential as they filter spurious input that is distant from the training dataset. This paper studies the problem of systematically testing OoD monitors to avoid cases where an input data point is tested as in-distribution by the monitor, but the DNN produces spurious output predictions. We consider the definition of "in-distribution" characterized in the feature space by a union of hyperrectangles learned from the training dataset. Thus the testing is reduced to finding corners in hyperrectangles distant from the available training data in the feature space. Concretely, we encode the abstract location of every data point as a finite-length binary string, and the union of all binary strings is stored compactly using binary decision diagrams (BDDs). We demonstrate how to use BDDs to symbolically extract corners distant from all data points within the training set. Apart from test case generation, we explain how to use the proposed corners to fine-tune the DNN to ensure that it does not predict overly confidently. The result is evaluated over examples such as number and traffic sign recognition.

Via

Access Paper or Ask Questions

Unaligned but Safe -- Formally Compensating Performance Limitations for Imprecise 2D Object Detection

Feb 10, 2022

Tobias Schuster, Emmanouil Seferis, Simon Burton, Chih-Hong Cheng

Figure 1 for Unaligned but Safe -- Formally Compensating Performance Limitations for Imprecise 2D Object Detection

Figure 2 for Unaligned but Safe -- Formally Compensating Performance Limitations for Imprecise 2D Object Detection

Figure 3 for Unaligned but Safe -- Formally Compensating Performance Limitations for Imprecise 2D Object Detection

Figure 4 for Unaligned but Safe -- Formally Compensating Performance Limitations for Imprecise 2D Object Detection

Abstract:In this paper, we consider the imperfection within machine learning-based 2D object detection and its impact on safety. We address a special sub-type of performance limitations: the prediction bounding box cannot be perfectly aligned with the ground truth, but the computed Intersection-over-Union metric is always larger than a given threshold. Under such type of performance limitation, we formally prove the minimum required bounding box enlargement factor to cover the ground truth. We then demonstrate that the factor can be mathematically adjusted to a smaller value, provided that the motion planner takes a fixed-length buffer in making its decisions. Finally, observing the difference between an empirically measured enlargement factor and our formally derived worst-case enlargement factor offers an interesting connection between the quantitative evidence (demonstrated by statistics) and the qualitative evidence (demonstrated by worst-case analysis).

Via

Access Paper or Ask Questions