Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Lorenzo Cazzaro

Università Ca' Foscari Venezia, Italy

Watermarking Decision Tree Ensembles

Oct 06, 2024

Stefano Calzavara, Lorenzo Cazzaro, Donald Gera, Salvatore Orlando

Figure 1 for Watermarking Decision Tree Ensembles

Figure 2 for Watermarking Decision Tree Ensembles

Figure 3 for Watermarking Decision Tree Ensembles

Figure 4 for Watermarking Decision Tree Ensembles

Abstract:Protecting the intellectual property of machine learning models is a hot topic and many watermarking schemes for deep neural networks have been proposed in the literature. Unfortunately, prior work largely neglected the investigation of watermarking techniques for other types of models, including decision tree ensembles, which are a state-of-the-art model for classification tasks on non-perceptual data. In this paper, we present the first watermarking scheme designed for decision tree ensembles, focusing in particular on random forest models. We discuss watermark creation and verification, presenting a thorough security analysis with respect to possible attacks. We finally perform an experimental evaluation of the proposed scheme, showing excellent results in terms of accuracy and security against the most relevant threats.

* 7 pages, 5 figures, 2 tables

Via

Access Paper or Ask Questions

Timber! Poisoning Decision Trees

Oct 01, 2024

Stefano Calzavara, Lorenzo Cazzaro, Massimo Vettori

Figure 1 for Timber! Poisoning Decision Trees

Figure 2 for Timber! Poisoning Decision Trees

Figure 3 for Timber! Poisoning Decision Trees

Figure 4 for Timber! Poisoning Decision Trees

Abstract:We present Timber, the first white-box poisoning attack targeting decision trees. Timber is based on a greedy attack strategy leveraging sub-tree retraining to efficiently estimate the damage performed by poisoning a given training instance. The attack relies on a tree annotation procedure which enables sorting training instances so that they are processed in increasing order of computational cost of sub-tree retraining. This sorting yields a variant of Timber supporting an early stopping criterion designed to make poisoning attacks more efficient and feasible on larger datasets. We also discuss an extension of Timber to traditional random forest models, which is useful because decision trees are normally combined into ensembles to improve their predictive power. Our experimental evaluation on public datasets shows that our attacks outperform existing baselines in terms of effectiveness, efficiency or both. Moreover, we show that two representative defenses can mitigate the effect of our attacks, but fail at effectively thwarting them.

* 18 pages, 7 figures, 5 tables

Via

Access Paper or Ask Questions

Verifiable Boosted Tree Ensembles

Feb 22, 2024

Stefano Calzavara, Lorenzo Cazzaro, Claudio Lucchese, Giulio Ermanno Pibiri

Abstract:Verifiable learning advocates for training machine learning models amenable to efficient security verification. Prior research demonstrated that specific classes of decision tree ensembles -- called large-spread ensembles -- allow for robustness verification in polynomial time against any norm-based attacker. This study expands prior work on verifiable learning from basic ensemble methods (i.e., hard majority voting) to advanced boosted tree ensembles, such as those trained using XGBoost or LightGBM. Our formal results indicate that robustness verification is achievable in polynomial time when considering attackers based on the $L_\infty$-norm, but remains NP-hard for other norm-based attackers. Nevertheless, we present a pseudo-polynomial time algorithm to verify robustness against attackers based on the $L_p$-norm for any $p \in \mathbb{N} \cup \{0\}$, which in practice grants excellent performance. Our experimental evaluation shows that large-spread boosted ensembles are accurate enough for practical adoption, while being amenable to efficient security verification.

* 15 pages, 3 figures

Via

Access Paper or Ask Questions

Verifiable Learning for Robust Tree Ensembles

May 05, 2023

Stefano Calzavara, Lorenzo Cazzaro, Giulio Ermanno Pibiri, Nicola Prezza

Abstract:Verifying the robustness of machine learning models against evasion attacks at test time is an important research problem. Unfortunately, prior work established that this problem is NP-hard for decision tree ensembles, hence bound to be intractable for specific inputs. In this paper, we identify a restricted class of decision tree ensembles, called large-spread ensembles, which admit a security verification algorithm running in polynomial time. We then propose a new approach called verifiable learning, which advocates the training of such restricted model classes which are amenable for efficient verification. We show the benefits of this idea by designing a new training algorithm that automatically learns a large-spread decision tree ensemble from labelled data, thus enabling its security verification in polynomial time. Experimental results on publicly available datasets confirm that large-spread ensembles trained using our algorithm can be verified in a matter of seconds, using standard commercial hardware. Moreover, large-spread ensembles are more robust than traditional ensembles against evasion attacks, while incurring in just a relatively small loss of accuracy in the non-adversarial setting.

* 17 pages, 3 figures

Via

Access Paper or Ask Questions

Explainable Global Fairness Verification of Tree-Based Classifiers

Sep 27, 2022

Stefano Calzavara, Lorenzo Cazzaro, Claudio Lucchese, Federico Marcuzzi

Figure 1 for Explainable Global Fairness Verification of Tree-Based Classifiers

Figure 2 for Explainable Global Fairness Verification of Tree-Based Classifiers

Figure 3 for Explainable Global Fairness Verification of Tree-Based Classifiers

Figure 4 for Explainable Global Fairness Verification of Tree-Based Classifiers

Abstract:We present a new approach to the global fairness verification of tree-based classifiers. Given a tree-based classifier and a set of sensitive features potentially leading to discrimination, our analysis synthesizes sufficient conditions for fairness, expressed as a set of traditional propositional logic formulas, which are readily understandable by human experts. The verified fairness guarantees are global, in that the formulas predicate over all the possible inputs of the classifier, rather than just a few specific test instances. Our analysis is formally proved both sound and complete. Experimental results on public datasets show that the analysis is precise, explainable to human experts and efficient enough for practical adoption.

* 15 pages with 7 figures

Via

Access Paper or Ask Questions

Beyond Robustness: Resilience Verification of Tree-Based Classifiers

Dec 05, 2021

Stefano Calzavara, Lorenzo Cazzaro, Claudio Lucchese, Federico Marcuzzi, Salvatore Orlando

Figure 1 for Beyond Robustness: Resilience Verification of Tree-Based Classifiers

Figure 2 for Beyond Robustness: Resilience Verification of Tree-Based Classifiers

Figure 3 for Beyond Robustness: Resilience Verification of Tree-Based Classifiers

Figure 4 for Beyond Robustness: Resilience Verification of Tree-Based Classifiers

Abstract:In this paper we criticize the robustness measure traditionally employed to assess the performance of machine learning models deployed in adversarial settings. To mitigate the limitations of robustness, we introduce a new measure called resilience and we focus on its verification. In particular, we discuss how resilience can be verified by combining a traditional robustness verification technique with a data-independent stability analysis, which identifies a subset of the feature space where the model does not change its predictions despite adversarial manipulations. We then introduce a formally sound data-independent stability analysis for decision trees and decision tree ensembles, which we experimentally assess on public datasets and we leverage for resilience verification. Our results show that resilience verification is useful and feasible in practice, yielding a more reliable security assessment of both standard and robust decision tree models.

Via

Access Paper or Ask Questions