Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Zirui Peng

Fingerprinting Deep Neural Networks Globally via Universal Adversarial Perturbations

Feb 22, 2022

Zirui Peng, Shaofeng Li, Guoxing Chen, Cheng Zhang, Haojin Zhu, Minhui Xue

Figure 1 for Fingerprinting Deep Neural Networks Globally via Universal Adversarial Perturbations

Figure 2 for Fingerprinting Deep Neural Networks Globally via Universal Adversarial Perturbations

Figure 3 for Fingerprinting Deep Neural Networks Globally via Universal Adversarial Perturbations

Figure 4 for Fingerprinting Deep Neural Networks Globally via Universal Adversarial Perturbations

Abstract:In this paper, we propose a novel and practical mechanism which enables the service provider to verify whether a suspect model is stolen from the victim model via model extraction attacks. Our key insight is that the profile of a DNN model's decision boundary can be uniquely characterized by its \textit{Universal Adversarial Perturbations (UAPs)}. UAPs belong to a low-dimensional subspace and piracy models' subspaces are more consistent with victim model's subspace compared with non-piracy model. Based on this, we propose a UAP fingerprinting method for DNN models and train an encoder via \textit{contrastive learning} that takes fingerprint as inputs, outputs a similarity score. Extensive studies show that our framework can detect model IP breaches with confidence $> 99.99 \%$ within only $20$ fingerprints of the suspect model. It has good generalizability across different model architectures and is robust against post-modifications on stolen models.

Via

Access Paper or Ask Questions