Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jacson Rodrigues Correia-Silva

Copycat CNN: Are Random Non-Labeled Data Enough to Steal Knowledge from Black-box Models?

Jan 21, 2021

Jacson Rodrigues Correia-Silva, Rodrigo F. Berriel, Claudine Badue, Alberto F. De Souza, Thiago Oliveira-Santos

Figure 1 for Copycat CNN: Are Random Non-Labeled Data Enough to Steal Knowledge from Black-box Models?

Figure 2 for Copycat CNN: Are Random Non-Labeled Data Enough to Steal Knowledge from Black-box Models?

Figure 3 for Copycat CNN: Are Random Non-Labeled Data Enough to Steal Knowledge from Black-box Models?

Figure 4 for Copycat CNN: Are Random Non-Labeled Data Enough to Steal Knowledge from Black-box Models?

Abstract:Convolutional neural networks have been successful lately enabling companies to develop neural-based products, which demand an expensive process, involving data acquisition and annotation; and model generation, usually requiring experts. With all these costs, companies are concerned about the security of their models against copies and deliver them as black-boxes accessed by APIs. Nonetheless, we argue that even black-box models still have some vulnerabilities. In a preliminary work, we presented a simple, yet powerful, method to copy black-box models by querying them with natural random images. In this work, we consolidate and extend the copycat method: (i) some constraints are waived; (ii) an extensive evaluation with several problems is performed; (iii) models are copied between different architectures; and, (iv) a deeper analysis is performed by looking at the copycat behavior. Results show that natural random images are effective to generate copycats for several problems.

* Pattern Recognition 113 (2021) 107830
* The code is available at https://github.com/jeiks/Stealing_DL_Models

Via

Access Paper or Ask Questions

Copycat CNN: Stealing Knowledge by Persuading Confession with Random Non-Labeled Data

Jun 14, 2018

Jacson Rodrigues Correia-Silva, Rodrigo F. Berriel, Claudine Badue, Alberto F. de Souza, Thiago Oliveira-Santos

Figure 1 for Copycat CNN: Stealing Knowledge by Persuading Confession with Random Non-Labeled Data

Figure 2 for Copycat CNN: Stealing Knowledge by Persuading Confession with Random Non-Labeled Data

Figure 3 for Copycat CNN: Stealing Knowledge by Persuading Confession with Random Non-Labeled Data

Figure 4 for Copycat CNN: Stealing Knowledge by Persuading Confession with Random Non-Labeled Data

Abstract:In the past few years, Convolutional Neural Networks (CNNs) have been achieving state-of-the-art performance on a variety of problems. Many companies employ resources and money to generate these models and provide them as an API, therefore it is in their best interest to protect them, i.e., to avoid that someone else copies them. Recent studies revealed that state-of-the-art CNNs are vulnerable to adversarial examples attacks, and this weakness indicates that CNNs do not need to operate in the problem domain (PD). Therefore, we hypothesize that they also do not need to be trained with examples of the PD in order to operate in it. Given these facts, in this paper, we investigate if a target black-box CNN can be copied by persuading it to confess its knowledge through random non-labeled data. The copy is two-fold: i) the target network is queried with random data and its predictions are used to create a fake dataset with the knowledge of the network; and ii) a copycat network is trained with the fake dataset and should be able to achieve similar performance as the target network. This hypothesis was evaluated locally in three problems (facial expression, object, and crosswalk classification) and against a cloud-based API. In the copy attacks, images from both non-problem domain and PD were used. All copycat networks achieved at least 93.7% of the performance of the original models with non-problem domain data, and at least 98.6% using additional data from the PD. Additionally, the copycat CNN successfully copied at least 97.3% of the performance of the Microsoft Azure Emotion API. Our results show that it is possible to create a copycat CNN by simply querying a target network as black-box with random non-labeled data.

* 8 pages, 3 figures, accepted by IJCNN 2018

Via

Access Paper or Ask Questions