Abstract: Access to pre-trained models has recently emerged as a standard across numerous machine learning domains. Unfortunately, access to the original data the models were trained on may not be granted equally. This makes it tremendously challenging to fine-tune or compress models, to adapt them continually, or to perform any other type of data-driven update. We posit, however, that access to the original data may not be required. Specifically, we propose Contrastive Abductive Knowledge Extraction (CAKE), a model-agnostic knowledge distillation procedure that mimics deep classifiers without access to the original data. To this end, CAKE generates pairs of noisy synthetic samples and diffuses them contrastively toward a model's decision boundary. We empirically corroborate CAKE's effectiveness on several benchmark datasets and across various architectural choices, paving the way for broad application.
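As a rough illustration of the idea described above, the following PyTorch sketch optimizes a pair of noise samples so that the teacher assigns them low confidence (keeping them near its decision boundary) while their predicted class distributions diverge (the contrastive term). This is not the paper's exact objective: the `teacher` interface, the loss composition, and the optimization schedule are all assumptions made for the sake of a self-contained example.

```python
import torch
import torch.nn.functional as F

def synthesize_pair(teacher, shape, steps=200, lr=0.1, tau=1.0):
    """Hypothetical sketch of boundary-seeking contrastive synthesis.

    `teacher` is assumed to be a frozen classifier returning logits;
    `shape` is the per-sample input shape, e.g. (3, 32, 32).
    """
    teacher.eval()
    x_a = torch.randn(1, *shape, requires_grad=True)
    x_b = torch.randn(1, *shape, requires_grad=True)
    opt = torch.optim.Adam([x_a, x_b], lr=lr)
    for _ in range(steps):
        opt.zero_grad()
        p_a = F.softmax(teacher(x_a) / tau, dim=1)
        p_b = F.softmax(teacher(x_b) / tau, dim=1)
        # Low maximum confidence keeps both samples near the boundary...
        near_boundary = p_a.max() + p_b.max()
        # ...while maximizing the divergence between the two predictions
        # pushes the pair toward opposite sides of it (contrastive term).
        contrast = -F.kl_div(p_a.log(), p_b, reduction="batchmean")
        (near_boundary + contrast).backward()
        opt.step()
    return x_a.detach(), x_b.detach()
```

In a distillation loop, a student would then be fit on such synthetic pairs against the teacher's soft labels, as in standard knowledge distillation.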
Abstract: Probabilistic circuits (PCs) are models that allow exact and tractable probabilistic inference. In contrast to neural networks, they are often assumed to be well calibrated and robust to out-of-distribution (OOD) data. In this paper, we show that PCs are in fact not robust to OOD data, i.e., they don't know what they don't know. We then show how this challenge can be overcome by model uncertainty quantification. To this end, we propose tractable dropout inference (TDI), an inference procedure that estimates uncertainty by deriving an analytical solution to Monte Carlo dropout (MCD) through variance propagation. Unlike MCD in neural networks, which comes at the cost of multiple network evaluations, TDI provides tractable, sampling-free uncertainty estimates in a single forward pass. TDI improves the robustness of PCs to distribution shift and OOD data, as we demonstrate through a series of experiments evaluating classification confidence and uncertainty estimates on real-world data.
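To make the variance-propagation idea concrete, here is a minimal NumPy sketch of moment-propagation rules for the two node types of a PC under Bernoulli dropout with keep-probability q. It assumes independent children and linear-space computation, which are simplifications; the full TDI derivation in the paper handles further details such as numerically stable log-space evaluation.

```python
import numpy as np

def sum_node_moments(weights, child_means, child_vars, q=0.9):
    """Moments of S = sum_i w_i * d_i * c_i, with d_i ~ Bernoulli(q)
    modeling dropout on the children (assumed mutually independent).

    E[S]   = q * sum_i w_i E[c_i]
    Var[S] = sum_i w_i^2 * (q Var[c_i] + q(1-q) E[c_i]^2)
    """
    w = np.asarray(weights)
    m = np.asarray(child_means)
    v = np.asarray(child_vars)
    mean = q * np.sum(w * m)
    var = np.sum(w**2 * (q * v + q * (1 - q) * m**2))
    return mean, var

def product_node_moments(child_means, child_vars):
    """Moments of P = prod_i c_i for independent children:

    E[P]   = prod_i E[c_i]
    Var[P] = prod_i (Var[c_i] + E[c_i]^2) - prod_i E[c_i]^2
    """
    m = np.asarray(child_means)
    v = np.asarray(child_vars)
    mean = np.prod(m)
    var = np.prod(v + m**2) - np.prod(m**2)
    return mean, var
```

Applying these rules bottom-up through the circuit yields a mean and a variance at the root in a single pass, which is the sampling-free behavior the abstract contrasts with MCD's repeated network evaluations.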