Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shachar Schnapp

Differentially Private Approximate Quantiles

Oct 11, 2021

Haim Kaplan, Shachar Schnapp, Uri Stemmer

Figure 1 for Differentially Private Approximate Quantiles

Figure 2 for Differentially Private Approximate Quantiles

Figure 3 for Differentially Private Approximate Quantiles

Figure 4 for Differentially Private Approximate Quantiles

Abstract:In this work we study the problem of differentially private (DP) quantiles, in which given dataset $X$ and quantiles $q_1, ..., q_m \in [0,1]$, we want to output $m$ quantile estimations which are as close as possible to the true quantiles and preserve DP. We describe a simple recursive DP algorithm, which we call ApproximateQuantiles (AQ), for this task. We give a worst case upper bound on its error, and show that its error is much lower than of previous implementations on several different datasets. Furthermore, it gets this low error while running time two orders of magnitude faster that the best previous implementation.

Via

Access Paper or Ask Questions

Active Feature Selection for the Mutual Information Criterion

Dec 13, 2020

Shachar Schnapp, Sivan Sabato

Figure 1 for Active Feature Selection for the Mutual Information Criterion

Figure 2 for Active Feature Selection for the Mutual Information Criterion

Figure 3 for Active Feature Selection for the Mutual Information Criterion

Figure 4 for Active Feature Selection for the Mutual Information Criterion

Abstract:We study active feature selection, a novel feature selection setting in which unlabeled data is available, but the budget for labels is limited, and the examples to label can be actively selected by the algorithm. We focus on feature selection using the classical mutual information criterion, which selects the $k$ features with the largest mutual information with the label. In the active feature selection setting, the goal is to use significantly fewer labels than the data set size and still find $k$ features whose mutual information with the label based on the \emph{entire} data set is large. We explain and experimentally study the choices that we make in the algorithm, and show that they lead to a successful algorithm, compared to other more naive approaches. Our design draws on insights which relate the problem of active feature selection to the study of pure-exploration multi-armed bandits settings. While we focus here on mutual information, our general methodology can be adapted to other feature-quality measures as well. The code is available at the following url: https://github.com/ShacharSchnapp/ActiveFeatureSelection.

* To appear in AAAI-21

Via

Access Paper or Ask Questions