Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Eric Bax

Continuous Monitoring via Repeated Significance

Aug 05, 2024

Eric Bax, Arundhyoti Sarkar, Alex Shtoff

Abstract:Requiring statistical significance at multiple interim analyses to declare a statistically significant result for an AB test allows less stringent requirements for significance at each interim analysis. Repeated repeated significance competes well with methods built on assumptions about the test -- assumptions that may be impossible to evaluate a priori and may require extra data to evaluate empirically. Instead, requiring repeated significance allows the data itself to prove directly that the required results are not due to chance alone. We explain how to apply tests with repeated significance to continuously monitor unbounded tests -- tests that do not have an a priori bound on running time or number of observations. We show that it is impossible to maintain a constant requirement for significance for unbounded tests, but that we can come arbitrarily close to that goal.

Via

Access Paper or Ask Questions

Early Stopping Based on Repeated Significance

Aug 01, 2024

Eric Bax, Arundhyoti Sarkar, Alex Shtoff

Abstract:For a bucket test with a single criterion for success and a fixed number of samples or testing period, requiring a $p$-value less than a specified value of $\alpha$ for the success criterion produces statistical confidence at level $1 - \alpha$. For multiple criteria, a Bonferroni correction that partitions $\alpha$ among the criteria produces statistical confidence, at the cost of requiring lower $p$-values for each criterion. The same concept can be applied to decisions about early stopping, but that can lead to strict requirements for $p$-values. We show how to address that challenge by requiring criteria to be successful at multiple decision points.

Via

Access Paper or Ask Questions

Non-asymptotic approximations for Pearson's chi-square statistic and its application to confidence intervals for strictly convex functions of the probability weights of discrete distributions

Sep 05, 2023

Eric Bax, Frédéric Ouimet

Abstract:In this paper, we develop a non-asymptotic local normal approximation for multinomial probabilities. First, we use it to find non-asymptotic total variation bounds between the measures induced by uniformly jittered multinomials and the multivariate normals with the same means and covariances. From the total variation bounds, we also derive a comparison of the cumulative distribution functions and quantile coupling inequalities between Pearson's chi-square statistic (written as the normalized quadratic form of a multinomial vector) and its multivariate normal analogue. We apply our results to find confidence intervals for the negative entropy of discrete distributions. Our method can be applied more generally to find confidence intervals for strictly convex functions of the weights of discrete distributions.

* 20 pages, 3 figures

Via

Access Paper or Ask Questions

Sharp Frequency Bounds for Sample-Based Queries

Aug 14, 2022

Eric Bax, John Donald

Abstract:A data sketch algorithm scans a big data set, collecting a small amount of data -- the sketch, which can be used to statistically infer properties of the big data set. Some data sketch algorithms take a fixed-size random sample of a big data set, and use that sample to infer frequencies of items that meet various criteria in the big data set. This paper shows how to statistically infer probably approximately correct (PAC) bounds for those frequencies, efficiently, and precisely enough that the frequency bounds are either sharp or off by only one, which is the best possible result without exact computation.

* In 2019 IEEE Big Data, pages 5983-5985, 2019
* 3 pages

Via

Access Paper or Ask Questions

Selecting a number of voters for a voting ensemble

Apr 23, 2021

Eric Bax

Figure 1 for Selecting a number of voters for a voting ensemble

Figure 2 for Selecting a number of voters for a voting ensemble

Figure 3 for Selecting a number of voters for a voting ensemble

Abstract:For a voting ensemble that selects an odd-sized subset of the ensemble classifiers at random for each example, applies them to the example, and returns the majority vote, we show that any number of voters may minimize the error rate over an out-of-sample distribution. The optimal number of voters depends on the out-of-sample distribution of the number of classifiers in error. To select a number of voters to use, estimating that distribution then inferring error rates for numbers of voters gives lower-variance estimates than directly estimating those error rates.

Via

Access Paper or Ask Questions

Ensemble Validation: Selectivity has a Price, but Variety is Free

Apr 25, 2018

Eric Bax, Farshad Kooti

Figure 1 for Ensemble Validation: Selectivity has a Price, but Variety is Free

Figure 2 for Ensemble Validation: Selectivity has a Price, but Variety is Free

Figure 3 for Ensemble Validation: Selectivity has a Price, but Variety is Free

Figure 4 for Ensemble Validation: Selectivity has a Price, but Variety is Free

Abstract:Suppose some classifiers are selected from a set of hypothesis classifiers to form an equally-weighted ensemble that selects a member classifier at random for each input example. Then the ensemble has an error bound consisting of the average error bound for the member classifiers, a term for selectivity that varies from zero (if all hypothesis classifiers are selected) to a standard uniform error bound (if only a single classifier is selected), and small constants. There is no penalty for using a richer hypothesis set if the same fraction of the hypothesis classifiers are selected for the ensemble.

Via

Access Paper or Ask Questions

Speculate-Correct Error Bounds for k-Nearest Neighbor Classifiers

Sep 15, 2017

Eric Bax, Lingjie Weng, Xu Tian

Figure 1 for Speculate-Correct Error Bounds for k-Nearest Neighbor Classifiers

Figure 2 for Speculate-Correct Error Bounds for k-Nearest Neighbor Classifiers

Figure 3 for Speculate-Correct Error Bounds for k-Nearest Neighbor Classifiers

Abstract:We introduce the speculate-correct method to derive error bounds for local classifiers. Using it, we show that k nearest neighbor classifiers, in spite of their famously fractured decision boundaries, have exponential error bounds with O(sqrt((k + ln n) / n)) error bound range for n in-sample examples.

Via

Access Paper or Ask Questions

Validation of Matching

Apr 11, 2016

Ya Le, Eric Bax, Nicola Barbieri, David Garcia Soriano, Jitesh Mehta, James Li

Abstract:We introduce a technique to compute probably approximately correct (PAC) bounds on precision and recall for matching algorithms. The bounds require some verified matches, but those matches may be used to develop the algorithms. The bounds can be applied to network reconciliation or entity resolution algorithms, which identify nodes in different networks or values in a data set that correspond to the same entity. For network reconciliation, the bounds do not require knowledge of the network generation process.

Via

Access Paper or Ask Questions

Some Theory For Practical Classifier Validation

Oct 09, 2015

Eric Bax, Ya Le

Figure 1 for Some Theory For Practical Classifier Validation

Figure 2 for Some Theory For Practical Classifier Validation

Figure 3 for Some Theory For Practical Classifier Validation

Abstract:We compare and contrast two approaches to validating a trained classifier while using all in-sample data for training. One is simultaneous validation over an organized set of hypotheses (SVOOSH), the well-known method that began with VC theory. The other is withhold and gap (WAG). WAG withholds a validation set, trains a holdout classifier on the remaining data, uses the validation data to validate that classifier, then adds the rate of disagreement between the holdout classifier and one trained using all in-sample data, which is an upper bound on the difference in error rates. We show that complex hypothesis classes and limited training data can make WAG a favorable alternative.

Via

Access Paper or Ask Questions

Improved Error Bounds Based on Worst Likely Assignments

Mar 31, 2015

Eric Bax

Figure 1 for Improved Error Bounds Based on Worst Likely Assignments

Figure 2 for Improved Error Bounds Based on Worst Likely Assignments

Figure 3 for Improved Error Bounds Based on Worst Likely Assignments

Figure 4 for Improved Error Bounds Based on Worst Likely Assignments

Abstract:Error bounds based on worst likely assignments use permutation tests to validate classifiers. Worst likely assignments can produce effective bounds even for data sets with 100 or fewer training examples. This paper introduces a statistic for use in the permutation tests of worst likely assignments that improves error bounds, especially for accurate classifiers, which are typically the classifiers of interest.

* IJCNN 2015

Via

Access Paper or Ask Questions