Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Václav Mácha

DeepTopPush: Simple and Scalable Method for Accuracy at the Top

Jun 22, 2020

Lukáš Adam, Václav Mácha, Václav Šmídl

Figure 1 for DeepTopPush: Simple and Scalable Method for Accuracy at the Top

Figure 2 for DeepTopPush: Simple and Scalable Method for Accuracy at the Top

Figure 3 for DeepTopPush: Simple and Scalable Method for Accuracy at the Top

Figure 4 for DeepTopPush: Simple and Scalable Method for Accuracy at the Top

Abstract:Accuracy at the top is a special class of binary classification problems where the performance is evaluated only on a small number of relevant (top) samples. Applications include information retrieval systems or processes with manual (expensive) postprocessing. This leads to the minimization of irrelevant samples above a threshold. We consider classifiers in the form of an arbitrary (deep) network and propose a new method DeepTopPush for minimizing the top loss function. Since the threshold depends on all samples, the problem is non-decomposable. We modify the stochastic gradient descent to handle the non-decomposability in an end-to-end training manner and propose a way to estimate the threshold only from values on the current minibatch. We demonstrate the good performance of DeepTopPush on visual recognition datasets and on a real-world application of selecting a small number of molecules for further drug testing.

Via

Access Paper or Ask Questions

Nonlinear classifiers for ranking problems based on kernelized SVM

Feb 26, 2020

Václav Mácha, Lukáš Adam, Václav Šmídl

Figure 1 for Nonlinear classifiers for ranking problems based on kernelized SVM

Figure 2 for Nonlinear classifiers for ranking problems based on kernelized SVM

Figure 3 for Nonlinear classifiers for ranking problems based on kernelized SVM

Figure 4 for Nonlinear classifiers for ranking problems based on kernelized SVM

Abstract:Many classification problems focus on maximizing the performance only on the samples with the highest relevance instead of all samples. As an example, we can mention ranking problems, accuracy at the top or search engines where only the top few queries matter. In our previous work, we derived a general framework including several classes of these linear classification problems. In this paper, we extend the framework to nonlinear classifiers. Utilizing a similarity to SVM, we dualize the problems, add kernels and propose a componentwise dual ascent method. This allows us to perform one iteration in less than 20 milliseconds on relatively large datasets such as FashionMNIST.

Via

Access Paper or Ask Questions

General Framework for Binary Classification on Top Samples

Feb 25, 2020

Lukáš Adam, Václav Mácha, Václav Šmídl, Tomáš Pevný

Figure 1 for General Framework for Binary Classification on Top Samples

Figure 2 for General Framework for Binary Classification on Top Samples

Figure 3 for General Framework for Binary Classification on Top Samples

Figure 4 for General Framework for Binary Classification on Top Samples

Abstract:Many binary classification problems minimize misclassification above (or below) a threshold. We show that instances of ranking problems, accuracy at the top or hypothesis testing may be written in this form. We propose a general framework to handle these classes of problems and show which known methods (both known and newly proposed) fall into this framework. We provide a theoretical analysis of this framework and mention selected possible pitfalls the methods may encounter. We suggest several numerical improvements including the implicit derivative and stochastic gradient descent. We provide an extensive numerical study. Based both on the theoretical properties and numerical experiments, we conclude the paper by suggesting which method should be used in which situation.

Via

Access Paper or Ask Questions