Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tom J. Viering

Making Learners (More) Monotone

Nov 25, 2019

Tom J. Viering, Alexander Mey, Marco Loog

Figure 1 for Making Learners (More) Monotone

Figure 2 for Making Learners (More) Monotone

Figure 3 for Making Learners (More) Monotone

Figure 4 for Making Learners (More) Monotone

Abstract:Learning performance can show non-monotonic behavior. That is, more data does not necessarily lead to better models, even on average. We propose three algorithms that take a supervised learning model and make it perform more monotone. We prove consistency and monotonicity with high probability, and evaluate the algorithms on scenarios where non-monotone behaviour occurs. Our proposed algorithm $\text{MT}_{\text{HT}}$ makes less than $1\%$ non-monotone decisions on MNIST while staying competitive in terms of error rate compared to several baselines.

Via

Access Paper or Ask Questions

Nuclear Discrepancy for Active Learning

Jun 08, 2017

Tom J. Viering, Jesse H. Krijthe, Marco Loog

Figure 1 for Nuclear Discrepancy for Active Learning

Figure 2 for Nuclear Discrepancy for Active Learning

Figure 3 for Nuclear Discrepancy for Active Learning

Figure 4 for Nuclear Discrepancy for Active Learning

Abstract:Active learning algorithms propose which unlabeled objects should be queried for their labels to improve a predictive model the most. We study active learners that minimize generalization bounds and uncover relationships between these bounds that lead to an improved approach to active learning. In particular we show the relation between the bound of the state-of-the-art Maximum Mean Discrepancy (MMD) active learner, the bound of the Discrepancy, and a new and looser bound that we refer to as the Nuclear Discrepancy bound. We motivate this bound by a probabilistic argument: we show it considers situations which are more likely to occur. Our experiments indicate that active learning using the tightest Discrepancy bound performs the worst in terms of the squared loss. Overall, our proposed loosest Nuclear Discrepancy generalization bound performs the best. We confirm our probabilistic argument empirically: the other bounds focus on more pessimistic scenarios that are rarer in practice. We conclude that tightness of bounds is not always of main importance and that active learning methods should concentrate on realistic scenarios in order to improve performance.

* 32 pages, 5 figures, 4 tables

Via

Access Paper or Ask Questions