Zac Pullar-Strecker

Memento: Facilitating Effortless, Efficient, and Reliable ML Experiments

Apr 17, 2023

Zac Pullar-Strecker, Xinglong Chang, Liam Brydon, Ioannis Ziogas, Katharina Dost, Jörg Wicker

Abstract: Running complex sets of machine learning experiments is challenging and time-consuming due to the lack of a unified framework. This leaves researchers forced to spend time implementing necessary features such as parallelization, caching, and checkpointing themselves instead of focussing on their project. To simplify the process, in this paper, we introduce Memento, a Python package that is designed to aid researchers and data scientists in the efficient management and execution of computationally intensive experiments. Memento has the capacity to streamline any experimental pipeline by providing a straightforward configuration matrix and the ability to concurrently run experiments across multiple threads. A demonstration of Memento is available at: https://wickerlab.org/publication/memento.
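The idea of a configuration matrix run across multiple threads can be illustrated with a minimal sketch using only the Python standard library. This is not Memento's API: the names `matrix`, `expand`, `run_experiment`, and `run_all` are hypothetical, the experiment body is a placeholder, and the in-memory cache merely stands in for the caching feature the abstract mentions.

```python
import itertools
from concurrent.futures import ThreadPoolExecutor

# Hypothetical configuration matrix: each combination of values is one experiment.
matrix = {
    "dataset": ["iris", "wine"],
    "model": ["svm", "tree"],
    "seed": [0, 1],
}

def expand(matrix):
    """Return one dict per combination in the Cartesian product of the matrix."""
    keys = list(matrix)
    return [dict(zip(keys, values)) for values in itertools.product(*matrix.values())]

_cache = {}  # naive in-memory cache keyed by configuration (assumption, for illustration)

def run_experiment(config):
    """Placeholder experiment; a real one would train and evaluate a model."""
    key = tuple(sorted(config.items()))
    if key not in _cache:
        _cache[key] = f"{config['model']} on {config['dataset']} (seed {config['seed']})"
    return _cache[key]

def run_all(matrix, max_workers=4):
    """Run every configuration on a thread pool, keeping results in matrix order."""
    configs = expand(matrix)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        results = list(pool.map(run_experiment, configs))
    return list(zip(configs, results))
```

Calling `run_all(matrix)` returns one `(config, result)` pair for each of the eight combinations. Because `pool.map` preserves input order, results line up with the expanded configurations even though the work runs concurrently.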


Hitting the Target: Stopping Active Learning at the Cost-Based Optimum

Oct 07, 2021

Zac Pullar-Strecker, Katharina Dost, Eibe Frank, Jörg Wicker


Abstract: Active learning allows machine learning models to be trained using fewer labels while retaining similar performance to traditional fully supervised learning. An active learner selects the most informative data points, requests their labels, and retrains itself. While this approach is promising, it leaves an open problem of how to determine when the model is 'good enough' without the additional labels required for traditional evaluation. In the past, different stopping criteria have been proposed aiming to identify the optimal stopping point. However, optimality can only be expressed as a domain-dependent trade-off between accuracy and the number of labels, and no criterion is superior in all applications. This paper is the first to give actionable advice to practitioners on what stopping criteria they should use in a given real-world scenario. We contribute the first large-scale comparison of stopping criteria, using a cost measure to quantify the accuracy/label trade-off, public implementations of all stopping criteria we evaluate, and an open-source framework for evaluating stopping criteria. Our research enables practitioners to substantially reduce labelling costs by utilizing the stopping criterion which best suits their domain.

* 39 pages, 26 figures
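The accuracy/label trade-off described in the abstract can be illustrated with a small sketch. The linear cost below is an assumption chosen for illustration and is not the cost measure defined in the paper; `total_cost`, `cheapest_stopping_point`, `label_cost`, and `error_cost` are hypothetical names, and the learning curve is invented toy data rather than a result from the paper.

```python
def total_cost(n_labels, accuracy, label_cost, error_cost):
    """Assumed linear cost: price of the labels plus a penalty for remaining error."""
    return n_labels * label_cost + (1.0 - accuracy) * error_cost

def cheapest_stopping_point(curve, label_cost, error_cost):
    """Return the (n_labels, accuracy) point on the curve with the lowest assumed cost."""
    return min(curve, key=lambda p: total_cost(p[0], p[1], label_cost, error_cost))

# Invented toy learning curve: (labels requested so far, accuracy at that point).
curve = [(10, 0.60), (50, 0.80), (100, 0.88), (200, 0.90), (400, 0.91)]

# When errors are relatively cheap, stopping early is preferred.
cheap_errors = cheapest_stopping_point(curve, label_cost=1, error_cost=1000)

# When errors are expensive, buying more labels pays off.
costly_errors = cheapest_stopping_point(curve, label_cost=1, error_cost=10000)
```

Under these assumed costs the preferred stopping point moves from 100 labels to 200 labels as errors become more expensive, which reflects the abstract's point that the optimum is domain-dependent. A stopping criterion does not have access to the true accuracy, so this kind of calculation is only possible in a retrospective evaluation.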
