Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yannet Interian

Learning Interpretable Heuristics for WalkSAT

Jul 10, 2023

Yannet Interian, Sara Bernardini

Abstract:Local search algorithms are well-known methods for solving large, hard instances of the satisfiability problem (SAT). The performance of these algorithms crucially depends on heuristics for setting noise parameters and scoring variables. The optimal setting for these heuristics varies for different instance distributions. In this paper, we present an approach for learning effective variable scoring functions and noise parameters by using reinforcement learning. We consider satisfiability problems from different instance distributions and learn specialized heuristics for each of them. Our experimental results show improvements with respect to both a WalkSAT baseline and another local search learned heuristic.

Via

Access Paper or Ask Questions

Lockout: Sparse Regularization of Neural Networks

Jul 15, 2021

Gilmer Valdes, Wilmer Arbelo, Yannet Interian, Jerome H. Friedman

Figure 1 for Lockout: Sparse Regularization of Neural Networks

Figure 2 for Lockout: Sparse Regularization of Neural Networks

Figure 3 for Lockout: Sparse Regularization of Neural Networks

Figure 4 for Lockout: Sparse Regularization of Neural Networks

Abstract:Many regression and classification procedures fit a parameterized function $f(x;w)$ of predictor variables $x$ to data $\{x_{i},y_{i}\}_1^N$ based on some loss criterion $L(y,f)$. Often, regularization is applied to improve accuracy by placing a constraint $P(w)\leq t$ on the values of the parameters $w$. Although efficient methods exist for finding solutions to these constrained optimization problems for all values of $t\geq0$ in the special case when $f$ is a linear function, none are available when $f$ is non-linear (e.g. Neural Networks). Here we present a fast algorithm that provides all such solutions for any differentiable function $f$ and loss $L$, and any constraint $P$ that is an increasing monotone function of the absolute value of each parameter. Applications involving sparsity inducing regularization of arbitrary Neural Networks are discussed. Empirical results indicate that these sparse solutions are usually superior to their dense counterparts in both accuracy and interpretability. This improvement in accuracy can often make Neural Networks competitive with, and sometimes superior to, state-of-the-art methods in the analysis of tabular data.

Via

Access Paper or Ask Questions

Training Deep Learning models with small datasets

Dec 14, 2019

Miguel Romero, Yannet Interian, Timothy Solberg, Gilmer Valdes

Figure 1 for Training Deep Learning models with small datasets

Figure 2 for Training Deep Learning models with small datasets

Figure 3 for Training Deep Learning models with small datasets

Figure 4 for Training Deep Learning models with small datasets

Abstract:The growing use of Machine Learning has produced significant advances in many fields. For image-based tasks, however, the use of deep learning remains challenging in small datasets. In this article, we review, evaluate and compare the current state of the art techniques in training neural networks to elucidate which techniques work best for small datasets. We further propose a path forward for the improvement of model accuracy in medical imaging applications. We observed best results from one cycle training, discriminative learning rates with gradual freezing and parameter modification after transfer learning. We also established that when datasets are small, transfer learning plays an important role beyond parameter initialization by reusing previously learned features. Surprisingly we observed that there is little advantage in using pre-trained networks in images from another part of the body compared to Imagenet. On the contrary, if images from the same part of the body are available then transfer learning can produce a significant improvement in performance with as little as 50 images in the training data.

Via

Access Paper or Ask Questions

Conditional Super Learner

Dec 13, 2019

Gilmer Valdes, Yannet Interian, Efstathios D. Gennatas Mark J. Van der Laan

Abstract:In this article we consider the Conditional Super Learner (CSL), an algorithm which selects the best model candidate from a library conditional on the covariates. The CSL expands the idea of using cross-validation to select the best model and merges it with meta learning. Here we propose a specific algorithm that finds a local minimum to the problem posed, proof that it converges at a rate faster than Op(n^-1/4) and offers extensive empirical evidence that it is an excellent candidate to substitute stacking or for the analysis of Hierarchical problems.

Via

Access Paper or Ask Questions

ContamiNet: Detecting Contamination in Municipal Solid Waste

Nov 11, 2019

Khoury Ibrahim, Danielle A. Savage, Addie Schnirel, Paul Intrevado, Yannet Interian

Figure 1 for ContamiNet: Detecting Contamination in Municipal Solid Waste

Figure 2 for ContamiNet: Detecting Contamination in Municipal Solid Waste

Figure 3 for ContamiNet: Detecting Contamination in Municipal Solid Waste

Figure 4 for ContamiNet: Detecting Contamination in Municipal Solid Waste

Abstract:Leveraging over 30,000 images each with up to 89 labels collected by Recology---an integrated resource recovery company with both residential and commercial trash, recycling and composting services---the authors develop ContamiNet, a convolutional neural network, to identify contaminating material in residential recycling and compost bins. When training the model on a subset of labels that meet a minimum frequency threshold, ContamiNet preforms almost as well human experts in detecting contamination (0.86 versus 0.88 AUC). Recology is actively piloting ContamiNet in their daily municipal solid waste (MSW) collection to identify contaminants in recycling and compost bins to subsequently inform and educate customers about best sorting practices.

* 8 pages, 3 figures, ICMLA 2020

Via

Access Paper or Ask Questions