Dan Schmidt

Self-Play Learning Without a Reward Metric

Dec 16, 2019

Dan Schmidt, Nick Moran, Jonathan S. Rosenfeld, Jonathan Rosenthal, Jonathan Yedidia

Abstract: The AlphaZero algorithm for the learning of strategy games via self-play, which has produced superhuman ability in the games of Go, chess, and shogi, uses a quantitative reward function for game outcomes, requiring the users of the algorithm to explicitly balance different components of the reward against each other, such as the game winner and margin of victory. We present a modification to the AlphaZero algorithm that requires only a total ordering over game outcomes, obviating the need to perform any quantitative balancing of reward components. We demonstrate that this system learns optimal play in a comparable amount of time to AlphaZero on a sample game.
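
To illustrate what "only a total ordering over game outcomes" can look like in practice, the sketch below ranks outcomes lexicographically: any win beats any loss, and the margin only breaks ties within each group. The `Outcome` type, its fields, and `outcome_key` are hypothetical names chosen for this example; they are not taken from the paper, and the paper's actual ordering and training procedure may differ.

```python
from typing import NamedTuple


class Outcome(NamedTuple):
    """Hypothetical game outcome from one player's point of view."""
    won: bool
    margin: int  # size of the victory or defeat, always non-negative


def outcome_key(outcome: Outcome) -> tuple:
    """Sort key that defines a total order without weighting the components.

    Assumption for this sketch: a win is always better than a loss;
    among wins a larger margin is better, among losses a smaller one is.
    """
    signed_margin = outcome.margin if outcome.won else -outcome.margin
    return (outcome.won, signed_margin)


def rank_outcomes(outcomes):
    """Return the outcomes ordered from worst to best."""
    return sorted(outcomes, key=outcome_key)


if __name__ == "__main__":
    games = [Outcome(False, 10), Outcome(True, 1), Outcome(True, 5), Outcome(False, 2)]
    for o in rank_outcomes(games):
        print(o)
```

No numeric trade-off between winning and margin is ever chosen here; the learner would only need to compare two outcomes.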

* 6 pages, 4 figures

Noisier2Noise: Learning to Denoise from Unpaired Noisy Data

Oct 25, 2019

Nick Moran, Dan Schmidt, Yu Zhong, Patrick Coady

Abstract: We present a method for training a neural network to perform image denoising without access to clean training examples or access to paired noisy training examples. Our method requires only a single noisy realization of each training example and a statistical model of the noise distribution, and is applicable to a wide variety of noise models, including spatially structured noise. Our model produces results which are competitive with other learned methods which require richer training data, and outperforms traditional non-learned denoising methods. We present derivations of our method for arbitrary additive noise, an improvement specific to Gaussian additive noise, and an extension to multiplicative Bernoulli noise.
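
The abstract does not spell out the training recipe. One construction consistent with it for additive noise, sketched here as an assumption rather than as the authors' exact procedure, is to add a second draw from the noise model to each noisy sample, learn to predict the noisy sample from the noisier one, and then extrapolate with `2 * prediction - input`. In the toy below a scalar least-squares fit stands in for the neural network, the signal and noise are Gaussian, and every name (`fit_slopes`, `signal_std`, `noise_std`) is hypothetical.

```python
import random


def fit_slopes(n_samples, seed=0, signal_std=1.0, noise_std=0.5):
    """Compare a clean-data-free denoiser with an oracle fit on scalar data."""
    rng = random.Random(seed)
    szz = szy = szx = 0.0
    for _ in range(n_samples):
        x = rng.gauss(0.0, signal_std)     # clean value, never shown to the learner
        y = x + rng.gauss(0.0, noise_std)  # the single noisy realization we observe
        z = y + rng.gauss(0.0, noise_std)  # noisier copy made from the noise model
        szz += z * z
        szy += z * y
        szx += z * x

    a = szy / szz                 # least-squares predictor of y from z: y ~ a * z
    denoise_slope = 2.0 * a - 1.0  # 2 * E[y|z] - z, written as a slope on z
    oracle_slope = szx / szz       # fit against clean x, for comparison only
    return denoise_slope, oracle_slope


if __name__ == "__main__":
    denoised, oracle = fit_slopes(20000)
    print(f"learned without clean data: {denoised:.3f}, oracle: {oracle:.3f}")
```

The two slopes agree up to sampling error because the original and the added noise are identically distributed, so on average each accounts for half of the gap between the noisier input and the clean value.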

Monotone Learning with Rectified Wire Networks

Aug 24, 2018

Veit Elser, Dan Schmidt, Jonathan Yedidia

Abstract: We introduce a new neural network model, together with a tractable and monotone online learning algorithm. Our model describes feed-forward networks for classification, with one output node for each class. The only nonlinear operation is rectification using a ReLU function with a bias. However, there is a rectifier on every edge rather than at the nodes of the network. There are also weights, but these are positive, static, and associated with the nodes. Our "rectified wire" networks are able to represent arbitrary Boolean functions. Only the bias parameters, on the edges of the network, are learned. Another departure in our approach, from standard neural networks, is that the loss function is replaced by a constraint. This constraint is simply that the value of the output node associated with the correct class should be zero. Our model has the property that the exact norm-minimizing parameter update, required to correctly classify a training item, is the solution to a quadratic program that can be computed with a few passes through the network. We demonstrate a training algorithm using this update, called sequential deactivation (SDA), on MNIST and some synthetic datasets. Upon adopting a natural choice for the nodal weights, SDA has no hyperparameters other than those describing the network structure. Our experiments explore behavior with respect to network size and depth in a family of sparse expander networks.
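
A minimal forward-pass sketch may help make the "rectifier on every edge, weight on every node" description concrete. It rests on assumptions not stated in the abstract: each edge outputs `max(0, source_value - bias)`, each node multiplies the sum of its incoming edge outputs by its positive weight, and layers are densely connected (the paper's experiments use sparse expander networks). The function names and the data layout are hypothetical.

```python
def relu(value):
    """Rectifier applied on a single edge."""
    return max(0.0, value)


def forward(inputs, layers):
    """Propagate inputs through a layered rectified-wire-style network.

    layers is a list of (node_weights, edge_biases) pairs, where
    edge_biases[j][i] is the learned bias on the edge from source i to
    node j, and node_weights[j] is the fixed positive weight of node j.
    """
    values = list(inputs)
    for node_weights, edge_biases in layers:
        values = [
            weight * sum(relu(v - b) for v, b in zip(values, biases_into_node))
            for weight, biases_into_node in zip(node_weights, edge_biases)
        ]
    return values


if __name__ == "__main__":
    layer = ([0.5, 0.5], [[0.0, 1.0], [2.0, 3.0]])
    outputs = forward([1.0, 2.0], [layer])
    print(outputs)  # one value per class; the constraint asks for zero at the correct class
```

Under these assumptions, raising an edge bias can only lower the values downstream of that edge, which is one way to read the "monotone" and "deactivation" language in the abstract.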

* 41 pages, 21 figures, new experimental results, various improvements

Proactive Message Passing on Memory Factor Networks

Jan 18, 2016

Patrick Eschenfeldt, Dan Schmidt, Stark Draper, Jonathan Yedidia

Abstract: We introduce a new type of graphical model that we call a "memory factor network" (MFN). We show how to use MFNs to model the structure inherent in many types of data sets. We also introduce an associated message-passing style algorithm called "proactive message passing" (PMP) that performs inference on MFNs. PMP comes with convergence guarantees and is efficient in comparison to competing algorithms such as variants of belief propagation. We specialize MFNs and PMP to a number of distinct types of data (discrete, continuous, labelled) and inference problems (interpolation, hypothesis testing), provide examples, and discuss approaches for efficient implementation.

* 35 pages, 13 figures
