Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Stochastic Optimization of Sorting Networks via Continuous Relaxations

Mar 21, 2019

Aditya Grover, Eric Wang, Aaron Zweig, Stefano Ermon

Figure 1 for Stochastic Optimization of Sorting Networks via Continuous Relaxations

Figure 2 for Stochastic Optimization of Sorting Networks via Continuous Relaxations

Figure 3 for Stochastic Optimization of Sorting Networks via Continuous Relaxations

Figure 4 for Stochastic Optimization of Sorting Networks via Continuous Relaxations

Share this with someone who'll enjoy it:

Abstract:Sorting input objects is an important step in many machine learning pipelines. However, the sorting operator is non-differentiable with respect to its inputs, which prohibits end-to-end gradient-based optimization. In this work, we propose NeuralSort, a general-purpose continuous relaxation of the output of the sorting operator from permutation matrices to the set of unimodal row-stochastic matrices, where every row sums to one and has a distinct arg max. This relaxation permits straight-through optimization of any computational graph involve a sorting operation. Further, we use this relaxation to enable gradient-based stochastic optimization over the combinatorially large space of permutations by deriving a reparameterized gradient estimator for the Plackett-Luce family of distributions over permutations. We demonstrate the usefulness of our framework on three tasks that require learning semantic orderings of high-dimensional objects, including a fully differentiable, parameterized extension of the k-nearest neighbors algorithm.

* ICLR 2019

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Stochastic Optimization of Sorting Networks via Continuous Relaxations

Paper and Code