Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Alanna Tempest

Coarsening Optimization for Differentiable Programming

Oct 05, 2021

Xipeng Shen, Guoqiang Zhang, Irene Dea, Samantha Andow, Emilio Arroyo-Fang, Neal Gafter, Johann George, Melissa Grueter, Erik Meijer, Steffi Stumpos(+3 more)

Figure 1 for Coarsening Optimization for Differentiable Programming

Figure 2 for Coarsening Optimization for Differentiable Programming

Figure 3 for Coarsening Optimization for Differentiable Programming

Figure 4 for Coarsening Optimization for Differentiable Programming

Abstract:This paper presents a novel optimization for differentiable programming named coarsening optimization. It offers a systematic way to synergize symbolic differentiation and algorithmic differentiation (AD). Through it, the granularity of the computations differentiated by each step in AD can become much larger than a single operation, and hence lead to much reduced runtime computations and data allocations in AD. To circumvent the difficulties that control flow creates to symbolic differentiation in coarsening, this work introduces phi-calculus, a novel method to allow symbolic reasoning and differentiation of computations that involve branches and loops. It further avoids "expression swell" in symbolic differentiation and balance reuse and coarsening through the design of reuse-centric segment of interest identification. Experiments on a collection of real-world applications show that coarsening optimization is effective in speeding up AD, producing several times to two orders of magnitude speedups.

* This is the preprint of a paper to be published at OOPSLA'2021

Via

Access Paper or Ask Questions

Gradient Descent: The Ultimate Optimizer

Sep 29, 2019

Kartik Chandra, Erik Meijer, Samantha Andow, Emilio Arroyo-Fang, Irene Dea, Johann George, Melissa Grueter, Basil Hosmer, Steffi Stumpos, Alanna Tempest(+1 more)

Figure 1 for Gradient Descent: The Ultimate Optimizer

Figure 2 for Gradient Descent: The Ultimate Optimizer

Figure 3 for Gradient Descent: The Ultimate Optimizer

Figure 4 for Gradient Descent: The Ultimate Optimizer

Abstract:Working with any gradient-based machine learning algorithm involves the tedious task of tuning the optimizer's hyperparameters, such as the learning rate. There exist many techniques for automated hyperparameter optimization, but they typically introduce even more hyperparameters to control the hyperparameter optimization process. We propose to instead learn the hyperparameters themselves by gradient descent, and furthermore to learn the hyper-hyperparameters by gradient descent as well, and so on ad infinitum. As these towers of gradient-based optimizers grow, they become significantly less sensitive to the choice of top-level hyperparameters, hence decreasing the burden on the user to search for optimal values.

Via

Access Paper or Ask Questions