Maksym Byshkin

CoolMomentum: A Method for Stochastic Optimization by Langevin Dynamics with Simulated Annealing

May 29, 2020

Oleksandr Borysenko, Maksym Byshkin

Abstract: Deep learning applications require optimization of nonconvex objective functions. These functions have multiple local minima and their optimization is a challenging problem. Simulated Annealing is a well-established method for optimization of such functions, but its efficiency depends on the efficiency of the adapted sampling methods. We explore relations between the Langevin dynamics and stochastic optimization. By combining the Momentum optimizer with Simulated Annealing, we propose CoolMomentum, a prospective stochastic optimization method. Empirical results confirm the efficiency of the proposed theoretical approach.

* 9 pages, 6 figures

A Simple Algorithm for Scalable Monte Carlo Inference

Jan 15, 2019

Alexander Borisenko, Maksym Byshkin, Alessandro Lomi

Abstract: Statistical inference involves estimation of parameters of a model based on observations. Building on the recently proposed Equilibrium Expectation approach and Persistent Contrastive Divergence, we derive a simple and fast Markov chain Monte Carlo algorithm for maximum likelihood estimation (MLE) of parameters of exponential family distributions. The algorithm has good scaling properties and is suitable for Monte Carlo inference on large network data with billions of tie variables. The performance of the algorithm is demonstrated on Markov random fields, conditional random fields, exponential random graph models and Boltzmann machines.

* 10 pages with 5 figures and references

Fast Maximum Likelihood estimation via Equilibrium Expectation for Large Network Data

Aug 01, 2018

Maksym Byshkin, Alex Stivala, Antonietta Mira, Garry Robins, Alessandro Lomi

Abstract: A major line of contemporary research on complex networks is based on the development of statistical models that specify the local motifs associated with macro-structural properties observed in actual networks. This statistical approach becomes increasingly problematic as network size increases. In the context of current research on efficient estimation of models for large network data sets, we propose a fast algorithm for maximum likelihood estimation (MLE) that affords a significant increase in the size of networks amenable to direct empirical analysis. The algorithm we propose in this paper relies on properties of Markov chains at equilibrium, and for this reason it is called equilibrium expectation (EE). We demonstrate the performance of the EE algorithm in the context of exponential random graph models (ERGMs), a family of statistical models commonly used in empirical research based on network data observed at a single period in time. Thus far, the lack of efficient computational strategies has limited the empirical scope of ERGMs to relatively small networks with a few thousand nodes. The approach we propose allows a dramatic increase in the size of networks that may be analyzed using ERGMs. This is illustrated in an analysis of several biological networks and one social network with 104,103 nodes.

* Scientific Reports | (2018) 8:11509 https://www.nature.com/articles/s41598-018-29725-8
* Final version
