Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Luofeng Liao

Local AdaGrad-Type Algorithm for Stochastic Convex-Concave Minimax Problems

Jun 18, 2021

Luofeng Liao, Li Shen, Jia Duan, Mladen Kolar, Dacheng Tao

Figure 1 for Local AdaGrad-Type Algorithm for Stochastic Convex-Concave Minimax Problems

Figure 2 for Local AdaGrad-Type Algorithm for Stochastic Convex-Concave Minimax Problems

Figure 3 for Local AdaGrad-Type Algorithm for Stochastic Convex-Concave Minimax Problems

Figure 4 for Local AdaGrad-Type Algorithm for Stochastic Convex-Concave Minimax Problems

Abstract:Large scale convex-concave minimax problems arise in numerous applications, including game theory, robust training, and training of generative adversarial networks. Despite their wide applicability, solving such problems efficiently and effectively is challenging in the presence of large amounts of data using existing stochastic minimax methods. We study a class of stochastic minimax methods and develop a communication-efficient distributed stochastic extragradient algorithm, LocalAdaSEG, with an adaptive learning rate suitable for solving convex-concave minimax problem in the Parameter-Server model. LocalAdaSEG has three main features: (i) periodic communication strategy reduces the communication cost between workers and the server; (ii) an adaptive learning rate that is computed locally and allows for tuning-free implementation; and (iii) theoretically, a nearly linear speed-up with respect to the dominant variance term, arising from estimation of the stochastic gradient, is proven in both the smooth and nonsmooth convex-concave settings. LocalAdaSEG is used to solve a stochastic bilinear game, and train generative adversarial network. We compare LocalAdaSEG against several existing optimizers for minimax problems and demonstrate its efficacy through several experiments in both the homogeneous and heterogeneous settings.

* 24 pages

Via

Access Paper or Ask Questions

Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning

Feb 19, 2021

Luofeng Liao, Zuyue Fu, Zhuoran Yang, Mladen Kolar, Zhaoran Wang

Figure 1 for Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning

Figure 2 for Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning

Abstract:In offline reinforcement learning (RL) an optimal policy is learnt solely from a priori collected observational data. However, in observational data, actions are often confounded by unobserved variables. Instrumental variables (IVs), in the context of RL, are the variables whose influence on the state variables are all mediated through the action. When a valid instrument is present, we can recover the confounded transition dynamics through observational data. We study a confounded Markov decision process where the transition dynamics admit an additive nonlinear functional form. Using IVs, we derive a conditional moment restriction (CMR) through which we can identify transition dynamics based on observational data. We propose a provably efficient IV-aided Value Iteration (IVVI) algorithm based on a primal-dual reformulation of CMR. To the best of our knowledge, this is the first provably efficient algorithm for instrument-aided offline RL.

* under review

Via

Access Paper or Ask Questions

Provably Efficient Neural Estimation of Structural Equation Model: An Adversarial Approach

Jul 02, 2020

Luofeng Liao, You-Lin Chen, Zhuoran Yang, Bo Dai, Zhaoran Wang, Mladen Kolar

Figure 1 for Provably Efficient Neural Estimation of Structural Equation Model: An Adversarial Approach

Figure 2 for Provably Efficient Neural Estimation of Structural Equation Model: An Adversarial Approach

Abstract:Structural equation models (SEMs) are widely used in sciences, ranging from economics to psychology, to uncover causal relationships underlying a complex system under consideration and estimate structural parameters of interest. We study estimation in a class of generalized SEMs where the object of interest is defined as the solution to a linear operator equation. We formulate the linear operator equation as a min-max game, where both players are parameterized by neural networks (NNs), and learn the parameters of these neural networks using the stochastic gradient descent. We consider both 2-layer and multi-layer NNs with ReLU activation functions and prove global convergence in an overparametrized regime, where the number of neurons is diverging. The results are established using techniques from online learning and local linearization of NNs, and improve in several aspects the current state-of-the-art. For the first time we provide a tractable estimation procedure for SEMs based on NNs with provable convergence and without the need for sample splitting.

* Submitted to NeurIPS 2020. Under review

Via

Access Paper or Ask Questions