Duke University
Abstract: We present a probabilistic framework for nonlinearities, based on doubly truncated Gaussian distributions. By setting the truncation points appropriately, we can generate various types of nonlinearities within a unified framework, including sigmoid, tanh and ReLU, the most commonly used nonlinearities in neural networks. The framework readily integrates into existing stochastic neural networks (with hidden units characterized as random variables), allowing the nonlinearities to be learned alongside the model weights in these networks for the first time. Extensive experiments demonstrate the performance improvements brought about by the proposed framework when integrated with the restricted Boltzmann machine (RBM), temporal RBM and the truncated Gaussian graphical model (TGGM).
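The claim that truncation points select the nonlinearity can be checked numerically. Below is a minimal sketch using scipy's truncnorm; the specific noise scales are illustrative assumptions, but the mapping of truncation intervals to activation shapes follows the abstract: [0, ∞) yields a ReLU-like response, [0, 1] a sigmoid-like squashing, and [-1, 1] a tanh-like one.

```python
import numpy as np
from scipy.stats import truncnorm

def truncated_gaussian_mean(mu, sigma, a, b):
    """Mean of N(mu, sigma^2) doubly truncated to [a, b]."""
    alpha, beta = (a - mu) / sigma, (b - mu) / sigma
    return truncnorm.mean(alpha, beta, loc=mu, scale=sigma)

pre_activations = np.linspace(-4.0, 4.0, 9)
relu_like = [truncated_gaussian_mean(m, 0.5, 0.0, np.inf) for m in pre_activations]
sigmoid_like = [truncated_gaussian_mean(m, 1.0, 0.0, 1.0) for m in pre_activations]
tanh_like = [truncated_gaussian_mean(m, 1.0, -1.0, 1.0) for m in pre_activations]
```

Treating the truncation points (and noise scale) as continuous parameters is what makes it possible to learn the nonlinearity jointly with the weights.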
Abstract: We introduce the truncated Gaussian graphical model (TGGM) as a novel framework for designing statistical models for nonlinear learning. A TGGM is a Gaussian graphical model (GGM) with a subset of variables truncated to be nonnegative. The truncated variables are assumed latent and integrated out to induce a marginal model. We show that the variables in the marginal model are non-Gaussian distributed and their expected relations are nonlinear. We use expectation-maximization to break the inference of the nonlinear model into a sequence of TGGM inference problems, each of which is solved efficiently using the properties and numerical methods of multivariate Gaussian distributions. We use the TGGM to design models for nonlinear regression and classification, demonstrating their performance on extensive benchmark datasets and comparing against state-of-the-art results.
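To make "expected relations are nonlinear" concrete, consider the conditional of a nonnegatively truncated variable; the notation below is an illustrative sketch, not the paper's exact parameterization. With hidden variables $\mathbf{z}$ truncated to $[0,\infty)$ and conditional mean $\boldsymbol{\mu} = \mathbf{W}\mathbf{x} + \mathbf{b}$,
$$ p(\mathbf{z}\mid\mathbf{x}) \propto \mathcal{N}(\mathbf{z};\,\boldsymbol{\mu},\,\sigma^2\mathbf{I})\,\mathbb{1}[\mathbf{z}\ge \mathbf{0}], \qquad \mathbb{E}[z_i\mid\mathbf{x}] = \mu_i + \sigma\,\frac{\phi(\mu_i/\sigma)}{\Phi(\mu_i/\sigma)}, $$
where $\phi$ and $\Phi$ are the standard normal pdf and cdf. The expectation behaves like a smoothed $\max(0,\mu_i)$, so integrating out the truncated variables leaves a marginal model whose expected input-output map is nonlinear, while each EM step reduces to (truncated) Gaussian computations.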
Abstract: Gaussian graphical models (GGMs) are widely used for statistical modeling because of their ease of inference and the ubiquitous use of the normal distribution in practical approximations. However, they are also known for their limited modeling abilities, due to the Gaussian assumption. In this paper, we introduce a novel variant of GGMs that relaxes the Gaussian restriction yet admits efficient inference. Specifically, we impose a bipartite structure on the GGM and govern the hidden variables by truncated normal distributions. The nonlinearity of the model is revealed by its connection to rectified linear unit (ReLU) neural networks. Meanwhile, thanks to the bipartite structure and the appealing properties of truncated normals, we are able to train the models efficiently using contrastive divergence. We consider three output constructs, accounting for real-valued, binary and count data. We further extend the model to deep constructions and show that deep models can be used for unsupervised pre-training of rectifier neural networks. Extensive experimental results validate the proposed models and demonstrate their superiority over competing models.
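Because the bipartite structure makes the hidden variables conditionally independent truncated normals given the visibles, a contrastive-divergence step can be sketched directly. The conditional forms, unit variances, and the real-valued output construct below are illustrative assumptions, not the paper's exact parameterization:

```python
import numpy as np
from scipy.stats import truncnorm

rng = np.random.default_rng(0)

def sample_hidden(v, W, b):
    # h | v assumed normal with mean v @ W + b, truncated to [0, inf)
    mu = v @ W + b
    return truncnorm.rvs(-mu, np.inf, loc=mu, scale=1.0, random_state=rng)

def cd1_step(v_data, W, b, c, lr=1e-3):
    # One CD-1 update; visibles assumed conditionally Gaussian
    # with mean h @ W.T + c (real-valued output construct).
    h_data = sample_hidden(v_data, W, b)
    v_model = h_data @ W.T + c + rng.standard_normal(v_data.shape)
    h_model = sample_hidden(v_model, W, b)
    n = len(v_data)
    # positive-phase minus negative-phase sufficient statistics
    W += lr * (v_data.T @ h_data - v_model.T @ h_model) / n
    b += lr * (h_data - h_model).mean(axis=0)
    c += lr * (v_data - v_model).mean(axis=0)
    return W, b, c
```

Here W has shape (num_visible, num_hidden); the conditional independence afforded by the bipartite structure is what makes both sampling steps a single vectorized truncated-normal draw.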
Abstract: We utilize copulas to construct a unified framework for building and optimizing variational proposals in hierarchical Bayesian models. For models with continuous and non-Gaussian hidden variables, we propose a semiparametric and automated variational Gaussian copula approach, in which the parametric Gaussian copula family preserves multivariate posterior dependence, while nonparametric transformations based on Bernstein polynomials provide ample flexibility in characterizing the univariate marginal posteriors.
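By Sklar's theorem, such a proposal factorizes into a dependence part and marginals. A sketch of the Gaussian copula form (symbols generic, not the paper's notation):
$$ q(\mathbf{x}) = c_{\boldsymbol{\Upsilon}}\big(F_1(x_1),\dots,F_d(x_d)\big)\prod_{i=1}^{d} f_i(x_i), \qquad c_{\boldsymbol{\Upsilon}}(\mathbf{u}) = \frac{\mathcal{N}\big(\Phi^{-1}(u_1),\dots,\Phi^{-1}(u_d);\,\mathbf{0},\,\boldsymbol{\Upsilon}\big)}{\prod_{i=1}^{d}\phi\big(\Phi^{-1}(u_i)\big)}, $$
where the correlation matrix $\boldsymbol{\Upsilon}$ carries the multivariate posterior dependence, and each univariate marginal on $[0,1]$ (after a suitable transformation) can be represented in the standard Bernstein-polynomial (Beta-mixture) form, e.g. $f_i(u) = \sum_{k=1}^{K} w_{ik}\,\mathrm{Beta}(u;\,k,\,K-k+1)$ with simplex weights, giving the marginal flexibility noted in the abstract.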
Abstract: Expectation maximization (EM) has recently been shown to be an efficient algorithm for learning finite-state controllers (FSCs) in large decentralized POMDPs (Dec-POMDPs). However, current methods use fixed-size FSCs and often converge to maxima that are far from optimal. This paper considers a variable-size FSC to represent the local policy of each agent. These variable-size FSCs are constructed using a stick-breaking prior, leading to a new framework called decentralized stick-breaking policy representation (Dec-SBPR). This approach learns the controller parameters with a variational Bayesian algorithm, without having to assume that the Dec-POMDP model is available. The performance of Dec-SBPR is demonstrated on several benchmark problems, showing that the algorithm scales to large problems while outperforming other state-of-the-art methods.
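For reference, the stick-breaking construction used in such priors (standard form; notation generic) draws
$$ v_k \sim \mathrm{Beta}(1,\alpha), \qquad \pi_k = v_k \prod_{j=1}^{k-1}(1 - v_j), \quad k = 1, 2, \dots, $$
so that infinitely many controller nodes are available a priori while only finitely many receive appreciable mass; the effective FSC size is thus inferred from data rather than fixed in advance.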
Abstract: A simple and inexpensive (low-power and low-bandwidth) modification is made to a conventional off-the-shelf color video camera, from which we recover multiple color frames for each of the original measured frames, with each recovered frame focused at a different depth. The recovery of multiple frames per measured frame is made possible via high-speed coding, manifested via translation of a single coded aperture; the inexpensive translation is achieved by mounting the binary code on a piezoelectric device. To simultaneously recover depth information, a liquid lens is modulated at high speed via a variable voltage. Consequently, during the aforementioned coding process, the liquid lens allows the camera to sweep the focus through multiple depths. In addition to designing and implementing the camera, fast recovery is achieved by an anytime algorithm exploiting the group sparsity of wavelet/DCT coefficients.
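A sketch of the coded-snapshot measurement model consistent with this design (symbols generic): each measured frame $\mathbf{Y}$ integrates $T$ high-speed frames $\mathbf{X}_t$, each at a different focal depth and each modulated by the translated binary code $\mathbf{C}_t$,
$$ \mathbf{Y} = \sum_{t=1}^{T} \mathbf{C}_t \odot \mathbf{X}_t + \mathbf{E}, $$
with $\odot$ denoting elementwise multiplication and $\mathbf{E}$ measurement noise; the anytime algorithm inverts this underdetermined system by exploiting the group sparsity of the frames' wavelet/DCT coefficients.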
Abstract: This paper introduces the concept of adaptive temporal compressive sensing (CS) for video. We propose a CS algorithm that adapts the compression ratio based on the scene's temporal complexity, computed from the compressed data, without compromising the quality of the reconstructed video. The temporal adaptivity is manifested by manipulating the integration time of the camera, opening the possibility of real-time implementation. The proposed algorithm is a generalized temporal CS approach that can be incorporated with a diverse set of existing hardware systems.
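A minimal sketch of the adaptive loop described above; the complexity statistic and thresholds are illustrative assumptions, not the paper's rule, but they show how the integration time (frames per snapshot) can be adjusted from compressed data alone.

```python
import numpy as np

def temporal_complexity(y_prev, y_curr):
    """Proxy for scene motion computed from compressed snapshots (assumed statistic)."""
    return np.abs(y_curr - y_prev).mean() / (np.abs(y_prev).mean() + 1e-8)

def adapt_frames_per_snapshot(T, score, lo=0.05, hi=0.2, T_min=4, T_max=32):
    """Integrate more frames per snapshot when the scene is static,
    fewer when it is dynamic, by adjusting the camera integration time."""
    if score > hi:
        return max(T_min, T // 2)  # dynamic scene: lower compression ratio
    if score < lo:
        return min(T_max, T * 2)   # static scene: raise compression ratio
    return T
```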
Abstract: We use mechanical translation of a coded aperture for code division multiple access (CDMA) compression of video. We present experimental results for reconstruction at 148 frames per coded snapshot.
Abstract: Learning multiple tasks across heterogeneous domains is a challenging problem, since the feature space may not be the same for different tasks. We assume the data in multiple tasks are generated from a common latent domain via sparse domain transforms, and propose a latent probit model (LPM) to jointly learn the domain transforms and the shared probit classifier in the common domain. To learn meaningful task relatedness and avoid over-fitting in classification, we introduce sparsity in the domain-transform matrices, as well as in the common classifier. We derive theoretical bounds for the estimation error of the classifier in terms of the sparsity of the domain transforms. An expectation-maximization algorithm is derived for learning the LPM. The effectiveness of the approach is demonstrated on several real datasets.
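A sketch of the generative structure described above (symbols generic, not the paper's notation): features in task $t$ arise from a shared latent representation $\mathbf{s}$ via a sparse transform, and labels follow a probit link in the common domain,
$$ \mathbf{x}_t = \mathbf{A}_t\,\mathbf{s} + \boldsymbol{\epsilon}_t, \qquad p(y = 1 \mid \mathbf{s}) = \Phi\big(\mathbf{w}^\top \mathbf{s}\big), $$
with sparsity imposed on the transforms $\mathbf{A}_t$ and on the shared classifier $\mathbf{w}$; EM alternates between inferring the latent representation and updating $(\mathbf{A}_t, \mathbf{w})$, and the estimation-error bounds are stated in terms of the sparsity of the $\mathbf{A}_t$.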