Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Guy Lebanon

Georgia Institute of Technology

Local Space-Time Smoothing for Version Controlled Documents

Aug 08, 2013

Seungyeon Kim, Guy Lebanon

Figure 1 for Local Space-Time Smoothing for Version Controlled Documents

Figure 2 for Local Space-Time Smoothing for Version Controlled Documents

Figure 3 for Local Space-Time Smoothing for Version Controlled Documents

Figure 4 for Local Space-Time Smoothing for Version Controlled Documents

Abstract:Unlike static documents, version controlled documents are continuously edited by one or more authors. Such collaborative revision process makes traditional modeling and visualization techniques inappropriate. In this paper we propose a new representation based on local space-time smoothing that captures important revision patterns. We demonstrate the applicability of our framework using experiments on synthetic and real-world data.

* Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010); 2010 Aug 23-27; Beijing, CN
* 9 pages, 6 figures

Via

Access Paper or Ask Questions

Beyond Sentiment: The Manifold of Human Emotions

Aug 08, 2013

Seungyeon Kim, Fuxin Li, Guy Lebanon, Irfan Essa

Figure 1 for Beyond Sentiment: The Manifold of Human Emotions

Figure 2 for Beyond Sentiment: The Manifold of Human Emotions

Figure 3 for Beyond Sentiment: The Manifold of Human Emotions

Figure 4 for Beyond Sentiment: The Manifold of Human Emotions

Abstract:Sentiment analysis predicts the presence of positive or negative emotions in a text document. In this paper we consider higher dimensional extensions of the sentiment concept, which represent a richer set of human emotions. Our approach goes beyond previous work in that our model contains a continuous manifold rather than a finite set of human emotions. We investigate the resulting model, compare it to psychological observations, and explore its predictive capabilities. Besides obtaining significant improvements over a baseline without manifold, we are also able to visualize different notions of positive sentiment in different domains.

* Proceedings of the 16 International Conference on Artificial Intelligence and Statistics (AISTATS) 2013, Scottsdale, AZ, USA. Volume 31 of JMLR: W&CP 31
* 15 pages, 7 figures

Via

Access Paper or Ask Questions

A Linear Approximation to the chi^2 Kernel with Geometric Convergence

Jun 12, 2013

Fuxin Li, Guy Lebanon, Cristian Sminchisescu

Figure 1 for A Linear Approximation to the chi^2 Kernel with Geometric Convergence

Figure 2 for A Linear Approximation to the chi^2 Kernel with Geometric Convergence

Figure 3 for A Linear Approximation to the chi^2 Kernel with Geometric Convergence

Figure 4 for A Linear Approximation to the chi^2 Kernel with Geometric Convergence

Abstract:We propose a new analytical approximation to the $\chi^2$ kernel that converges geometrically. The analytical approximation is derived with elementary methods and adapts to the input distribution for optimal convergence rate. Experiments show the new approximation leads to improved performance in image classification and semantic segmentation tasks using a random Fourier feature approximation of the $\exp-\chi^2$ kernel. Besides, out-of-core principal component analysis (PCA) methods are introduced to reduce the dimensionality of the approximation and achieve better performance at the expense of only an additional constant factor to the time complexity. Moreover, when PCA is performed jointly on the training and unlabeled testing data, further performance improvements can be obtained. Experiments conducted on the PASCAL VOC 2010 segmentation and the ImageNet ILSVRC 2010 datasets show statistically significant improvements over alternative approximation methods.

Via

Access Paper or Ask Questions

The Manifold of Human Emotions

Jan 15, 2013

Seungyeon Kim, Fuxin Li, Guy Lebanon, Irfan Essa

Figure 1 for The Manifold of Human Emotions

Figure 2 for The Manifold of Human Emotions

Abstract:Sentiment analysis predicts the presence of positive or negative emotions in a text document. In this paper, we consider higher dimensional extensions of the sentiment concept, which represent a richer set of human emotions. Our approach goes beyond previous work in that our model contains a continuous manifold rather than a finite set of human emotions. We investigate the resulting model, compare it to psychological observations, and explore its predictive capabilities.

* 3 pages, 2 figures

Via

Access Paper or Ask Questions

Matrix Approximation under Local Low-Rank Assumption

Jan 15, 2013

Joonseok Lee, Seungyeon Kim, Guy Lebanon, Yoram Singer

Figure 1 for Matrix Approximation under Local Low-Rank Assumption

Figure 2 for Matrix Approximation under Local Low-Rank Assumption

Abstract:Matrix approximation is a common tool in machine learning for building accurate prediction models for recommendation systems, text mining, and computer vision. A prevalent assumption in constructing matrix approximations is that the partially observed matrix is of low-rank. We propose a new matrix approximation model where we assume instead that the matrix is only locally of low-rank, leading to a representation of the observed matrix as a weighted sum of low-rank matrices. We analyze the accuracy of the proposed local low-rank modeling. Our experiments show improvements in prediction accuracy in recommendation tasks.

* 3 pages, 2 figures, Workshop submission to the First International Conference on Learning Representations (ICLR)

Via

Access Paper or Ask Questions

Learning Riemannian Metrics

Oct 19, 2012

Guy Lebanon

Figure 1 for Learning Riemannian Metrics

Figure 2 for Learning Riemannian Metrics

Figure 3 for Learning Riemannian Metrics

Figure 4 for Learning Riemannian Metrics

Abstract:We propose a solution to the problem of estimating a Riemannian metric associated with a given differentiable manifold. The metric learning problem is based on minimizing the relative volume of a given set of points. We derive the details for a family of metrics on the multinomial simplex. The resulting metric has applications in text classification and bears some similarity to TFIDF representation of text documents.

* Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

Via

Access Paper or Ask Questions

Smooth Sparse Coding via Marginal Regression for Learning Sparse Representations

Oct 03, 2012

Krishnakumar Balasubramanian, Kai Yu, Guy Lebanon

Figure 1 for Smooth Sparse Coding via Marginal Regression for Learning Sparse Representations

Figure 2 for Smooth Sparse Coding via Marginal Regression for Learning Sparse Representations

Figure 3 for Smooth Sparse Coding via Marginal Regression for Learning Sparse Representations

Figure 4 for Smooth Sparse Coding via Marginal Regression for Learning Sparse Representations

Abstract:We propose and analyze a novel framework for learning sparse representations, based on two statistical techniques: kernel smoothing and marginal regression. The proposed approach provides a flexible framework for incorporating feature similarity or temporal information present in data sets, via non-parametric kernel smoothing. We provide generalization bounds for dictionary learning using smooth sparse coding and show how the sample complexity depends on the L1 norm of kernel function used. Furthermore, we propose using marginal regression for obtaining sparse codes, which significantly improves the speed and allows one to scale to large dictionary sizes easily. We demonstrate the advantages of the proposed approach, both in terms of accuracy and speed by extensive experimentation on several real data sets. In addition, we demonstrate how the proposed approach could be used for improving semi-supervised sparse coding.

Via

Access Paper or Ask Questions

An Extended Cencov-Campbell Characterization of Conditional Information Geometry

Jul 11, 2012

Guy Lebanon

Abstract:We formulate and prove an axiomatic characterization of conditional information geometry, for both the normalized and the nonnormalized cases. This characterization extends the axiomatic derivation of the Fisher geometry by Cencov and Campbell to the cone of positive conditional models, and as a special case to the manifold of conditional distributions. Due to the close connection between the conditional I-divergence and the product Fisher information metric the characterization provides a new axiomatic interpretation of the primal problems underlying logistic regression and AdaBoost.

* Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

Via

Access Paper or Ask Questions

The Landmark Selection Method for Multiple Output Prediction

Jun 27, 2012

Krishnakumar Balasubramanian, Guy Lebanon

Figure 1 for The Landmark Selection Method for Multiple Output Prediction

Figure 2 for The Landmark Selection Method for Multiple Output Prediction

Figure 3 for The Landmark Selection Method for Multiple Output Prediction

Figure 4 for The Landmark Selection Method for Multiple Output Prediction

Abstract:Conditional modeling x \to y is a central problem in machine learning. A substantial research effort is devoted to such modeling when x is high dimensional. We consider, instead, the case of a high dimensional y, where x is either low dimensional or high dimensional. Our approach is based on selecting a small subset y_L of the dimensions of y, and proceed by modeling (i) x \to y_L and (ii) y_L \to y. Composing these two models, we obtain a conditional model x \to y that possesses convenient statistical properties. Multi-label classification and multivariate regression experiments on several datasets show that this model outperforms the one vs. all approach as well as several sophisticated multiple output prediction methods.

* Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)

Via

Access Paper or Ask Questions

Sequential Document Representations and Simplicial Curves

Jun 27, 2012

Guy Lebanon

Figure 1 for Sequential Document Representations and Simplicial Curves

Figure 2 for Sequential Document Representations and Simplicial Curves

Figure 3 for Sequential Document Representations and Simplicial Curves

Figure 4 for Sequential Document Representations and Simplicial Curves

Abstract:The popular bag of words assumption represents a document as a histogram of word occurrences. While computationally efficient, such a representation is unable to maintain any sequential information. We present a continuous and differentiable sequential document representation that goes beyond the bag of words assumption, and yet is efficient and effective. This representation employs smooth curves in the multinomial simplex to account for sequential information. We discuss the representation and its geometric properties and demonstrate its applicability for the task of text classification.

* Appears in Proceedings of the Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI2006)

Via

Access Paper or Ask Questions