Abstract: We introduce a novel algorithm that computes the $k$-sparse principal component of a positive semidefinite matrix $A$. Our algorithm is combinatorial and operates by examining a discrete set of special vectors lying in a low-dimensional eigen-subspace of $A$. We obtain provable approximation guarantees that depend on the spectral decay profile of the matrix: the faster the eigenvalues decay, the better the quality of our approximation. For example, if the eigenvalues of $A$ follow a power-law decay, we obtain a polynomial-time approximation algorithm for any desired accuracy. A key algorithmic component of our scheme is a combinatorial feature-elimination step that is provably safe and, in practice, significantly reduces the running time of our algorithm. We implement our algorithm and test it on multiple artificial and real data sets. Thanks to the feature-elimination step, sparse PCA can be performed on data sets with millions of entries in a few minutes. Our experimental evaluation shows that our scheme attains near-optimal objective values while producing very sparse vectors. We compare against the prior state of the art and show that our scheme matches or outperforms previous algorithms on all tested data sets.
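A minimal sketch of the overall recipe, under simplifying assumptions: the function below samples random directions on the $d$-dimensional sphere instead of the paper's specific discrete set of special vectors, performs no feature elimination, and all names (sparse_pc_sketch, n_samples) are illustrative. For each sampled direction in the top-$d$ eigen-subspace, it scores the support of the $k$ largest-magnitude induced coordinates and keeps the best principal-submatrix eigenpair.

```python
import numpy as np

def sparse_pc_sketch(A, k, d=2, n_samples=2000, seed=0):
    """Toy k-sparse PCA on a PSD matrix A via its rank-d eigen-subspace.

    For each sampled direction c on the d-dimensional sphere, the k
    coordinates of V @ (sqrt(lam) * c) with largest magnitude define a
    candidate support; the best k-sparse unit vector on that support is
    the top eigenvector of the corresponding principal submatrix of A.
    """
    rng = np.random.default_rng(seed)
    lam, V = np.linalg.eigh(A)                 # eigenvalues in ascending order
    lam, V = lam[-d:], V[:, -d:]               # keep the top-d eigenpairs
    best_val, best_x = -np.inf, None
    for _ in range(n_samples):
        c = rng.standard_normal(d)
        c /= np.linalg.norm(c)                 # random point on the d-sphere
        scores = np.abs(V @ (np.sqrt(np.maximum(lam, 0)) * c))
        support = np.argsort(scores)[-k:]      # candidate index set
        w, U = np.linalg.eigh(A[np.ix_(support, support)])
        if w[-1] > best_val:                   # keep the best submatrix eigenpair
            best_val = w[-1]
            best_x = np.zeros(A.shape[0])
            best_x[support] = U[:, -1]
    return best_val, best_x
```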
Abstract: The computation of the sparse principal component of a matrix is equivalent to the identification of its principal submatrix with the largest maximum eigenvalue. Finding this optimal submatrix is what renders the problem ${\mathcal{NP}}$-hard. In this work, we prove that, if the matrix is positive semidefinite and its rank is constant, then its sparse principal component is polynomially computable. Our proof utilizes the auxiliary unit vector technique, which has recently been developed as a tool for identifying polynomially solvable problems. Moreover, we use this technique to design an algorithm which, for any sparsity value, computes the sparse principal component with complexity ${\mathcal O}\left(N^{D+1}\right)$, where $N$ and $D$ are the matrix size and rank, respectively. Our algorithm is fully parallelizable and memory-efficient.
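The equivalence stated in the first sentence can be checked directly on a toy instance: the optimal value of $k$-sparse PCA coincides with the largest maximum eigenvalue over all $k\times k$ principal submatrices. The brute-force sketch below is exponential in $N$ and purely illustrative; it is not the ${\mathcal O}\left(N^{D+1}\right)$ algorithm described above, and the function name is ours.

```python
import numpy as np
from itertools import combinations

def sparse_pc_bruteforce(A, k):
    """Exact k-sparse principal component of a PSD matrix A by enumerating
    all k-element index sets: for each principal submatrix A[S, S], the best
    k-sparse unit vector supported on S is its top eigenvector."""
    N = A.shape[0]
    best_val, best_x = -np.inf, None
    for S in combinations(range(N), k):
        S = list(S)
        w, U = np.linalg.eigh(A[np.ix_(S, S)])
        if w[-1] > best_val:
            best_val = w[-1]
            best_x = np.zeros(N)
            best_x[S] = U[:, -1]
    return best_val, best_x

# Example: the maximal value of x.T @ A @ x over unit vectors with at most
# k nonzeros coincides with the largest principal-submatrix eigenvalue.
rng = np.random.default_rng(1)
B = rng.standard_normal((6, 3))
A = B @ B.T                                   # PSD matrix of rank 3
val, x = sparse_pc_bruteforce(A, k=2)
print(val, x @ A @ x)                         # the two values agree
```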
Abstract: We consider the problem of identifying the sparse principal component of a rank-deficient matrix. We introduce auxiliary spherical variables and prove that there exists a set of candidate index-sets (that is, sets of indices of the nonzero elements of the vector argument) whose size is polynomially bounded in terms of the rank and which contains the optimal index-set, i.e., the index-set of the nonzero elements of the optimal solution. Finally, we develop an algorithm that computes the optimal sparse principal component in polynomial time for any sparsity degree.
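One way to read the role of the auxiliary spherical variables, reconstructed here under the assumption that the rank-$D$ matrix factors as $A = VV^T$ with $V \in \mathbb{R}^{N\times D}$ (a hedged sketch, not necessarily the paper's exact argument):
\[
\max_{\|x\|=1,\,\|x\|_0\le k} x^T A x
\;=\; \max_{\|x\|=1,\,\|x\|_0\le k} \left\|V^T x\right\|^2
\;=\; \max_{\|c\|=1}\ \max_{\|x\|=1,\,\|x\|_0\le k} \left(x^T V c\right)^2,
\]
where $c \in \mathbb{R}^D$ is the auxiliary spherical variable. For each fixed unit vector $c$, the inner maximization over $x$ is attained by supporting $x$ on the $k$ indices where $|(Vc)_i|$ is largest (with entries proportional to those of $Vc$). Since sweeping $c$ over the $D$-dimensional sphere can induce only polynomially many (in $N$, for fixed $D$) distinct top-$k$ rankings of $|(Vc)_i|$, the resulting index-sets form the polynomially sized candidate collection referred to above.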