Abstract: Canonical correlation analysis (CCA) is a classic statistical method for discovering latent co-variation that underpins two or more observed random vectors. Several extensions and variations of CCA have been proposed that strengthen our ability to reveal common random factors from multiview datasets. In this work, we first revisit the most recent deterministic extensions of deep CCA and highlight the strengths and limitations of these state-of-the-art methods. Some methods allow trivial solutions, while others can miss weak common factors. Others additionally seek to reveal what is not common among the views -- i.e., the private components that are needed to fully reconstruct each view -- which overloads the problem and increases its computational and sample complexities. Aiming to address these limitations, we design a novel and efficient formulation that alleviates some of the current restrictions. The main idea is to model the private components as conditionally independent given the common ones, which enables the proposed compact formulation. In addition, we provide a sufficient condition for identifying the common random factors. Judicious experiments with synthetic and real datasets showcase the validity of our claims and the effectiveness of the proposed approach.
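As a point of reference for the deep extensions discussed above, the sketch below runs classical linear CCA on two synthetic views with scikit-learn; the data, dimensions, and number of components are illustrative assumptions, not taken from the paper.

```python
import numpy as np
from sklearn.cross_decomposition import CCA

rng = np.random.default_rng(0)
n, k = 500, 2                        # samples and number of common factors (assumed)
S = rng.standard_normal((n, k))      # shared latent factors
X = S @ rng.standard_normal((k, 10)) + 0.1 * rng.standard_normal((n, 10))  # view 1
Y = S @ rng.standard_normal((k, 8)) + 0.1 * rng.standard_normal((n, 8))    # view 2

cca = CCA(n_components=k).fit(X, Y)
Xc, Yc = cca.transform(X, Y)         # canonical variates of the two views
print(np.corrcoef(Xc[:, 0], Yc[:, 0])[0, 1])   # close to 1 for a strong common factor
```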
Abstract: Most existing studies on linear bandits focus on a one-dimensional (vector) characterization of the overall system. While representative, this formulation may fail to model applications with high-dimensional but favorable structures, such as low-rank tensor representations in recommender systems. To address this limitation, this work studies a general tensor bandits model, in which actions and system parameters are represented by tensors rather than vectors, and we focus in particular on the case where the unknown system tensor is low-rank. A novel bandit algorithm, coined TOFU (Tensor Optimism in the Face of Uncertainty), is developed. TOFU first leverages flexible tensor regression techniques to estimate low-dimensional subspaces associated with the system tensor. These estimates are then used to convert the original problem into a new one with norm constraints on its system parameters. Lastly, TOFU adopts a norm-constrained bandit subroutine which exploits these constraints to avoid exploring the entire high-dimensional parameter space. Theoretical analyses show that TOFU improves the best-known regret upper bound by a multiplicative factor that grows exponentially in the system order. A novel performance lower bound is also established, which further corroborates the efficiency of TOFU.
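To make the setting concrete, here is a minimal sketch (my own illustration, not code from the paper) of the naive baseline that TOFU improves upon: vectorize the tensor actions and run a standard LinUCB/OFUL-style linear bandit, which ignores the low-rank structure and therefore explores the full high-dimensional parameter space.

```python
import numpy as np

rng = np.random.default_rng(0)
shape = (4, 4, 4)                              # order-3 tensor actions (illustrative sizes)
d = int(np.prod(shape))
theta = rng.standard_normal(d) / np.sqrt(d)    # vectorized unknown system tensor
arms = rng.standard_normal((50, *shape))       # candidate tensor actions
A = arms.reshape(50, d)                        # vectorize: low-rank structure is lost here

lam, beta = 1.0, 1.0
V, b = lam * np.eye(d), np.zeros(d)
for t in range(200):
    V_inv = np.linalg.inv(V)
    theta_hat = V_inv @ b
    ucb = A @ theta_hat + beta * np.sqrt(np.einsum('id,de,ie->i', A, V_inv, A))
    x = A[np.argmax(ucb)]                      # optimism in the face of uncertainty
    r = x @ theta + 0.1 * rng.standard_normal()    # noisy reward
    V += np.outer(x, x)
    b += r * x
```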
Abstract: Functional magnetic resonance imaging (fMRI) is one of the most popular methods for studying the human brain. Task-related fMRI data processing aims to determine which brain areas are activated when a specific task is performed and is usually based on the Blood Oxygen Level Dependent (BOLD) signal. The background BOLD signal also reflects systematic fluctuations in regional brain activity which are attributed to the existence of resting-state brain networks. We propose a new fMRI data-generating model which takes into consideration the existence of common task-related and resting-state components. We first estimate the common task-related temporal component via two successive stages of generalized canonical correlation analysis (GCCA), and then estimate the common task-related spatial component, which yields a task-related activation map. Experiments with synthetic data reveal that we can obtain very accurate temporal and spatial estimates even at very low Signal to Noise Ratio (SNR), as is typical in fMRI data processing. Tests with real-world fMRI data show significant advantages over standard procedures based on General Linear Models (GLMs).
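For readers unfamiliar with generalized CCA, the following numpy sketch implements one standard formulation (MAX-VAR GCCA) that extracts a common component shared by several views; it illustrates only the building block, not the paper's full two-stage pipeline.

```python
import numpy as np

def maxvar_gcca(views, k):
    """MAX-VAR GCCA: common components shared by all views.

    views : list of (n_samples, d_i) matrices (assumed column-centered)
    k     : number of common components to extract
    """
    n = views[0].shape[0]
    M = np.zeros((n, n))
    for Y in views:
        M += Y @ np.linalg.pinv(Y.T @ Y) @ Y.T     # projector onto the column space of Y
    vals, vecs = np.linalg.eigh(M)
    G = vecs[:, -k:]                               # top-k eigenvectors = common components
    W = [np.linalg.pinv(Y) @ G for Y in views]     # per-view loadings mapping each view to G
    return G, W
```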
Abstract: We consider the problem of finding the smallest or largest entry of a tensor of order $N$ that is specified via its rank decomposition. Stated differently, we are given $N$ sets of $R$-dimensional vectors and we wish to select one vector from each set such that the sum of the elements of the Hadamard (elementwise) product of the selected vectors is minimized or maximized. This is a fundamental tensor problem with numerous applications in embedding similarity search, recommender systems, graph mining, multivariate probability, and statistics. We show that this discrete optimization problem is NP-hard for any tensor rank higher than one, but we also provide an equivalent continuous reformulation which is amenable to disciplined non-convex optimization. We propose a suite of gradient-based approximation algorithms whose performance in preliminary experiments appears to be promising.
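The objective can be made concrete in a few lines of numpy: given CP factor matrices, the tensor entry indexed by one row per factor is the sum of the Hadamard product of the selected rows. The brute-force search below is exponential in $N$ and is shown only to fix ideas; the paper instead proposes gradient-based continuous relaxations.

```python
import numpy as np
from itertools import product

def cp_entry(factors, idx):
    # factors: list of N matrices, the n-th of size I_n x R (rank decomposition)
    # entry value = sum over r of prod_n factors[n][idx[n], r]
    rows = np.stack([A[i] for A, i in zip(factors, idx)])   # N selected R-dim vectors
    return rows.prod(axis=0).sum()                          # sum of their Hadamard product

def smallest_entry_brute_force(factors):
    # exponential in N -- only for sanity checks on tiny problems
    ranges = [range(A.shape[0]) for A in factors]
    return min(product(*ranges), key=lambda idx: cp_entry(factors, idx))
```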
Abstract: Learning the multivariate distribution of data is a core challenge in statistics and machine learning. Traditional methods aim to estimate the probability density function (PDF) and are limited by the curse of dimensionality. Modern neural methods are mostly based on black-box models and lack identifiability guarantees. In this work, we aim to learn multivariate cumulative distribution functions (CDFs), as they can handle mixed random variables, allow efficient box-probability evaluation, and have the potential to overcome local sample scarcity owing to their cumulative nature. We show that any grid-sampled version of a joint CDF of mixed random variables admits a universal representation as a naive Bayes model via the Canonical Polyadic (tensor-rank) decomposition. By introducing a low-rank model, either directly in the raw data domain or indirectly in a transformed (Copula) domain, the resulting model affords efficient sampling, closed-form inference and uncertainty quantification, and comes with uniqueness guarantees under relatively mild conditions. We demonstrate the superior performance of the proposed model on several synthetic and real datasets and applications, including regression, sampling, and data imputation. Interestingly, our experiments with real data show that it is possible to obtain better density/mass estimates indirectly, via a low-rank CDF model, than directly via a low-rank PDF/PMF model.
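The grid-sampled CDF idea can be illustrated with tensorly: the sketch below builds an empirical joint CDF of three continuous variables on a small grid and fits a plain rank-R CP model to it. The grid size, rank, and synthetic data are illustrative assumptions, and the nonnegativity/normalization constraints behind the paper's naive Bayes interpretation are omitted.

```python
import numpy as np
import tensorly as tl
from tensorly.decomposition import parafac

rng = np.random.default_rng(0)
X = rng.multivariate_normal(np.zeros(3), [[1, .6, .3], [.6, 1, .5], [.3, .5, 1]], size=2000)

G = 16                                               # grid points per variable (assumed)
grids = [np.quantile(X[:, d], np.linspace(0.05, 1.0, G)) for d in range(3)]
F = np.zeros((G, G, G))                              # grid-sampled empirical joint CDF
for i, j, k in np.ndindex(G, G, G):
    F[i, j, k] = np.mean((X[:, 0] <= grids[0][i]) &
                         (X[:, 1] <= grids[1][j]) &
                         (X[:, 2] <= grids[2][k]))

weights, factors = parafac(tl.tensor(F), rank=5, normalize_factors=True)   # low-rank CP model
```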
Abstract: Learning generative probabilistic models is a core problem in machine learning, which presents significant challenges due to the curse of dimensionality. This paper proposes a joint dimensionality reduction and non-parametric density estimation framework, using a novel estimator that explicitly captures the underlying distribution of appropriate reduced-dimension representations of the input data. The idea is to jointly design a nonlinear dimensionality-reducing auto-encoder that models the training data in terms of a parsimonious set of latent random variables, and to learn a canonical low-rank tensor model of the joint distribution of the latent variables in the Fourier domain. The proposed latent density model is non-parametric and universal, as opposed to the predefined prior assumed in variational auto-encoders. The auto-encoder and the latent density estimator are optimized jointly by minimizing a combination of the negative log-likelihood in the latent domain and the auto-encoder reconstruction loss. We demonstrate that the proposed model achieves very promising results on toy, tabular, and image datasets, in regression, sampling, and anomaly detection tasks.
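The shape of the joint objective can be sketched in a few lines of PyTorch. Note that the latent density below is a simple trainable diagonal Gaussian used as a stand-in; the paper's key ingredient, the low-rank Fourier-domain tensor density model, is not reproduced here.

```python
import torch
import torch.nn as nn
import torch.distributions as td

class AEWithLatentDensity(nn.Module):
    def __init__(self, x_dim, z_dim, hidden=64):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(x_dim, hidden), nn.ReLU(), nn.Linear(hidden, z_dim))
        self.dec = nn.Sequential(nn.Linear(z_dim, hidden), nn.ReLU(), nn.Linear(hidden, x_dim))
        # stand-in latent density: trainable diagonal Gaussian (NOT the paper's tensor model)
        self.mu = nn.Parameter(torch.zeros(z_dim))
        self.log_std = nn.Parameter(torch.zeros(z_dim))

    def loss(self, x, lam=1.0):
        z = self.enc(x)
        recon = ((self.dec(z) - x) ** 2).mean()                    # auto-encoder reconstruction loss
        density = td.Independent(td.Normal(self.mu, self.log_std.exp()), 1)
        nll = -density.log_prob(z).mean()                          # negative log-likelihood in latent domain
        return recon + lam * nll                                   # combined joint objective
```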
Abstract: This study presents PRISM, a probabilistic simplex component analysis approach to identifying the vertices of a data-circumscribing simplex from data. The problem has a rich variety of applications, the most notable being hyperspectral unmixing in remote sensing and non-negative matrix factorization in machine learning. PRISM uses a simple probabilistic model, namely, uniform simplex data distribution and additive Gaussian noise, and it carries out inference by maximum likelihood. The inference model is sound in the sense that the vertices are provably identifiable under some assumptions, and it suggests that PRISM can be effective in combating noise when the number of data points is large. PRISM has strong, but hidden, relationships with simplex volume minimization, a powerful geometric approach for the same problem. We study these fundamental aspects, and we also consider algorithmic schemes based on importance sampling and variational inference. In particular, the variational inference scheme is shown to resemble a matrix factorization problem with a special regularizer, which draws an interesting connection to the matrix factorization approach. Numerical results are provided to demonstrate the potential of PRISM.
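PRISM's generative model is easy to simulate: a uniform distribution on the simplex is a symmetric Dirichlet with all concentration parameters equal to one, plus additive Gaussian noise. The dimensions and noise level below are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
D, K, N = 5, 3, 2000                          # ambient dim, simplex vertices, data points (assumed)
A = rng.uniform(size=(D, K))                  # columns = unknown simplex vertices
S = rng.dirichlet(np.ones(K), size=N).T       # uniform simplex data distribution = Dirichlet(1,...,1)
sigma = 0.02                                  # additive Gaussian noise level
X = A @ S + sigma * rng.standard_normal((D, N))   # observations lie (up to noise) inside the simplex
```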
Abstract: Matrix factorization (MF) plays an important role in a wide range of machine learning and data mining models. MF is commonly used to obtain item embeddings and feature representations due to its ability to capture correlations and higher-order statistical dependencies across dimensions. In many applications, the categories of items exhibit a hierarchical tree structure. For instance, human diseases can be divided into coarse categories, e.g., bacterial and viral. These categories can be further divided into finer categories, e.g., viral infections can be respiratory, gastrointestinal, or exanthematous. In e-commerce, products, movies, books, etc., are grouped into hierarchical categories, e.g., clothing items are divided by gender, then by type (formal, casual, etc.). While the tree structure and the categories of the different items may be known in some applications, in many others they have to be learned together with the embeddings. In this work, we propose eTREE, a model that incorporates the (usually ignored) tree structure to enhance the quality of the embeddings. We leverage the special uniqueness properties of Nonnegative MF (NMF) to prove identifiability of eTREE. The proposed model not only exploits the tree-structure prior, but also learns the hierarchical clustering in an unsupervised, data-driven fashion. We derive an efficient algorithmic solution and a scalable implementation of eTREE that exploits parallel computing, computation caching, and warm-start strategies. We showcase the effectiveness of eTREE on real data from various application domains: healthcare, recommender systems, and education. We also demonstrate the meaningfulness of the tree obtained by eTREE through interpretation by domain experts.
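As a baseline for the kind of factorization eTREE builds on, the sketch below fits plain nonnegative matrix factorization with scikit-learn to obtain item embeddings; eTREE's tree-structure regularization and hierarchical clustering are not part of this snippet, and the data are synthetic placeholders.

```python
import numpy as np
from sklearn.decomposition import NMF

rng = np.random.default_rng(0)
X = np.abs(rng.standard_normal((300, 40)))    # nonnegative item-by-feature matrix (placeholder data)

model = NMF(n_components=10, init='nndsvda', max_iter=500, random_state=0)
W = model.fit_transform(X)                    # item embeddings (rows of W)
H = model.components_                         # latent factor-by-feature matrix
```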
Abstract: Accurate prediction of the transmission of epidemic diseases such as COVID-19 is crucial for implementing effective mitigation measures. In this work, we develop a tensor method to predict the evolution of epidemic trends for many regions simultaneously. We construct a 3-way spatio-temporal tensor (location, attribute, time) of case counts and propose a nonnegative tensor factorization with latent epidemiological model regularization, named STELAR. Unlike standard tensor factorization methods, which cannot predict future slabs, STELAR enables long-term prediction by incorporating latent temporal regularization through a system of discrete-time difference equations of a widely adopted epidemiological model. We use latent, rather than location/attribute-level, epidemiological dynamics to capture common epidemic profile sub-types and improve collaborative learning and prediction. We conduct experiments using both county- and state-level COVID-19 data and show that our model can identify interesting latent patterns of the epidemic. Finally, we evaluate the predictive ability of our method and show superior performance compared to the baselines, achieving up to 21% lower root mean square error and 25% lower mean absolute error for county-level prediction.
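The tensor factorization backbone can be illustrated with tensorly: the snippet fits a plain nonnegative CP decomposition to a synthetic (location, attribute, time) count tensor. The sizes and rank are illustrative, and STELAR's epidemiological regularization of the temporal factors is not included.

```python
import numpy as np
import tensorly as tl
from tensorly.decomposition import non_negative_parafac

rng = np.random.default_rng(0)
T = rng.poisson(5.0, size=(50, 4, 120)).astype(float)   # (location, attribute, time) counts (synthetic)

weights, factors = non_negative_parafac(tl.tensor(T), rank=8, n_iter_max=300)
locations, attributes, time_profiles = factors
# time_profiles[:, r] is the r-th latent temporal epidemic profile; STELAR additionally
# constrains these profiles to follow discrete-time epidemiological dynamics
```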
Abstract: Node representation learning is the task of extracting concise and informative feature embeddings of entities that are connected in a network. Many real-world network datasets include information about both node connectivity and node attributes, in the form of features or time-series data. Modern representation learning techniques utilize both the connectivity and the attribute information of the nodes to produce embeddings in an unsupervised manner. In this context, deriving embeddings that preserve the geometry of the network and of the attribute vectors is highly desirable, as such embeddings reflect both the topological neighborhood structure and proximity in feature space. While this is fairly straightforward to maintain when only the connectivity or only the attribute information of the network is observed, preserving the geometry of both types of information is challenging. This paper proposes a novel tensor factorization approach for node embedding in attributed networks that preserves the distances of both the connections and the attributes, along with an effective and lightweight algorithm to tackle the learning task. Judicious experiments with multiple state-of-the-art baselines suggest that the proposed algorithm offers significant performance improvements in node classification and link prediction tasks.
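For context on the downstream evaluation, the helper below scores a link prediction task from learned node embeddings using dot-product scores and ROC AUC; this is a generic protocol, and the function name and scoring rule are illustrative choices rather than the paper's.

```python
import numpy as np
from sklearn.metrics import roc_auc_score

def link_prediction_auc(Z, pos_edges, neg_edges):
    """Z: (n_nodes, d) node embeddings; *_edges: (m, 2) integer arrays of node index pairs."""
    score = lambda E: np.sum(Z[E[:, 0]] * Z[E[:, 1]], axis=1)    # dot-product edge score
    y_true = np.concatenate([np.ones(len(pos_edges)), np.zeros(len(neg_edges))])
    y_score = np.concatenate([score(pos_edges), score(neg_edges)])
    return roc_auc_score(y_true, y_score)
```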