Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Dawon Ahn

Global and Local Structure Learning for Sparse Tensor Completion

Mar 26, 2025

Dawon Ahn, Evangelos E. Papalexakis

Abstract:How can we accurately complete tensors by learning relationships of dimensions along each mode? Tensor completion, a widely studied problem, is to predict missing entries in incomplete tensors. Tensor decomposition methods, fundamental tensor analysis tools, have been actively developed to solve tensor completion tasks. However, standard tensor decomposition models have not been designed to learn relationships of dimensions along each mode, which limits to accurate tensor completion. Also, previously developed tensor decomposition models have required prior knowledge between relations within dimensions to model the relations, expensive to obtain. This paper proposes TGL (Tensor Decomposition Learning Global and Local Structures) to accurately predict missing entries in tensors. TGL reconstructs a tensor with factor matrices which learn local structures with GNN without prior knowledges. Extensive experiments are conducted to evaluate TGL with baselines and datasets.

Via

Access Paper or Ask Questions

Tensor Completion for Surrogate Modeling of Material Property Prediction

Jan 30, 2025

Shaan Pakala, Dawon Ahn, Evangelos Papalexakis

Figure 1 for Tensor Completion for Surrogate Modeling of Material Property Prediction

Figure 2 for Tensor Completion for Surrogate Modeling of Material Property Prediction

Figure 3 for Tensor Completion for Surrogate Modeling of Material Property Prediction

Figure 4 for Tensor Completion for Surrogate Modeling of Material Property Prediction

Abstract:When designing materials to optimize certain properties, there are often many possible configurations of designs that need to be explored. For example, the materials' composition of elements will affect properties such as strength or conductivity, which are necessary to know when developing new materials. Exploring all combinations of elements to find optimal materials becomes very time consuming, especially when there are more design variables. For this reason, there is growing interest in using machine learning (ML) to predict a material's properties. In this work, we model the optimization of certain material properties as a tensor completion problem, to leverage the structure of our datasets and navigate the vast number of combinations of material configurations. Across a variety of material property prediction tasks, our experiments show tensor completion methods achieving 10-20% decreased error compared with baseline ML models such as GradientBoosting and Multilayer Perceptron (MLP), while maintaining similar training speed.

* 2 page paper accepted to AAAI KGML 2025 bridge program

Via

Access Paper or Ask Questions

Automating Data Science Pipelines with Tensor Completion

Oct 08, 2024

Shaan Pakala, Bryce Graw, Dawon Ahn, Tam Dinh, Mehnaz Tabassum Mahin, Vassilis Tsotras, Jia Chen, Evangelos E. Papalexakis

Figure 1 for Automating Data Science Pipelines with Tensor Completion

Figure 2 for Automating Data Science Pipelines with Tensor Completion

Figure 3 for Automating Data Science Pipelines with Tensor Completion

Figure 4 for Automating Data Science Pipelines with Tensor Completion

Abstract:Hyperparameter optimization is an essential component in many data science pipelines and typically entails exhaustive time and resource-consuming computations in order to explore the combinatorial search space. Similar to this problem, other key operations in data science pipelines exhibit the exact same properties. Important examples are: neural architecture search, where the goal is to identify the best design choices for a neural network, and query cardinality estimation, where given different predicate values for a SQL query the goal is to estimate the size of the output. In this paper, we abstract away those essential components of data science pipelines and we model them as instances of tensor completion, where each variable of the search space corresponds to one mode of the tensor, and the goal is to identify all missing entries of the tensor, corresponding to all combinations of variable values, starting from a very small sample of observed entries. In order to do so, we first conduct a thorough experimental evaluation of existing state-of-the-art tensor completion techniques and introduce domain-inspired adaptations (such as smoothness across the discretized variable space) and an ensemble technique which is able to achieve state-of-the-art performance. We extensively evaluate existing and proposed methods in a number of datasets generated corresponding to (a) hyperparameter optimization for non-neural network models, (b) neural architecture search, and (c) variants of query cardinality estimation, demonstrating the effectiveness of tensor completion as a tool for automating data science pipelines. Furthermore, we release our generated datasets and code in order to provide benchmarks for future work on this topic.

Via

Access Paper or Ask Questions

TRAWL: Tensor Reduced and Approximated Weights for Large Language Models

Jun 25, 2024

Yiran Luo, Het Patel, Yu Fu, Dawon Ahn, Jia Chen, Yue Dong, Evangelos E. Papalexakis

Figure 1 for TRAWL: Tensor Reduced and Approximated Weights for Large Language Models

Figure 2 for TRAWL: Tensor Reduced and Approximated Weights for Large Language Models

Figure 3 for TRAWL: Tensor Reduced and Approximated Weights for Large Language Models

Figure 4 for TRAWL: Tensor Reduced and Approximated Weights for Large Language Models

Abstract:Large language models (LLMs) have fundamentally transformed artificial intelligence, catalyzing recent advancements while imposing substantial environmental and computational burdens. We introduce TRAWL (Tensor Reduced and Approximated Weights for Large Language Models), a novel methodology for optimizing LLMs through tensor decomposition. TRAWL leverages diverse strategies to exploit matrices within transformer-based architectures, realizing notable performance enhancements without necessitating retraining. The most significant improvements were observed through a layer-by-layer intervention strategy, particularly when applied to fully connected weights of the final layers, yielding up to 16% enhancement in accuracy without the need for additional data or fine-tuning. These results underscore the importance of targeted and adaptive techniques in increasing the efficiency and effectiveness of large language model optimization, thereby promoting the development of more sustainable and accessible AI systems.

* 8 pages, 5 figures. Submitted to EMNLP 2024 and under review

Via

Access Paper or Ask Questions

Time-Aware Tensor Decomposition for Missing Entry Prediction

Dec 16, 2020

Dawon Ahn, Jun-Gi Jang, U Kang

Figure 1 for Time-Aware Tensor Decomposition for Missing Entry Prediction

Figure 2 for Time-Aware Tensor Decomposition for Missing Entry Prediction

Figure 3 for Time-Aware Tensor Decomposition for Missing Entry Prediction

Figure 4 for Time-Aware Tensor Decomposition for Missing Entry Prediction

Abstract:Given a time-evolving tensor with missing entries, how can we effectively factorize it for precisely predicting the missing entries? Tensor factorization has been extensively utilized for analyzing various multi-dimensional real-world data. However, existing models for tensor factorization have disregarded the temporal property for tensor factorization while most real-world data are closely related to time. Moreover, they do not address accuracy degradation due to the sparsity of time slices. The essential problems of how to exploit the temporal property for tensor decomposition and consider the sparsity of time slices remain unresolved. In this paper, we propose TATD (Time-Aware Tensor Decomposition), a novel tensor decomposition method for real-world temporal tensors. TATD is designed to exploit temporal dependency and time-varying sparsity of real-world temporal tensors. We propose a new smoothing regularization with Gaussian kernel for modeling time dependency. Moreover, we improve the performance of TATD by considering time-varying sparsity. We design an alternating optimization scheme suitable for temporal tensor factorization with our smoothing regularization. Extensive experiments show that TATD provides the state-of-the-art accuracy for decomposing temporal tensors.

* 20 pages

Via

Access Paper or Ask Questions