Huseyin Uzunalioglu

Time Series Language Model for Descriptive Caption Generation

Jan 03, 2025

Mohamed Trabelsi, Aidan Boyd, Jin Cao, Huseyin Uzunalioglu

Abstract: The automatic generation of representative natural language descriptions for observable patterns in time series data enhances interpretability, simplifies analysis, and increases the cross-domain utility of temporal data. While pre-trained foundation models have made considerable progress in natural language processing (NLP) and computer vision (CV), their application to time series analysis has been hindered by data scarcity. Although several large language model (LLM)-based methods have been proposed for time series forecasting, time series captioning is under-explored in the context of LLMs. In this paper, we introduce TSLM, a novel time series language model designed specifically for time series captioning. TSLM operates as an encoder-decoder model, leveraging both text prompts and time series data representations to capture subtle temporal patterns across multiple phases and generate precise textual descriptions of time series inputs. TSLM addresses the data scarcity problem in time series captioning in two steps: first, it generates synthetic data via in-context prompting; second, it denoises the generated data with a novel cross-modal dense retrieval scoring applied to time series-caption pairs. Experimental findings on various time series captioning datasets demonstrate that TSLM outperforms existing state-of-the-art approaches from multiple data modalities by a significant margin.
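
The denoising step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the scoring function, so everything here is an assumption: the embeddings are hypothetical precomputed vectors for each time series and its synthetic caption, the score is plain cosine similarity, and `filter_pairs` and its `threshold` are illustrative names rather than anything from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def filter_pairs(series_embs, caption_embs, threshold=0.5):
    """Return indices of (series, caption) pairs scoring at or above the threshold.

    Assumption: both embedding lists come from some shared cross-modal
    embedding space; how TSLM actually builds that space is not stated here.
    """
    kept = []
    for i, (s, c) in enumerate(zip(series_embs, caption_embs)):
        if cosine(s, c) >= threshold:
            kept.append(i)
    return kept


# Toy, hand-written embeddings: pair 1 is deliberately mismatched.
series_embs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
caption_embs = [[0.9, 0.1], [1.0, 0.0], [1.0, 1.0]]
kept = filter_pairs(series_embs, caption_embs, threshold=0.5)
print(kept)
```

With these toy vectors the mismatched pair is dropped and the two well-aligned pairs are kept; a real pipeline would tune the threshold on held-out data.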

Increasing Interpretability of Neural Networks By Approximating Human Visual Saliency

Oct 21, 2024

Aidan Boyd, Mohamed Trabelsi, Huseyin Uzunalioglu, Dan Kushnir

Abstract: Understanding specifically where a model focuses within an image is critical for human interpretability of the decision-making process. Deep learning-based solutions are prone to learning coincidental correlations in training datasets, causing overfitting and reducing explainability. Recent advances have shown that guiding models to human-defined regions of saliency within individual images significantly increases performance and interpretability. Human-guided models also exhibit greater generalization capabilities, as coincidental dataset features are avoided. Results show that models trained with saliency incorporation display an increase in interpretability of up to 30% over models trained without saliency information. The collection of this saliency information, however, can be costly, laborious, and in some cases infeasible. To address this limitation, we propose a combination strategy of saliency incorporation and active learning to reduce the human annotation data required by 80% while maintaining the interpretability and performance increase from human saliency. Extensive experimentation outlines the effectiveness of the proposed approach across five public datasets and six active learning criteria.
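
As an illustration of how an active learning criterion can decide which images to send for human saliency annotation, here is a minimal sketch. The abstract does not name its six criteria, so predictive entropy is used purely as one common, assumed example; `select_for_annotation` and `budget` are hypothetical names.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of a class-probability vector."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def select_for_annotation(predictions, budget):
    """Return indices of the `budget` samples with the highest predictive entropy.

    Assumption: `predictions` holds the current model's softmax outputs for
    unlabeled images; the most uncertain ones are queued for annotation.
    """
    ranked = sorted(
        range(len(predictions)),
        key=lambda i: entropy(predictions[i]),
        reverse=True,
    )
    return ranked[:budget]


# Toy softmax outputs for four unlabeled images (binary task).
predictions = [[0.98, 0.02], [0.55, 0.45], [0.80, 0.20], [0.50, 0.50]]
chosen = select_for_annotation(predictions, budget=2)
print(chosen)
```

The two images with near-uniform predictions are selected, which matches the intuition that annotation effort is best spent where the model is least certain.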

Absformer: Transformer-based Model for Unsupervised Multi-Document Abstractive Summarization

Jun 07, 2023

Mohamed Trabelsi, Huseyin Uzunalioglu

Abstract: Multi-document summarization (MDS) refers to the task of summarizing the text in multiple documents into a concise summary. The generated summary saves readers the time of reading many documents by providing the important content in the form of a few sentences. Abstractive MDS aims to generate a coherent and fluent summary for multiple documents using natural language generation techniques. In this paper, we consider the unsupervised abstractive MDS setting where there are only documents with no ground-truth summaries provided, and we propose Absformer, a new Transformer-based method for unsupervised abstractive summary generation. Our method consists of a first step where we pretrain a Transformer-based encoder using the masked language modeling (MLM) objective as the pretraining task in order to cluster the documents into semantically similar groups; and a second step where we train a Transformer-based decoder to generate abstractive summaries for the clusters of documents. To our knowledge, we are the first to successfully incorporate a Transformer-based model to solve the unsupervised abstractive MDS task. We evaluate our approach using three real-world datasets from different domains, and we demonstrate both substantial improvements in terms of evaluation metrics over state-of-the-art abstractive-based methods, and generalization to datasets from different domains.

* ICDAR 2023 International Workshop on Machine Learning (WML)
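
The first step of Absformer, grouping documents into semantically similar clusters, can be sketched as follows. The abstract does not say which clustering algorithm is applied to the encoder's output, so k-means is an assumption here, and the two-dimensional `doc_embs` vectors are hypothetical stand-ins for real encoder embeddings.

```python
def kmeans(points, k, iters=10):
    """Plain k-means; returns a cluster index for each point.

    Assumption: centroids are initialized from the first k points so the
    result is deterministic; real systems usually use a smarter init.
    """
    centroids = [list(p) for p in points[:k]]
    assign = [0] * len(points)
    for _ in range(iters):
        # Assignment step: attach each point to its nearest centroid.
        for i, p in enumerate(points):
            assign[i] = min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])),
            )
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [points[i] for i in range(len(points)) if assign[i] == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return assign


# Toy document embeddings forming two obvious groups.
doc_embs = [[0.0, 0.1], [5.0, 5.1], [0.2, 0.0], [5.2, 4.9]]
labels = kmeans(doc_embs, k=2)
print(labels)
```

Each resulting cluster would then be passed to the decoder as one summarization unit, which is the second step the abstract describes.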

Augmented Data Science: Towards Industrialization and Democratization of Data Science

Sep 12, 2019

Huseyin Uzunalioglu, Jin Cao, Chitra Phadke, Gerald Lehmann, Ahmet Akyamac, Ran He, Jeongran Lee, Maria Able

Abstract: Conversion of raw data into insights and knowledge requires substantial effort from data scientists. Despite breathtaking advances in Machine Learning (ML) and Artificial Intelligence (AI), data scientists still spend the majority of their effort understanding and then preparing the raw data for ML/AI. The effort is often manual and ad hoc, and requires some level of domain knowledge. The complexity of the effort increases dramatically when data diversity, both in form and context, increases. In this paper, we introduce our solution, Augmented Data Science (ADS), towards addressing this "human bottleneck" in creating value from diverse datasets. ADS is a data-driven approach that relies on statistics and ML to extract insights from any dataset in a domain-agnostic way to facilitate the data science process. Key features of ADS are the replacement of rudimentary data exploration and processing steps with automation and the augmentation of data scientist judgment with automatically generated insights. We present the building blocks of our end-to-end solution and provide a case study to exemplify its capabilities.
