Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Pedro Mercado

Saarland University

ChronosX: Adapting Pretrained Time Series Models with Exogenous Variables

Mar 15, 2025

Sebastian Pineda Arango, Pedro Mercado, Shubham Kapoor, Abdul Fatir Ansari, Lorenzo Stella, Huibin Shen, Hugo Senetaire, Caner Turkmen, Oleksandr Shchur, Danielle C. Maddix(+3 more)

Abstract:Covariates provide valuable information on external factors that influence time series and are critical in many real-world time series forecasting tasks. For example, in retail, covariates may indicate promotions or peak dates such as holiday seasons that heavily influence demand forecasts. Recent advances in pretraining large language model architectures for time series forecasting have led to highly accurate forecasters. However, the majority of these models do not readily use covariates as they are often specific to a certain task or domain. This paper introduces a new method to incorporate covariates into pretrained time series forecasting models. Our proposed approach incorporates covariate information into pretrained forecasting models through modular blocks that inject past and future covariate information, without necessarily modifying the pretrained model in consideration. In order to evaluate our approach, we introduce a benchmark composed of 32 different synthetic datasets with varying dynamics to evaluate the effectivity of forecasting models with covariates. Extensive evaluations on both synthetic and real datasets show that our approach effectively incorporates covariate information into pretrained models, outperforming existing baselines.

* Accepted at the 28th International Conference on Artificial Intelligence and Statistics (AISTATS), 2025

Via

Access Paper or Ask Questions

Chronos: Learning the Language of Time Series

Mar 12, 2024

Abdul Fatir Ansari, Lorenzo Stella, Caner Turkmen, Xiyuan Zhang, Pedro Mercado, Huibin Shen, Oleksandr Shchur, Syama Sundar Rangapuram, Sebastian Pineda Arango, Shubham Kapoor(+7 more)

Abstract:We introduce Chronos, a simple yet effective framework for pretrained probabilistic time series models. Chronos tokenizes time series values using scaling and quantization into a fixed vocabulary and trains existing transformer-based language model architectures on these tokenized time series via the cross-entropy loss. We pretrained Chronos models based on the T5 family (ranging from 20M to 710M parameters) on a large collection of publicly available datasets, complemented by a synthetic dataset that we generated via Gaussian processes to improve generalization. In a comprehensive benchmark consisting of 42 datasets, and comprising both classical local models and deep learning methods, we show that Chronos models: (a) significantly outperform other methods on datasets that were part of the training corpus; and (b) have comparable and occasionally superior zero-shot performance on new datasets, relative to methods that were trained specifically on them. Our results demonstrate that Chronos models can leverage time series data from diverse domains to improve zero-shot accuracy on unseen forecasting tasks, positioning pretrained models as a viable tool to greatly simplify forecasting pipelines.

* Inference code and model checkpoints available at https://github.com/amazon-science/chronos-forecasting

Via

Access Paper or Ask Questions

Generalized Matrix Means for Semi-Supervised Learning with Multilayer Graphs

Oct 30, 2019

Pedro Mercado, Francesco Tudisco, Matthias Hein

Figure 1 for Generalized Matrix Means for Semi-Supervised Learning with Multilayer Graphs

Figure 2 for Generalized Matrix Means for Semi-Supervised Learning with Multilayer Graphs

Figure 3 for Generalized Matrix Means for Semi-Supervised Learning with Multilayer Graphs

Figure 4 for Generalized Matrix Means for Semi-Supervised Learning with Multilayer Graphs

Abstract:We study the task of semi-supervised learning on multilayer graphs by taking into account both labeled and unlabeled observations together with the information encoded by each individual graph layer. We propose a regularizer based on the generalized matrix mean, which is a one-parameter family of matrix means that includes the arithmetic, geometric and harmonic means as particular cases. We analyze it in expectation under a Multilayer Stochastic Block Model and verify numerically that it outperforms state of the art methods. Moreover, we introduce a matrix-free numerical scheme based on contour integral quadratures and Krylov subspace solvers that scales to large sparse multilayer graphs.

* Accepted in NeurIPS 2019

Via

Access Paper or Ask Questions

Spectral Clustering of Signed Graphs via Matrix Power Means

May 15, 2019

Pedro Mercado, Francesco Tudisco, Matthias Hein

Figure 1 for Spectral Clustering of Signed Graphs via Matrix Power Means

Figure 2 for Spectral Clustering of Signed Graphs via Matrix Power Means

Figure 3 for Spectral Clustering of Signed Graphs via Matrix Power Means

Figure 4 for Spectral Clustering of Signed Graphs via Matrix Power Means

Abstract:Signed graphs encode positive (attractive) and negative (repulsive) relations between nodes. We extend spectral clustering to signed graphs via the one-parameter family of Signed Power Mean Laplacians, defined as the matrix power mean of normalized standard and signless Laplacians of positive and negative edges. We provide a thorough analysis of the proposed approach in the setting of a general Stochastic Block Model that includes models such as the Labeled Stochastic Block Model and the Censored Block Model. We show that in expectation the signed power mean Laplacian captures the ground truth clusters under reasonable settings where state-of-the-art approaches fail. Moreover, we prove that the eigenvalues and eigenvector of the signed power mean Laplacian concentrate around their expectation under reasonable conditions in the general Stochastic Block Model. Extensive experiments on random graphs and real world datasets confirm the theoretically predicted behaviour of the signed power mean Laplacian and show that it compares favourably with state-of-the-art methods.

* final version accepted at ICML 2019

Via

Access Paper or Ask Questions

Community detection in networks via nonlinear modularity eigenvectors

Sep 12, 2018

Francesco Tudisco, Pedro Mercado, Matthias Hein

Figure 1 for Community detection in networks via nonlinear modularity eigenvectors

Figure 2 for Community detection in networks via nonlinear modularity eigenvectors

Figure 3 for Community detection in networks via nonlinear modularity eigenvectors

Figure 4 for Community detection in networks via nonlinear modularity eigenvectors

Abstract:Revealing a community structure in a network or dataset is a central problem arising in many scientific areas. The modularity function $Q$ is an established measure quantifying the quality of a community, being identified as a set of nodes having high modularity. In our terminology, a set of nodes with positive modularity is called a \textit{module} and a set that maximizes $Q$ is thus called \textit{leading module}. Finding a leading module in a network is an important task, however the dimension of real-world problems makes the maximization of $Q$ unfeasible. This poses the need of approximation techniques which are typically based on a linear relaxation of $Q$, induced by the spectrum of the modularity matrix $M$. In this work we propose a nonlinear relaxation which is instead based on the spectrum of a nonlinear modularity operator $\mathcal M$. We show that extremal eigenvalues of $\mathcal M$ provide an exact relaxation of the modularity measure $Q$, however at the price of being more challenging to be computed than those of $M$. Thus we extend the work made on nonlinear Laplacians, by proposing a computational scheme, named \textit{generalized RatioDCA}, to address such extremal eigenvalues. We show monotonic ascent and convergence of the method. We finally apply the new method to several synthetic and real-world data sets, showing both effectiveness of the model and performance of the method.

* SIAM J. Applied Mathematics, 78:2393--2419, 2018

Via

Access Paper or Ask Questions

Node classification for signed networks using diffuse interface methods

Sep 07, 2018

Jessica Bosch, Pedro Mercado, Martin Stoll

Figure 1 for Node classification for signed networks using diffuse interface methods

Figure 2 for Node classification for signed networks using diffuse interface methods

Figure 3 for Node classification for signed networks using diffuse interface methods

Figure 4 for Node classification for signed networks using diffuse interface methods

Abstract:Signed networks are a crucial tool when modeling friend and foe relationships. In contrast to classical undirected, weighted graphs, the edge weights for signed graphs are positive and negative. Crucial network properties are often derived from the study of the associated graph Laplacians. We here study several different signed network Laplacians with a focus on the task of classifying the nodes of the graph. We here extend a recently introduced technique based on a partial differential equation defined on the signed network, namely the Allen-Cahn-equation, to classify the nodes into two or more classes. We illustrate the performance of this approach on several real-world networks.

Via

Access Paper or Ask Questions

The Power Mean Laplacian for Multilayer Graph Clustering

Mar 01, 2018

Pedro Mercado, Antoine Gautier, Francesco Tudisco, Matthias Hein

Figure 1 for The Power Mean Laplacian for Multilayer Graph Clustering

Figure 2 for The Power Mean Laplacian for Multilayer Graph Clustering

Figure 3 for The Power Mean Laplacian for Multilayer Graph Clustering

Figure 4 for The Power Mean Laplacian for Multilayer Graph Clustering

Abstract:Multilayer graphs encode different kind of interactions between the same set of entities. When one wants to cluster such a multilayer graph, the natural question arises how one should merge the information different layers. We introduce in this paper a one-parameter family of matrix power means for merging the Laplacians from different layers and analyze it in expectation in the stochastic block model. We show that this family allows to recover ground truth clusters under different settings and verify this in real world data. While computing the matrix power mean can be very expensive for large graphs, we introduce a numerical scheme to efficiently compute its eigenvectors for the case of large sparse graphs.

* 19 pages, 3 figures. Accepted in Artificial Intelligence and Statistics (AISTATS), 2018

Via

Access Paper or Ask Questions

Clustering Signed Networks with the Geometric Mean of Laplacians

Jan 03, 2017

Pedro Mercado, Francesco Tudisco, Matthias Hein

Figure 1 for Clustering Signed Networks with the Geometric Mean of Laplacians

Figure 2 for Clustering Signed Networks with the Geometric Mean of Laplacians

Figure 3 for Clustering Signed Networks with the Geometric Mean of Laplacians

Figure 4 for Clustering Signed Networks with the Geometric Mean of Laplacians

Abstract:Signed networks allow to model positive and negative relationships. We analyze existing extensions of spectral clustering to signed networks. It turns out that existing approaches do not recover the ground truth clustering in several situations where either the positive or the negative network structures contain no noise. Our analysis shows that these problems arise as existing approaches take some form of arithmetic mean of the Laplacians of the positive and negative part. As a solution we propose to use the geometric mean of the Laplacians of positive and negative part and show that it outperforms the existing approaches. While the geometric mean of matrices is computationally expensive, we show that eigenvectors of the geometric mean can be computed efficiently, leading to a numerical scheme for sparse matrices which is of independent interest.

* Advances in Neural Information Processing Systems 29, pp.4421--4429, 2016
* 14 pages, 5 figures. Accepted in Neural Information Processing Systems (NIPS), 2016

Via

Access Paper or Ask Questions