Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:iTransformer: Inverted Transformers Are Effective for Time Series Forecasting

Oct 10, 2023

Yong Liu, Tengge Hu, Haoran Zhang, Haixu Wu, Shiyu Wang, Lintao Ma, Mingsheng Long

Figure 1 for iTransformer: Inverted Transformers Are Effective for Time Series Forecasting

Figure 2 for iTransformer: Inverted Transformers Are Effective for Time Series Forecasting

Figure 3 for iTransformer: Inverted Transformers Are Effective for Time Series Forecasting

Figure 4 for iTransformer: Inverted Transformers Are Effective for Time Series Forecasting

Share this with someone who'll enjoy it:

Abstract:The recent boom of linear forecasting models questions the ongoing passion for architectural modifications of Transformer-based forecasters. These forecasters leverage Transformers to model the global dependencies over temporal tokens of time series, with each token formed by multiple variates of the same timestamp. However, Transformer is challenged in forecasting series with larger lookback windows due to performance degradation and computation explosion. Besides, the unified embedding for each temporal token fuses multiple variates with potentially unaligned timestamps and distinct physical measurements, which may fail in learning variate-centric representations and result in meaningless attention maps. In this work, we reflect on the competent duties of Transformer components and repurpose the Transformer architecture without any adaptation on the basic components. We propose iTransformer that simply inverts the duties of the attention mechanism and the feed-forward network. Specifically, the time points of individual series are embedded into variate tokens which are utilized by the attention mechanism to capture multivariate correlations; meanwhile, the feed-forward network is applied for each variate token to learn nonlinear representations. The iTransformer model achieves consistent state-of-the-art on several real-world datasets, which further empowers the Transformer family with promoted performance, generalization ability across different variates, and better utilization of arbitrary lookback windows, making it a nice alternative as the fundamental backbone of time series forecasting.

View paper on

Share this with someone who'll enjoy it:

Title:iTransformer: Inverted Transformers Are Effective for Time Series Forecasting

Paper and Code