Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:FlashP: An Analytical Pipeline for Real-time Forecasting of Time-Series Relational Data

Jan 16, 2021

Shuyuan Yan, Bolin Ding, Wei Guo, Jingren Zhou, Zhewei Wei, Xiaowei Jiang, Sheng Xu

Figure 1 for FlashP: An Analytical Pipeline for Real-time Forecasting of Time-Series Relational Data

Figure 2 for FlashP: An Analytical Pipeline for Real-time Forecasting of Time-Series Relational Data

Figure 3 for FlashP: An Analytical Pipeline for Real-time Forecasting of Time-Series Relational Data

Figure 4 for FlashP: An Analytical Pipeline for Real-time Forecasting of Time-Series Relational Data

Share this with someone who'll enjoy it:

Abstract:Interactive response time is important in analytical pipelines for users to explore a sufficient number of possibilities and make informed business decisions. We consider a forecasting pipeline with large volumes of high-dimensional time series data. Real-time forecasting can be conducted in two steps. First, we specify the part of data to be focused on and the measure to be predicted by slicing, dicing, and aggregating the data. Second, a forecasting model is trained on the aggregated results to predict the trend of the specified measure. While there are a number of forecasting models available, the first step is the performance bottleneck. A natural idea is to utilize sampling to obtain approximate aggregations in real time as the input to train the forecasting model. Our scalable real-time forecasting system FlashP (Flash Prediction) is built based on this idea, with two major challenges to be resolved in this paper: first, we need to figure out how approximate aggregations affect the fitting of forecasting models, and forecasting results; and second, accordingly, what sampling algorithms we should use to obtain these approximate aggregations and how large the samples are. We introduce a new sampling scheme, called GSW sampling, and analyze error bounds for estimating aggregations using GSW samples. We introduce how to construct compact GSW samples with the existence of multiple measures to be analyzed. We conduct experiments to evaluate our solution and compare it with alternatives on real data.

View paper on

Share this with someone who'll enjoy it:

Title:FlashP: An Analytical Pipeline for Real-time Forecasting of Time-Series Relational Data

Paper and Code