Abstract: Flow matching has recently emerged as a powerful paradigm for generative modeling and has been extended to probabilistic time series forecasting in latent spaces. However, the impact of the specific choice of probability path model on forecasting performance remains under-explored. In this work, we demonstrate that forecasting spatio-temporal data with flow matching is highly sensitive to the selection of the probability path model. Motivated by this insight, we propose a novel probability path model designed to improve forecasting performance. Our empirical results across various dynamical system benchmarks show that our model achieves faster convergence during training and improved predictive performance compared to existing probability path models. Importantly, our approach is efficient during inference, requiring only a few sampling steps. This makes our proposed model practical for real-world applications and opens new avenues for probabilistic forecasting.
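For context, the sketch below shows flow matching with the standard linear (optimal-transport) conditional probability path, which is the kind of baseline path model the abstract compares against; the paper's proposed path is not specified here and is not reproduced. `VelocityNet` is a hypothetical stand-in for any velocity network $v_\theta(x_t, t)$.

```python
# Minimal sketch of conditional flow matching with the standard linear
# probability path x_t = (1 - t) * x0 + t * x1. This illustrates the baseline
# family of path models only, not the paper's proposed path.
import torch
import torch.nn as nn

class VelocityNet(nn.Module):
    """Hypothetical velocity network v_theta(x_t, t)."""
    def __init__(self, dim):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(dim + 1, 128), nn.SiLU(), nn.Linear(128, dim)
        )

    def forward(self, x_t, t):
        return self.net(torch.cat([x_t, t], dim=-1))

def flow_matching_loss(model, x1):
    """x1: batch of (latent) data points, shape (B, D)."""
    x0 = torch.randn_like(x1)          # noise endpoint of the path
    t = torch.rand(x1.shape[0], 1)     # random time in [0, 1]
    x_t = (1 - t) * x0 + t * x1        # linear probability path
    v_target = x1 - x0                 # its (constant) target velocity
    return ((model(x_t, t) - v_target) ** 2).mean()

model = VelocityNet(dim=8)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
x1 = torch.randn(64, 8)                # placeholder for latent targets
opt.zero_grad()
loss = flow_matching_loss(model, x1)
loss.backward()
opt.step()
```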
Abstract: We systematically describe the problem of simultaneous surrogate modeling of mixed variables (i.e., continuous, integer, and categorical variables) in the Bayesian optimization (BO) context. We provide a unified hybrid model, combining Monte Carlo tree search (MCTS) and Gaussian processes (GPs), that encompasses and generalizes multiple state-of-the-art mixed-variable BO surrogates. Building on this architecture, we propose a new dynamic model selection criterion over novel candidate families of covariance kernels, including non-stationary kernels and associated families. We study a range of benchmark problems; the results highlight the effectiveness of our method compared to most state-of-the-art mixed-variable BO methods.
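As a point of reference, the sketch below shows one common mixed-variable covariance kernel of the kind such surrogates select among: an RBF kernel on the continuous dimensions multiplied by a Hamming-style kernel on the categorical dimensions. The MCTS layer and the paper's dynamic model-selection criterion are not reproduced; `mixed_kernel` and its parameters are illustrative names, not the paper's API.

```python
# A minimal mixed-variable GP kernel: RBF over continuous columns times an
# exchangeable (Hamming) kernel over integer-coded categorical columns.
import numpy as np

def mixed_kernel(X1, X2, cont_idx, cat_idx, lengthscale=1.0, cat_scale=1.0):
    """X1: (n, d), X2: (m, d). Columns in cont_idx are continuous;
    columns in cat_idx hold integer-coded categories."""
    c1, c2 = X1[:, cont_idx], X2[:, cont_idx]
    sq = ((c1[:, None, :] - c2[None, :, :]) ** 2).sum(-1)
    k_cont = np.exp(-0.5 * sq / lengthscale**2)
    # Hamming kernel: decay with the number of mismatched categories.
    mismatches = (X1[:, None, cat_idx] != X2[None, :, cat_idx]).sum(-1)
    k_cat = np.exp(-cat_scale * mismatches)
    return k_cont * k_cat

# Example: one continuous column, one categorical column.
X = np.array([[0.1, 2], [0.3, 1], [0.2, 2]], dtype=float)
K = mixed_kernel(X, X, cont_idx=[0], cat_idx=[1])
```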
Abstract: Building surrogate models is a common approach to learning unknown black-box functions. Bayesian optimization provides a framework for building surrogate models from sequential samples drawn from the function and finding its optimum. One specific and important application is tuning algorithmic parameters to optimize the performance of large, complicated "black-box" application codes. Within the Bayesian optimization framework, the Gaussian process model produces smooth or continuous sample paths, but the black-box function in the tuning problem is often non-smooth. This difficult tuning problem is worsened by the fact that we usually have only limited sequential samples from the black-box function. Motivated by these issues, we propose a novel additive Gaussian process model, the clustered Gaussian process (cGP), whose additive components are induced by clustering and which is designed to capture the non-smoothness of the black-box function. In addition to an algorithm for constructing this model, we apply it to several artificial and real applications to evaluate it; in the examples we studied, performance improved by as much as 90% across repeated experiments.
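The sketch below illustrates the clustered-GP idea in simplified form: partition the samples, fit an independent GP on each cluster, and predict with the GP of the assigned cluster. The paper's actual cGP constructs the clusters as part of the model; k-means on the inputs here is only a stand-in for that clustering step, and the function names are hypothetical.

```python
# Simplified clustered-GP sketch: a non-smooth step function is fit poorly by
# one smooth GP, but well by separate GPs on each cluster (plateau).
import numpy as np
from sklearn.cluster import KMeans
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF

def fit_cgp(X, y, n_clusters=2):
    km = KMeans(n_clusters=n_clusters, n_init=10).fit(X)
    gps = []
    for k in range(n_clusters):
        mask = km.labels_ == k
        gp = GaussianProcessRegressor(kernel=RBF(), alpha=1e-6)
        gps.append(gp.fit(X[mask], y[mask]))
    return km, gps

def predict_cgp(km, gps, X_new):
    labels = km.predict(X_new)         # route each point to its cluster's GP
    y_pred = np.empty(len(X_new))
    for k, gp in enumerate(gps):
        mask = labels == k
        if mask.any():
            y_pred[mask] = gp.predict(X_new[mask])
    return y_pred

# A step function is non-smooth at 0.5; each cluster's GP fits one plateau.
X = np.linspace(0, 1, 40).reshape(-1, 1)
y = np.where(X[:, 0] < 0.5, 0.0, 1.0)
km, gps = fit_cgp(X, y)
```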
Abstract: Multitask learning and transfer learning have proven useful in machine learning when additional knowledge is available to aid a prediction task. We derive methods following these paradigms for use in autotuning, where the goal is to find the optimal performance parameters of an application treated as a black-box function. In comparisons with state-of-the-art autotuning techniques, we observe an average $1.5\times$ improvement in application runtime over the OpenTuner and HpBandSter autotuners. We explain why our approaches can be more suitable than some state-of-the-art autotuners for tuning applications in general, and expensive exascale applications in particular.
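To make the multitask-surrogate idea concrete, the sketch below shows one standard way to share information across tuning tasks: an intrinsic-coregionalization GP, where the covariance between configuration $x$ on task $i$ and $x'$ on task $j$ factors as $B_{ij}\,k(x, x')$. The abstract does not specify its exact model, so this is only an assumed illustration of the underlying transfer mechanism; the task-correlation matrix `B` and all function names are hypothetical.

```python
# Cheap runs on a source task inform predictions on an expensive target task
# through an assumed task-covariance matrix B (intrinsic coregionalization).
import numpy as np

def rbf(X1, X2, ls=1.0):
    sq = ((X1[:, None, :] - X2[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * sq / ls**2)

def multitask_posterior_mean(X, tasks, y, X_new, task_new, B, noise=1e-4):
    """tasks: integer task label per training row; B: (T, T) task covariance."""
    K = rbf(X, X) * B[np.ix_(tasks, tasks)]
    k_star = rbf(X_new, X) * B[np.ix_([task_new] * len(X_new), tasks)]
    alpha = np.linalg.solve(K + noise * np.eye(len(X)), y)
    return k_star @ alpha

B = np.array([[1.0, 0.8],               # assumed correlation between a cheap
              [0.8, 1.0]])              # source task and the expensive target
X = np.random.rand(10, 3)
tasks = np.array([0] * 8 + [1] * 2)     # 8 source runs, 2 target runs
y = np.random.rand(10)
mu = multitask_posterior_mean(X, tasks, y, np.random.rand(4, 3), 1, B)
```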
Abstract: Computational efficiency is an important consideration when deploying machine learning models for time series prediction in an online setting. Machine learning algorithms adjust model parameters automatically based on the data, but often require users to set additional parameters, known as hyperparameters, which can significantly impact prediction accuracy. Traffic measurements, typically collected online by sensors, are serially correlated, and their distribution may change gradually. A typical adaptation strategy is to periodically re-tune the model hyperparameters, at the cost of added computation. In this work, we present an efficient and principled online hyperparameter optimization algorithm for kernel ridge regression applied to traffic prediction problems. In tests with real traffic measurement data, our approach requires as little as one-seventh of the computation time of other tuning methods while achieving better or similar prediction accuracy.
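For orientation, the sketch below shows a simple baseline version of periodic re-tuning for kernel ridge regression on a sliding data window: it selects the ridge penalty by the closed-form leave-one-out residuals, which for KRR follow from a single matrix factorization. The paper's actual algorithm is more efficient and principled than this grid search; the function names and candidate grid are illustrative assumptions.

```python
# Baseline periodic re-tuning for kernel ridge regression: pick the ridge
# penalty minimizing the closed-form leave-one-out error on a recent window.
import numpy as np

def rbf(X1, X2, ls=1.0):
    sq = ((X1[:, None, :] - X2[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * sq / ls**2)

def loo_error(K, y, lam):
    """Closed-form KRR leave-one-out MSE: e_i = (y_i - yhat_i) / (1 - H_ii),
    with hat matrix H = K (K + lam I)^{-1}."""
    n = len(y)
    H = K @ np.linalg.inv(K + lam * np.eye(n))
    residuals = (y - H @ y) / (1.0 - np.diag(H))
    return np.mean(residuals**2)

def retune(X_window, y_window, lambdas=(1e-3, 1e-2, 1e-1, 1.0)):
    K = rbf(X_window, X_window)
    return min(lambdas, key=lambda lam: loo_error(K, y_window, lam))

# Online use: call retune() every `period` new measurements rather than at
# every step, trading a small accuracy lag for much less computation.
X = np.random.rand(50, 2)
y = np.random.rand(50)
best_lam = retune(X, y)
```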