Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Adam M. Sykulski

SplitWise Regression: Stepwise Modeling with Adaptive Dummy Encoding

May 21, 2025

Marcell T. Kurbucz, Nikolaos Tzivanakis, Nilufer Sari Aslam, Adam M. Sykulski

Abstract:Capturing nonlinear relationships without sacrificing interpretability remains a persistent challenge in regression modeling. We introduce SplitWise, a novel framework that enhances stepwise regression. It adaptively transforms numeric predictors into threshold-based binary features using shallow decision trees, but only when such transformations improve model fit, as assessed by the Akaike Information Criterion (AIC) or Bayesian Information Criterion (BIC). This approach preserves the transparency of linear models while flexibly capturing nonlinear effects. Implemented as a user-friendly R package, SplitWise is evaluated on both synthetic and real-world datasets. The results show that it consistently produces more parsimonious and generalizable models than traditional stepwise and penalized regression techniques.

* 15 pages, 1 figure, 3 tables

Via

Access Paper or Ask Questions

Multivariate Probabilistic Regression with Natural Gradient Boosting

Jun 07, 2021

Michael O'Malley, Adam M. Sykulski, Rick Lumpkin, Alejandro Schuler

Figure 1 for Multivariate Probabilistic Regression with Natural Gradient Boosting

Figure 2 for Multivariate Probabilistic Regression with Natural Gradient Boosting

Figure 3 for Multivariate Probabilistic Regression with Natural Gradient Boosting

Figure 4 for Multivariate Probabilistic Regression with Natural Gradient Boosting

Abstract:Many single-target regression problems require estimates of uncertainty along with the point predictions. Probabilistic regression algorithms are well-suited for these tasks. However, the options are much more limited when the prediction target is multivariate and a joint measure of uncertainty is required. For example, in predicting a 2D velocity vector a joint uncertainty would quantify the probability of any vector in the plane, which would be more expressive than two separate uncertainties on the x- and y- components. To enable joint probabilistic regression, we propose a Natural Gradient Boosting (NGBoost) approach based on nonparametrically modeling the conditional parameters of the multivariate predictive distribution. Our method is robust, works out-of-the-box without extensive tuning, is modular with respect to the assumed target distribution, and performs competitively in comparison to existing approaches. We demonstrate these claims in simulation and with a case study predicting two-dimensional oceanographic velocity data. An implementation of our method is available at https://github.com/stanfordmlgroup/ngboost.

Via

Access Paper or Ask Questions

Efficient Parameter Estimation of Sampled Random Fields

Jul 15, 2019

Arthur P. Guillaumin, Adam M. Sykulski, Sofia C. Olhede, Frederik J. Simons

Figure 1 for Efficient Parameter Estimation of Sampled Random Fields

Figure 2 for Efficient Parameter Estimation of Sampled Random Fields

Figure 3 for Efficient Parameter Estimation of Sampled Random Fields

Figure 4 for Efficient Parameter Estimation of Sampled Random Fields

Abstract:We provide a computationally and statistically efficient method for estimating the parameters of a stochastic Gaussian model observed on a spatial grid, which need not be rectangular. Standard methods are plagued by computational intractability, where designing methods that can be implemented for realistically sized problems has been an issue for a long time. This has motivated the use of the Fourier Transform and the Whittle likelihood approximation. The challenge of frequency-domain methods is to determine and account for observational boundary effects, missing data, and the shape of the observed spatial grid. In this paper we address these effects explicitly by proposing a new quasi-likelihood estimator. We prove consistency and asymptotic normality of our estimator, and show that the proposed method solves boundary issues with Whittle estimation for finite samples, yielding parameter estimates with significantly reduced bias and error. We demonstrate the effectiveness of our method for incomplete lattices, in comparison to other recent methods. Finally, we apply our method to estimate the parameters of a Mat\'ern process used to model data from Venus' topography.

Via

Access Paper or Ask Questions

Smoothing and Interpolating Noisy GPS Data with Smoothing Splines

Apr 26, 2019

Jeffrey J. Early, Adam M. Sykulski

Figure 1 for Smoothing and Interpolating Noisy GPS Data with Smoothing Splines

Figure 2 for Smoothing and Interpolating Noisy GPS Data with Smoothing Splines

Figure 3 for Smoothing and Interpolating Noisy GPS Data with Smoothing Splines

Figure 4 for Smoothing and Interpolating Noisy GPS Data with Smoothing Splines

Abstract:A comprehensive methodology is provided for smoothing noisy, irregularly sampled data with non-Gaussian noise using smoothing splines. We demonstrate how the spline order and tension parameter can be chosen \emph{a priori} from physical reasoning. We also show how to allow for non-Gaussian noise and outliers which are typical in GPS signals. We demonstrate the effectiveness of our methods on GPS trajectory data obtained from oceanographic floating instruments known as drifters.

* 14 pages, 8 figures

Via

Access Paper or Ask Questions

The De-Biased Whittle Likelihood

Sep 12, 2018

Adam M. Sykulski, Sofia C. Olhede, Arthur P. Guillaumin, Jonathan M. Lilly, Jeffrey J. Early

Figure 1 for The De-Biased Whittle Likelihood

Figure 2 for The De-Biased Whittle Likelihood

Figure 3 for The De-Biased Whittle Likelihood

Figure 4 for The De-Biased Whittle Likelihood

Abstract:The Whittle likelihood is a widely used and computationally efficient pseudo-likelihood. However, it is known to produce biased parameter estimates for large classes of models. We propose a method for de-biasing Whittle estimates for second-order stationary stochastic processes. The de-biased Whittle likelihood can be computed in the same $\mathcal{O}(n\log n)$ operations as the standard approach. We demonstrate the superior performance of the method in simulation studies and in application to a large-scale oceanographic dataset, where in both cases the de-biased approach reduces bias by up to two orders of magnitude, achieving estimates that are close to exact maximum likelihood, at a fraction of the computational cost. We prove that the method yields estimates that are consistent at an optimal convergence rate of $n^{-1/2}$, under weaker assumptions than standard theory, where we do not require that the power spectral density is continuous in frequency. We describe how the method can be easily combined with standard methods of bias reduction, such as tapering and differencing, to further reduce bias in parameter estimates.

* To appear shortly in Biometrika. Full published version includes extensions of theory to non-Gaussian processes, and new simulation examples with an AR(4) and non-Gaussian process

Via

Access Paper or Ask Questions

Exact Simulation of Noncircular or Improper Complex-Valued Stationary Gaussian Processes using Circulant Embedding

Mar 15, 2017

Adam M. Sykulski, Donald B. Percival

Figure 1 for Exact Simulation of Noncircular or Improper Complex-Valued Stationary Gaussian Processes using Circulant Embedding

Figure 2 for Exact Simulation of Noncircular or Improper Complex-Valued Stationary Gaussian Processes using Circulant Embedding

Figure 3 for Exact Simulation of Noncircular or Improper Complex-Valued Stationary Gaussian Processes using Circulant Embedding

Abstract:This paper provides an algorithm for simulating improper (or noncircular) complex-valued stationary Gaussian processes. The technique utilizes recently developed methods for multivariate Gaussian processes from the circulant embedding literature. The method can be performed in $\mathcal{O}(n\log_2 n)$ operations, where $n$ is the length of the desired sequence. The method is exact, except when eigenvalues of prescribed circulant matrices are negative. We evaluate the performance of the algorithm empirically, and provide a practical example where the method is guaranteed to be exact for all $n$, with an improper fractional Gaussian noise process.

* 2016 IEEE 26th International Workshop on Machine Learning for Signal Processing (MLSP)
* Link to published version: http://ieeexplore.ieee.org/document/7738840/

Via

Access Paper or Ask Questions

Frequency-Domain Stochastic Modeling of Stationary Bivariate or Complex-Valued Signals

Mar 15, 2017

Adam M. Sykulski, Sofia C. Olhede, Jonathan M. Lilly, Jeffrey J. Early

Figure 1 for Frequency-Domain Stochastic Modeling of Stationary Bivariate or Complex-Valued Signals

Figure 2 for Frequency-Domain Stochastic Modeling of Stationary Bivariate or Complex-Valued Signals

Figure 3 for Frequency-Domain Stochastic Modeling of Stationary Bivariate or Complex-Valued Signals

Figure 4 for Frequency-Domain Stochastic Modeling of Stationary Bivariate or Complex-Valued Signals

Abstract:There are three equivalent ways of representing two jointly observed real-valued signals: as a bivariate vector signal, as a single complex-valued signal, or as two analytic signals known as the rotary components. Each representation has unique advantages depending on the system of interest and the application goals. In this paper we provide a joint framework for all three representations in the context of frequency-domain stochastic modeling. This framework allows us to extend many established statistical procedures for bivariate vector time series to complex-valued and rotary representations. These include procedures for parametrically modeling signal coherence, estimating model parameters using the Whittle likelihood, performing semi-parametric modeling, and choosing between classes of nested models using model choice. We also provide a new method of testing for impropriety in complex-valued signals, which tests for noncircular or anisotropic second-order statistical structure when the signal is represented in the complex plane. Finally, we demonstrate the usefulness of our methodology in capturing the anisotropic structure of signals observed from fluid dynamic simulations of turbulence.

* IEEE Transactions on Signal Processing, 2017
* To appear in IEEE Transactions on Signal Processing

Via

Access Paper or Ask Questions

Automated Planning in Repeated Adversarial Games

Mar 15, 2012

Enrique Munoz de Cote, Archie C. Chapman, Adam M. Sykulski, Nicholas R. Jennings

Figure 1 for Automated Planning in Repeated Adversarial Games

Figure 2 for Automated Planning in Repeated Adversarial Games

Figure 3 for Automated Planning in Repeated Adversarial Games

Figure 4 for Automated Planning in Repeated Adversarial Games

Abstract:Game theory's prescriptive power typically relies on full rationality and/or self-play interactions. In contrast, this work sets aside these fundamental premises and focuses instead on heterogeneous autonomous interactions between two or more agents. Specifically, we introduce a new and concise representation for repeated adversarial (constant-sum) games that highlight the necessary features that enable an automated planing agent to reason about how to score above the game's Nash equilibrium, when facing heterogeneous adversaries. To this end, we present TeamUP, a model-based RL algorithm designed for learning and planning such an abstraction. In essence, it is somewhat similar to R-max with a cleverly engineered reward shaping that treats exploration as an adversarial optimization problem. In practice, it attempts to find an ally with which to tacitly collude (in more than two-player games) and then collaborates on a joint plan of actions that can consistently score a high utility in adversarial repeated games. We use the inaugural Lemonade Stand Game Tournament to demonstrate the effectiveness of our approach, and find that TeamUP is the best performing agent, demoting the Tournament's actual winning strategy into second place. In our experimental analysis, we show hat our strategy successfully and consistently builds collaborations with many different heterogeneous (and sometimes very sophisticated) adversaries.

* Appears in Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence (UAI2010)

Via

Access Paper or Ask Questions