Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Alexander T. Ihler

Forecasting Local Ionospheric Parameters Using Transformers

Feb 20, 2025

Daniel J. Alford-Lago, Christopher W. Curtis, Alexander T. Ihler, Katherine A. Zawdie, Douglas P. Drob

Abstract:We present a novel method for forecasting key ionospheric parameters using transformer-based neural networks. The model provides accurate forecasts and uncertainty quantification of the F2-layer peak plasma frequency (foF2), the F2-layer peak density height (hmF2), and total electron content (TEC) for a given geographic location. It supports a number of exogenous variables, including F10.7cm solar flux and disturbance storm time (Dst). We demonstrate how transformers can be trained in a data assimilation-like fashion that use these exogenous variables along with na\"ive predictions from climatology to generate 24-hour forecasts with non-parametric uncertainty bounds. We call this method the Local Ionospheric Forecast Transformer (LIFT). We demonstrate that the trained model can generalize to new geographic locations and time periods not seen during training, and we compare its performance to that of the International Reference Ionosphere (IRI).

* 47 pages, 42 figures

Via

Access Paper or Ask Questions

A Cluster-Cumulant Expansion at the Fixed Points of Belief Propagation

Oct 16, 2012

Max Welling, Andrew E. Gelfand, Alexander T. Ihler

Figure 1 for A Cluster-Cumulant Expansion at the Fixed Points of Belief Propagation

Figure 2 for A Cluster-Cumulant Expansion at the Fixed Points of Belief Propagation

Figure 3 for A Cluster-Cumulant Expansion at the Fixed Points of Belief Propagation

Figure 4 for A Cluster-Cumulant Expansion at the Fixed Points of Belief Propagation

Abstract:We introduce a new cluster-cumulant expansion (CCE) based on the fixed points of iterative belief propagation (IBP). This expansion is similar in spirit to the loop-series (LS) recently introduced in [1]. However, in contrast to the latter, the CCE enjoys the following important qualities: 1) it is defined for arbitrary state spaces 2) it is easily extended to fixed points of generalized belief propagation (GBP), 3) disconnected groups of variables will not contribute to the CCE and 4) the accuracy of the expansion empirically improves upon that of the LS. The CCE is based on the same M\"obius transform as the Kikuchi approximation, but unlike GBP does not require storing the beliefs of the GBP-clusters nor does it suffer from convergence issues during belief updating.

* Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

Via

Access Paper or Ask Questions

Belief Propagation for Structured Decision Making

Oct 16, 2012

Qiang Liu, Alexander T. Ihler

Figure 1 for Belief Propagation for Structured Decision Making

Figure 2 for Belief Propagation for Structured Decision Making

Figure 3 for Belief Propagation for Structured Decision Making

Figure 4 for Belief Propagation for Structured Decision Making

Abstract:Variational inference algorithms such as belief propagation have had tremendous impact on our ability to learn and use graphical models, and give many insights for developing or understanding exact and approximate inference. However, variational approaches have not been widely adoped for decision making in graphical models, often formulated through influence diagrams and including both centralized and decentralized (or multi-agent) decisions. In this work, we present a general variational framework for solving structured cooperative decision-making problems, use it to propose several belief propagation-like algorithms, and analyze them both theoretically and empirically.

* Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

Via

Access Paper or Ask Questions

Join-graph based cost-shifting schemes

Oct 16, 2012

Alexander T. Ihler, Natalia Flerova, Rina Dechter, Lars Otten

Figure 1 for Join-graph based cost-shifting schemes

Figure 2 for Join-graph based cost-shifting schemes

Figure 3 for Join-graph based cost-shifting schemes

Figure 4 for Join-graph based cost-shifting schemes

Abstract:We develop several algorithms taking advantage of two common approaches for bounding MPE queries in graphical models: minibucket elimination and message-passing updates for linear programming relaxations. Both methods are quite similar, and offer useful perspectives for the other; our hybrid approaches attempt to balance the advantages of each. We demonstrate the power of our hybrid algorithms through extensive empirical evaluation. Most notably, a Branch and Bound search guided by the heuristic function calculated by one of our new algorithms has recently won first place in the PASCAL2 inference challenge.

* Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

Via

Access Paper or Ask Questions

Fast Planar Correlation Clustering for Image Segmentation

Aug 02, 2012

Julian Yarkony, Alexander T. Ihler, Charless C. Fowlkes

Figure 1 for Fast Planar Correlation Clustering for Image Segmentation

Figure 2 for Fast Planar Correlation Clustering for Image Segmentation

Figure 3 for Fast Planar Correlation Clustering for Image Segmentation

Figure 4 for Fast Planar Correlation Clustering for Image Segmentation

Abstract:We describe a new optimization scheme for finding high-quality correlation clusterings in planar graphs that uses weighted perfect matching as a subroutine. Our method provides lower-bounds on the energy of the optimal correlation clustering that are typically fast to compute and tight in practice. We demonstrate our algorithm on the problem of image segmentation where this approach outperforms existing global optimization techniques in minimizing the objective and is competitive with the state of the art in producing high-quality segmentations.

* This is the extended version of a paper to appear at the 12th European Conference on Computer Vision (ECCV 2012)

Via

Access Paper or Ask Questions

Gibbs Sampling for Infinite Mixture Models in the Stick Breaking Representation

Jun 27, 2012

Ian Porteous, Alexander T. Ihler, Padhraic Smyth, Max Welling

Figure 1 for Gibbs Sampling for Infinite Mixture Models in the Stick Breaking Representation

Figure 2 for Gibbs Sampling for Infinite Mixture Models in the Stick Breaking Representation

Figure 3 for Gibbs Sampling for Infinite Mixture Models in the Stick Breaking Representation

Figure 4 for Gibbs Sampling for Infinite Mixture Models in the Stick Breaking Representation

Abstract:Nonparametric Bayesian approaches to clustering, information retrieval, language modeling and object recognition have recently shown great promise as a new paradigm for unsupervised data analysis. Most contributions have focused on the Dirichlet process mixture models or extensions thereof for which efficient Gibbs samplers exist. In this paper we explore Gibbs samplers for infinite complexity mixture models in the stick breaking representation. The advantage of this representation is improved modeling flexibility. For instance, one can design the prior distribution over cluster sizes or couple multiple infinite mixture models (e.g. over time) at the level of their parameters (i.e. the dependent Dirichlet process model). However, Gibbs samplers for infinite mixture models (as recently introduced in the statistics literature) seem to mix poorly over cluster labels. Among others issues, this can have the adverse effect that labels for the same cluster in coupled mixture models are mixed up. We introduce additional moves in these samplers to improve mixing over cluster labels and to bring clusters into correspondence. An application to modeling of storm trajectories is used to illustrate these ideas.

* Appears in Proceedings of the Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI2006)

Via

Access Paper or Ask Questions

Accuracy Bounds for Belief Propagation

Jun 20, 2012

Alexander T. Ihler

Figure 1 for Accuracy Bounds for Belief Propagation

Figure 2 for Accuracy Bounds for Belief Propagation

Figure 3 for Accuracy Bounds for Belief Propagation

Abstract:The belief propagation (BP) algorithm is widely applied to perform approximate inference on arbitrary graphical models, in part due to its excellent empirical properties and performance. However, little is known theoretically about when this algorithm will perform well. Using recent analysis of convergence and stability properties in BP and new results on approximations in binary systems, we derive a bound on the error in BP's estimates for pairwise Markov random fields over discrete valued random variables. Our bound is relatively simple to compute, and compares favorably with a previous method of bounding the accuracy of BP.

* Appears in Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence (UAI2007)

Via

Access Paper or Ask Questions

Adaptive Inference on General Graphical Models

Jun 13, 2012

Umut A. Acar, Alexander T. Ihler, Ramgopal Mettu, Ozgur Sumer

Figure 1 for Adaptive Inference on General Graphical Models

Figure 2 for Adaptive Inference on General Graphical Models

Figure 3 for Adaptive Inference on General Graphical Models

Figure 4 for Adaptive Inference on General Graphical Models

Abstract:Many algorithms and applications involve repeatedly solving variations of the same inference problem; for example we may want to introduce new evidence to the model or perform updates to conditional dependencies. The goal of adaptive inference is to take advantage of what is preserved in the model and perform inference more rapidly than from scratch. In this paper, we describe techniques for adaptive inference on general graphs that support marginal computation and updates to the conditional probabilities and dependencies in logarithmic time. We give experimental results for an implementation of our algorithm, and demonstrate its potential performance benefit in the study of protein structure.

* Appears in Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence (UAI2008)

Via

Access Paper or Ask Questions

Negative Tree Reweighted Belief Propagation

Mar 15, 2012

Qiang Liu, Alexander T. Ihler

Figure 1 for Negative Tree Reweighted Belief Propagation

Figure 2 for Negative Tree Reweighted Belief Propagation

Figure 3 for Negative Tree Reweighted Belief Propagation

Abstract:We introduce a new class of lower bounds on the log partition function of a Markov random field which makes use of a reversed Jensen's inequality. In particular, our method approximates the intractable distribution using a linear combination of spanning trees with negative weights. This technique is a lower-bound counterpart to the tree-reweighted belief propagation algorithm, which uses a convex combination of spanning trees with positive weights to provide corresponding upper bounds. We develop algorithms to optimize and tighten the lower bounds over the non-convex set of valid parameter values. Our algorithm generalizes mean field approaches (including naive and structured mean field approximations), which it includes as a limiting case.

* Appears in Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence (UAI2010)

Via

Access Paper or Ask Questions

Tightening MRF Relaxations with Planar Subproblems

Feb 14, 2012

Julian Yarkony, Ragib Morshed, Alexander T. Ihler, Charless C. Fowlkes

Figure 1 for Tightening MRF Relaxations with Planar Subproblems

Figure 2 for Tightening MRF Relaxations with Planar Subproblems

Abstract:We describe a new technique for computing lower-bounds on the minimum energy configuration of a planar Markov Random Field (MRF). Our method successively adds large numbers of constraints and enforces consistency over binary projections of the original problem state space. These constraints are represented in terms of subproblems in a dual-decomposition framework that is optimized using subgradient techniques. The complete set of constraints we consider enforces cycle consistency over the original graph. In practice we find that the method converges quickly on most problems with the addition of a few subproblems and outperforms existing methods for some interesting classes of hard potentials.

Via

Access Paper or Ask Questions