Simo Särkkä

Ultrafast Deep Learning-Based Scatter Estimation in Cone-Beam Computed Tomography

Sep 10, 2025

Harshit Agrawal, Ari Hietanen, Simo Särkkä

Abstract: Purpose: Scatter artifacts drastically degrade the image quality of cone-beam computed tomography (CBCT) scans. Although deep learning-based methods show promise in estimating scatter from CBCT measurements, their deployment in mobile CBCT systems or edge devices is still limited due to the large memory footprint of the networks. This study addresses the issue by applying networks at varying resolutions and suggesting an optimal one, based on speed and accuracy. Methods: First, the reconstruction error in down-up sampling of CBCT scatter signal was examined at six resolutions by comparing four interpolation methods. Next, a recent state-of-the-art method was trained across five image resolutions and evaluated for the reductions in floating-point operations (FLOPs), inference times, and GPU memory requirements. Results: Reducing the input size and network parameters achieved a 78-fold reduction in FLOPs compared to the baseline method, while maintaining comparable performance in terms of mean-absolute-percentage-error (MAPE) and mean-square-error (MSE). Specifically, the MAPE decreased to 3.85% compared to 4.42%, and the MSE decreased to $1.34 \times 10^{-2}$ compared to $2.01 \times 10^{-2}$. Inference time and GPU memory usage were reduced by factors of 16 and 12, respectively. Further experiments comparing scatter-corrected reconstructions on a large, simulated dataset and real CBCT scans from water and Sedentex CT phantoms clearly demonstrated the robustness of our method. Conclusion: This study highlights the underappreciated role of downsampling in deep learning-based scatter estimation. The substantial reduction in FLOPs and GPU memory requirements achieved by our method enables scatter correction in resource-constrained environments, such as mobile CBCT and edge devices.
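The down-up sampling experiment described in this abstract can be illustrated with a small sketch. Everything below is an assumption made for illustration: the abstract does not name the four interpolation methods or the six resolutions, so the sketch uses block averaging for downsampling, nearest-neighbour and bilinear upsampling, and a synthetic smooth "scatter-like" image instead of CBCT data.

```python
import numpy as np

def downsample_mean(img, factor):
    """Downsample a 2D array by averaging non-overlapping factor x factor blocks."""
    h, w = img.shape
    assert h % factor == 0 and w % factor == 0
    return img.reshape(h // factor, factor, w // factor, factor).mean(axis=(1, 3))

def upsample_nearest(img, factor):
    """Repeat every coarse pixel factor times along both axes."""
    return np.repeat(np.repeat(img, factor, axis=0), factor, axis=1)

def upsample_bilinear(img, factor):
    """Separable linear interpolation; coarse samples sit at block centres."""
    h, w = img.shape
    yc = (np.arange(h) + 0.5) * factor - 0.5  # coarse row centres on the fine grid
    xc = (np.arange(w) + 0.5) * factor - 0.5  # coarse column centres on the fine grid
    yf = np.arange(h * factor)
    xf = np.arange(w * factor)
    rows = np.stack([np.interp(xf, xc, r) for r in img])             # shape (h, W)
    return np.stack([np.interp(yf, yc, c) for c in rows.T], axis=1)  # shape (H, W)

def mape(ref, est):
    """Mean absolute percentage error of est with respect to ref."""
    return 100.0 * np.mean(np.abs(est - ref) / np.abs(ref))

# Synthetic smooth, strictly positive signal standing in for a scatter projection.
y, x = np.mgrid[0:256, 0:256]
scatter = 1.0 + np.exp(-((x - 128.0) ** 2 + (y - 100.0) ** 2) / (2 * 60.0 ** 2))

errors = {}
for factor in (2, 4, 8, 16):  # hypothetical downsampling factors
    low = downsample_mean(scatter, factor)
    errors[factor] = {
        "nearest": mape(scatter, upsample_nearest(low, factor)),
        "bilinear": mape(scatter, upsample_bilinear(low, factor)),
    }

for factor, e in errors.items():
    print(factor, e)
```

Because the scatter signal is smooth, most of it survives aggressive downsampling, which is the property that makes running the network at low resolution plausible.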

Conditional Normalizing Flow Surrogate for Monte Carlo Prediction of Radiative Properties in Nanoparticle-Embedded Layers

Aug 27, 2025

Fahime Seyedheydari, Kevin Conley, Simo Särkkä

Abstract: We present a probabilistic, data-driven surrogate model for predicting the radiative properties of nanoparticle-embedded scattering media. The model uses conditional normalizing flows, which learn the conditional distribution of optical outputs, including reflectance, absorbance, and transmittance, given input parameters such as the absorption coefficient, scattering coefficient, anisotropy factor, and particle size distribution. We generate training data using Monte Carlo radiative transfer simulations, with optical properties derived from Mie theory. Unlike conventional neural networks, the conditional normalizing flow model yields full posterior predictive distributions, enabling both accurate forecasts and principled uncertainty quantification. Our results demonstrate that this model achieves high predictive accuracy and reliable uncertainty estimates, establishing it as a powerful and efficient surrogate for radiative transfer simulations.

* Proceedings of META 2025 (15th International Conference on Metamaterials, Photonic Crystals and Plasmonics), Torremolinos, Spain, July 22-25, 2025, pp. 1565-1570
* Version of record (publisher's PDF) from META 2025 (CC BY). Please cite the proceedings.

Sequential Monte Carlo for Policy Optimization in Continuous POMDPs

May 22, 2025

Hany Abdulsamad, Sahel Iqbal, Simo Särkkä

Abstract: Optimal decision-making under partial observability requires agents to balance reducing uncertainty (exploration) against pursuing immediate objectives (exploitation). In this paper, we introduce a novel policy optimization framework for continuous partially observable Markov decision processes (POMDPs) that explicitly addresses this challenge. Our method casts policy learning as probabilistic inference in a non-Markovian Feynman–Kac model that inherently captures the value of information gathering by anticipating future observations, without requiring extrinsic exploration bonuses or handcrafted heuristics. To optimize policies under this model, we develop a nested sequential Monte Carlo (SMC) algorithm that efficiently estimates a history-dependent policy gradient under samples from the optimal trajectory distribution induced by the POMDP. We demonstrate the effectiveness of our algorithm across standard continuous POMDP benchmarks, where existing methods struggle to act under uncertainty.

Statistical Linear Regression Approach to Kalman Filtering and Smoothing under Cyber-Attacks

Apr 11, 2025

Kundan Kumar, Muhammad Iqbal, Simo Särkkä

Abstract: Remote state estimation in cyber-physical systems is often vulnerable to cyber-attacks due to wireless connections between sensors and computing units. In such scenarios, adversaries compromise the system by injecting false data or blocking measurement transmissions via denial-of-service attacks, distorting sensor readings. This paper develops a Kalman filter and Rauch–Tung–Striebel (RTS) smoother for linear stochastic state-space models subject to cyber-attacked measurements. We approximate the faulty measurement model via generalized statistical linear regression (GSLR). The GSLR-based approximated measurement model is then used to develop a Kalman filter and RTS smoother for the problem. The effectiveness of the proposed algorithms under cyber-attacks is demonstrated through a simulated aircraft tracking experiment.

* 5 pages, 4 figures
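
For orientation, the attack-free baseline that this abstract builds on can be sketched as follows. This is the textbook Kalman filter and RTS smoother for a linear Gaussian model, not the GSLR-based method of the paper; the constant-velocity model, the noise levels, and all variable names are assumptions chosen for illustration.

```python
import numpy as np

def kalman_filter(ys, A, Q, H, R, m0, P0):
    """Standard Kalman filter; returns filtered means and covariances."""
    ms, Ps = [], []
    m, P = m0, P0
    for y in ys:
        # Prediction step
        m = A @ m
        P = A @ P @ A.T + Q
        # Update step
        S = H @ P @ H.T + R
        K = np.linalg.solve(S, H @ P).T  # K = P H^T S^{-1} (P and S are symmetric)
        m = m + K @ (y - H @ m)
        P = P - K @ S @ K.T
        ms.append(m)
        Ps.append(P)
    return np.array(ms), np.array(Ps)

def rts_smoother(ms, Ps, A, Q):
    """Rauch-Tung-Striebel smoother run backwards over the filter output."""
    ms_s, Ps_s = ms.copy(), Ps.copy()
    for k in range(len(ms) - 2, -1, -1):
        m_pred = A @ ms[k]
        P_pred = A @ Ps[k] @ A.T + Q
        G = np.linalg.solve(P_pred, A @ Ps[k]).T  # G = P_k A^T P_pred^{-1}
        ms_s[k] = ms[k] + G @ (ms_s[k + 1] - m_pred)
        Ps_s[k] = Ps[k] + G @ (Ps_s[k + 1] - P_pred) @ G.T
    return ms_s, Ps_s

# Hypothetical 1D constant-velocity model with noisy position measurements.
rng = np.random.default_rng(0)
dt = 1.0
A = np.array([[1.0, dt], [0.0, 1.0]])
Q = 0.01 * np.array([[dt**3 / 3, dt**2 / 2], [dt**2 / 2, dt]])
H = np.array([[1.0, 0.0]])
R = np.array([[0.25]])

x = np.array([0.0, 1.0])
ys = []
for _ in range(50):
    x = A @ x + rng.multivariate_normal(np.zeros(2), Q)
    ys.append(H @ x + rng.multivariate_normal(np.zeros(1), R))

ms, Ps = kalman_filter(ys, A, Q, H, R, np.zeros(2), np.eye(2))
ms_s, Ps_s = rts_smoother(ms, Ps, A, Q)
```

Under false-data injection or denial-of-service attacks the measurement model is no longer the simple linear Gaussian one assumed here, which is the gap the GSLR approximation in the paper is meant to close.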

Provable Quantum Algorithm Advantage for Gaussian Process Quadrature

Feb 20, 2025

Cristian A. Galvis-Florez, Ahmad Farooq, Simo Särkkä

Abstract: The aim of this paper is to develop novel quantum algorithms for Gaussian process quadrature methods. Gaussian process quadratures are numerical integration methods where Gaussian processes are used as functional priors for the integrands to capture the uncertainty arising from the sparse function evaluations. Quantum computers have emerged as potential replacements for classical computers, offering exponential reductions in the computational complexity of machine learning tasks. In this paper, we combine Gaussian process quadratures and quantum computing by proposing a quantum low-rank Gaussian process quadrature method based on a Hilbert space approximation of the Gaussian process kernel and enhancing the quadrature using a quantum circuit. The method combines the quantum phase estimation algorithm with the quantum principal component analysis technique to extract information up to a desired rank. Then, Hadamard and SWAP tests are implemented to find the expected value and variance that determine the quadrature. We use numerical simulations of a quantum computer to demonstrate the effectiveness of the method. Furthermore, we provide a theoretical complexity analysis that shows a polynomial advantage over classical Gaussian process quadrature methods. The code is available at https://github.com/cagalvisf/Quantum_HSGPQ.

* 21 pages, 6 figures

Gaussian multi-target filtering with target dynamics driven by a stochastic differential equation

Nov 29, 2024

Ángel F. García-Fernández, Simo Särkkä

Abstract: This paper proposes multi-target filtering algorithms in which target dynamics are given in continuous time and measurements are obtained at discrete time instants. In particular, targets appear according to a Poisson point process (PPP) in time with a given Gaussian spatial distribution, targets move according to a general time-invariant linear stochastic differential equation, and the life span of each target is modelled with an exponential distribution. For this multi-target dynamic model, we derive the distribution of the set of newborn targets and calculate closed-form expressions for the best-fitting mean and covariance of each target at its time of birth by minimising the Kullback-Leibler divergence via moment matching. This yields a novel Gaussian continuous-discrete Poisson multi-Bernoulli mixture (PMBM) filter, and its approximations based on Poisson multi-Bernoulli and probability hypothesis density filtering. These continuous-discrete multi-target filters are also extended to target dynamics driven by nonlinear stochastic differential equations.
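
As background for the single-target part of this model, the standard discretisation of a linear time-invariant SDE and the survival probability implied by an exponential life span can be written out. The symbols $A$, $L$, $Q_c$, $\mu$ and $\Delta$ are notation assumed here for illustration and are not taken from the paper.

$$
\mathrm{d}x(t) = A\,x(t)\,\mathrm{d}t + L\,\mathrm{d}\beta(t),
\qquad
x(t+\Delta) \mid x(t) \sim \mathcal{N}\!\left( e^{A\Delta}\,x(t),\; \int_0^{\Delta} e^{As}\, L\, Q_c\, L^{\top} e^{A^{\top}s}\,\mathrm{d}s \right),
$$

where $\beta(t)$ is a Brownian motion with diffusion matrix $Q_c$ and $\Delta$ is the time between two measurements. If the life span of a target is exponentially distributed with rate $\mu$, the memoryless property gives the probability that a target alive at time $t$ is still alive at time $t+\Delta$ as

$$
p_S(\Delta) = e^{-\mu\Delta}.
$$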

A Parallel-in-Time Newton's Method for Nonlinear Model Predictive Control

Sep 30, 2024

Casian Iacob, Hany Abdulsamad, Simo Särkkä

Abstract: Model predictive control (MPC) is a powerful framework for optimal control of dynamical systems. However, MPC solvers suffer from a high computational burden that restricts their application to systems with low sampling frequency. This issue is further amplified in nonlinear and constrained systems that require nesting MPC solvers within iterative procedures. In this paper, we address these issues by developing parallel-in-time algorithms for constrained nonlinear optimization problems that take advantage of massively parallel hardware to achieve logarithmic computational time scaling over the planning horizon. We develop time-parallel second-order solvers based on interior point methods and the alternating direction method of multipliers, leveraging fast convergence and lower computational cost per iteration. The parallelization is based on a reformulation of the subproblems in terms of associative operations that can be parallelized using the associative scan algorithm. We validate our approach on numerical examples of nonlinear and constrained dynamical systems.
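
The associative-scan idea mentioned in this abstract can be illustrated on the simplest possible case, an affine recurrence $x_{k+1} = A_k x_k + b_k$. The sketch below is an assumption-laden toy: the associative elements used in the paper for the Newton, interior point and ADMM subproblems are not given in the abstract, and the NumPy loop only mimics the data flow of a parallel scan (each doubling step could run in parallel across time on suitable hardware).

```python
import numpy as np

def combine(first, second):
    """Compose affine maps: apply `first`, then `second`. This operation is associative."""
    A1, b1 = first
    A2, b2 = second
    return A2 @ A1, (A2 @ b1[..., None])[..., 0] + b2

def associative_scan(As, bs):
    """Inclusive prefix scan with O(log T) doubling steps (Hillis-Steele style)."""
    As, bs = As.copy(), bs.copy()
    T = As.shape[0]
    d = 1
    while d < T:
        # Element k >= d absorbs the partial result ending at k - d.
        A_new, b_new = combine((As[:-d], bs[:-d]), (As[d:], bs[d:]))
        As = np.concatenate([As[:d], A_new])
        bs = np.concatenate([bs[:d], b_new])
        d *= 2
    return As, bs

# Hypothetical random affine dynamics over a horizon of T steps.
rng = np.random.default_rng(1)
T, n = 37, 3
As = 0.3 * rng.normal(size=(T, n, n))
bs = rng.normal(size=(T, n))
x0 = rng.normal(size=n)

# Sequential reference rollout: O(T) dependent steps.
xs_seq = []
x = x0
for k in range(T):
    x = As[k] @ x + bs[k]
    xs_seq.append(x)
xs_seq = np.array(xs_seq)

# Scan-based rollout: prefix k maps x0 directly to x_{k+1}.
PA, Pb = associative_scan(As, bs)
xs_scan = (PA @ x0[None, :, None])[..., 0] + Pb
```

The loop over `d` runs only about $\log_2 T$ times, which is where the logarithmic scaling over the horizon comes from when each step is executed in parallel. In JAX, `jax.lax.associative_scan` provides a ready-made primitive for this pattern.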

Recursive Nested Filtering for Efficient Amortized Bayesian Experimental Design

Sep 09, 2024

Sahel Iqbal, Hany Abdulsamad, Sara Pérez-Vieites, Simo Särkkä, Adrien Corenflos

Abstract: This paper introduces the Inside-Out Nested Particle Filter (IO-NPF), a novel, fully recursive algorithm for amortized sequential Bayesian experimental design in the non-exchangeable setting. We frame policy optimization as maximum likelihood estimation in a non-Markovian state-space model, achieving (at most) $\mathcal{O}(T^2)$ computational complexity in the number of experiments. We provide theoretical convergence guarantees and introduce a backward sampling algorithm to reduce trajectory degeneracy. IO-NPF offers a practical, extensible, and provably consistent approach to sequential Bayesian experimental design, demonstrating improved efficiency over existing methods.

Conditioning diffusion models by explicit forward-backward bridging

May 22, 2024

Adrien Corenflos, Zheng Zhao, Simo Särkkä, Jens Sjölund, Thomas B. Schön

Abstract: Given an unconditional diffusion model $\pi(x, y)$, using it to perform conditional simulation $\pi(x \mid y)$ is still largely an open question and is typically achieved by learning conditional drifts to the denoising SDE after the fact. In this work, we express conditional simulation as an inference problem on an augmented space corresponding to a partial SDE bridge. This perspective allows us to implement efficient and principled particle Gibbs and pseudo-marginal samplers marginally targeting the conditional distribution $\pi(x \mid y)$. Contrary to existing methodology, our methods do not introduce any additional approximation to the unconditional diffusion model aside from the Monte Carlo error. We showcase the benefits and drawbacks of our approach on a series of synthetic and real data examples.

* 24 pages, 12 figures

Utilizing U-Net Architectures with Auxiliary Information for Scatter Correction in CBCT Across Different Field-of-View Settings

Feb 27, 2024

Harshit Agrawal, Ari Hietanen, Simo Särkkä

Abstract: Cone-beam computed tomography (CBCT) has become a vital imaging technique in various medical fields, but scatter artifacts are a major limitation in CBCT scanning. This challenge is exacerbated by the use of large flat panel 2D detectors. The scatter-to-primary ratio increases significantly with the increase in the size of the field of view (FOV) being scanned. Several deep learning methods, particularly U-Net architectures, have shown promising capabilities in estimating the scatter directly from the CBCT projections. However, the influence of varying FOV sizes on these deep learning models remains unexplored. Having a single neural network for the scatter estimation of varying FOV projections can be of significant importance towards real clinical applications. This study aims to train and evaluate the performance of a U-Net network on a simulated dataset with varying FOV sizes. We further propose a new method (Aux-Net) by providing auxiliary information, such as FOV size, to the U-Net encoder. We validate our method on 30 different FOV sizes and compare it with the U-Net. Our study demonstrates that providing auxiliary information to the network enhances the generalization capability of the U-Net. Our findings suggest that this novel approach outperforms the baseline U-Net, offering a significant step towards practical application in real clinical settings where CBCT systems are employed to scan a wide range of FOVs.
