Abstract: Deep neural networks achieve impressive performance but remain difficult to interpret and control. We present SALVE (Sparse Autoencoder-Latent Vector Editing), a unified "discover, validate, and control" framework that bridges mechanistic interpretability and model editing. Using an $\ell_1$-regularized autoencoder, we learn a sparse, model-native feature basis without supervision. We validate these features with Grad-FAM, a feature-level saliency mapping method that visually grounds latent features in input data. Leveraging the autoencoder's structure, we perform precise and permanent weight-space interventions, enabling continuous modulation of both class-defining and cross-class features. We further derive a critical suppression threshold, $\alpha_{\mathrm{crit}}$, quantifying each class's reliance on its dominant feature, supporting fine-grained robustness diagnostics. Our approach is validated on both convolutional (ResNet-18) and transformer-based (ViT-B/16) models, demonstrating consistent, interpretable control over their behavior. This work contributes a principled methodology for turning feature discovery into actionable model edits, advancing the development of transparent and controllable AI systems.
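A minimal sketch of the $\ell_1$-regularized autoencoder objective and the kind of feature-level weight edit described above, assuming a PyTorch implementation; the layer sizes, the sparsity weight, and the edit_feature helper are illustrative assumptions, not the paper's actual code.

    import torch
    import torch.nn as nn

    class SparseAutoencoder(nn.Module):
        """Learns a sparse, overcomplete feature basis for model activations."""
        def __init__(self, d_in=512, d_hidden=2048):
            super().__init__()
            self.encoder = nn.Linear(d_in, d_hidden)
            self.decoder = nn.Linear(d_hidden, d_in)

        def forward(self, x):
            z = torch.relu(self.encoder(x))      # sparse latent code
            return self.decoder(z), z

    def loss_fn(x, x_hat, z, l1_weight=1e-3):
        recon = ((x - x_hat) ** 2).mean()        # reconstruction term
        sparsity = z.abs().mean()                # l1 penalty encouraging sparsity
        return recon + l1_weight * sparsity

    def edit_feature(sae, j, alpha):
        """Hypothetical weight-space intervention: scale feature j's decoder
        direction by alpha (values below the critical threshold suppress the
        behavior tied to feature j; alpha > 1 amplifies it)."""
        with torch.no_grad():
            sae.decoder.weight[:, j] *= alpha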
Abstract: Engineering disciplines often rely on extensive simulations to ensure that structures are designed to withstand harsh conditions while avoiding over-engineering for unlikely scenarios. Assessments such as Serviceability Limit State (SLS) involve evaluating weather events, including estimating loads not expected to be exceeded more than a specified number of times (e.g., 100) throughout the structure's design lifetime. Although physics-based simulations provide robust and detailed insights, they are computationally expensive, making it challenging to generate statistically valid representations of a wide range of weather conditions. To address these challenges, we propose an approach using Gaussian Process (GP) surrogate models trained on a limited set of simulation outputs to directly generate the structural response distribution. We apply this method to an SLS assessment for estimating the order statistic $Y_{100}$, the 100th highest response, of a structure exposed to 25 years of historical weather observations. Our results indicate that the GP surrogate models provide results comparable to full simulations at a fraction of the computational cost.
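A rough sketch of this surrogate workflow, assuming scikit-learn's GaussianProcessRegressor; the run_simulation stand-in, the two weather features, the kernel, and the training-set size are illustrative assumptions rather than details from the paper.

    import numpy as np
    from sklearn.gaussian_process import GaussianProcessRegressor
    from sklearn.gaussian_process.kernels import RBF, ConstantKernel

    def run_simulation(weather):
        # Placeholder for the expensive physics-based solver.
        return np.sin(weather @ np.array([0.8, 1.3])) + 0.1 * np.random.randn(len(weather))

    rng = np.random.default_rng(0)
    weather_hist = rng.normal(size=(20000, 2))   # stand-in for 25 years of observations

    # Train the GP on a small, affordable subset of full simulations.
    idx = rng.choice(len(weather_hist), size=200, replace=False)
    X_train = weather_hist[idx]
    y_train = run_simulation(X_train)

    gp = GaussianProcessRegressor(kernel=ConstantKernel() * RBF(), normalize_y=True)
    gp.fit(X_train, y_train)

    # Predict the response over the full weather history and draw one sample
    # from the predictive distribution to read off the order statistic Y_100.
    mean, std = gp.predict(weather_hist, return_std=True)
    responses = rng.normal(mean, std)
    Y_100 = np.sort(responses)[-100]             # 100th highest response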
Abstract: Distribution shifts, where statistical properties differ between training and test datasets, present a significant challenge in real-world machine learning applications, where they directly impact model generalization and robustness. In this study, we explore model adaptation and generalization by utilizing synthetic data to systematically address distributional disparities. Our investigation aims to identify the prerequisites for successful model adaptation across diverse data distributions while quantifying the associated uncertainties. Specifically, we generate synthetic data using the Van der Waals equation for gases and employ quantitative measures such as Kullback-Leibler divergence, Jensen-Shannon distance, and Mahalanobis distance to assess data similarity. These metrics enable us both to evaluate model accuracy and to quantify the uncertainty in predictions arising from data distribution shifts. Our findings suggest that using statistical measures, such as the Mahalanobis distance, to determine whether model predictions fall within the low-error "interpolation regime" or the high-error "extrapolation regime" provides a complementary method for assessing distribution shift and model uncertainty. These insights hold significant value for enhancing model robustness and generalization, both essential for the successful deployment of machine learning applications in real-world scenarios.
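The data generation and similarity measures named above can be sketched with NumPy and SciPy as follows; the Van der Waals constants, the feature ranges, the histogram binning, and the Mahalanobis cutoff of 3.0 are illustrative assumptions.

    import numpy as np
    from scipy.spatial.distance import jensenshannon, mahalanobis
    from scipy.stats import entropy

    def van_der_waals_pressure(T, V, a=1.36, b=0.0385, R=0.0821, n=1.0):
        # P = nRT / (V - nb) - a n^2 / V^2; a, b here are rough CO2-like constants.
        return n * R * T / (V - n * b) - a * n**2 / V**2

    rng = np.random.default_rng(0)
    # Synthetic (temperature, volume) inputs; the test range is deliberately shifted.
    train_X = np.column_stack([rng.uniform(250, 350, 5000), rng.uniform(1.0, 5.0, 5000)])
    test_X = np.column_stack([rng.uniform(300, 450, 5000), rng.uniform(1.0, 5.0, 5000)])
    train_y = van_der_waals_pressure(train_X[:, 0], train_X[:, 1])  # synthetic targets

    # Histogram-based KL divergence and Jensen-Shannon distance on temperature.
    bins = np.linspace(200, 500, 50)
    p, _ = np.histogram(train_X[:, 0], bins=bins, density=True)
    q, _ = np.histogram(test_X[:, 0], bins=bins, density=True)
    p, q = p + 1e-12, q + 1e-12                  # avoid empty bins
    kl = entropy(p, q)                           # KL(p || q)
    jsd = jensenshannon(p, q)

    # Mahalanobis distance of each test point from the training distribution;
    # large distances flag the high-error "extrapolation regime".
    mu = train_X.mean(axis=0)
    cov_inv = np.linalg.inv(np.cov(train_X, rowvar=False))
    dists = np.array([mahalanobis(x, mu, cov_inv) for x in test_X])
    extrapolating = dists > 3.0                  # illustrative cutoff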
Abstract: This work examines data models commonly used in digital twins and presents preliminary results from surface reconstruction and semantic segmentation models trained on simulated data. It is intended to serve as groundwork for future endeavours in data contextualisation inside a digital twin.