Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Steffen Schneider

Time-series attribution maps with regularized contrastive learning

Feb 17, 2025

Steffen Schneider, Rodrigo González Laiz, Anastasiia Filippova, Markus Frey, Mackenzie Weygandt Mathis

Abstract:Gradient-based attribution methods aim to explain decisions of deep learning models but so far lack identifiability guarantees. Here, we propose a method to generate attribution maps with identifiability guarantees by developing a regularized contrastive learning algorithm trained on time-series data plus a new attribution method called Inverted Neuron Gradient (collectively named xCEBRA). We show theoretically that xCEBRA has favorable properties for identifying the Jacobian matrix of the data generating process. Empirically, we demonstrate robust approximation of zero vs. non-zero entries in the ground-truth attribution map on synthetic datasets, and significant improvements across previous attribution methods based on feature ablation, Shapley values, and other gradient-based methods. Our work constitutes a first example of identifiable inference of time-series attribution maps and opens avenues to a better understanding of time-series data, such as for neural dynamics and decision-processes within neural networks.

* The 28th International Conference on Artificial Intelligence and Statistics 2025
* Accepted at The 28th International Conference on Artificial Intelligence and Statistics (AISTATS 2025). Code is available at https://github.com/AdaptiveMotorControlLab/CEBRA

Via

Access Paper or Ask Questions

Sparse autoencoders reveal selective remapping of visual concepts during adaptation

Dec 06, 2024

Hyesu Lim, Jinho Choi, Jaegul Choo, Steffen Schneider

Figure 1 for Sparse autoencoders reveal selective remapping of visual concepts during adaptation

Figure 2 for Sparse autoencoders reveal selective remapping of visual concepts during adaptation

Figure 3 for Sparse autoencoders reveal selective remapping of visual concepts during adaptation

Figure 4 for Sparse autoencoders reveal selective remapping of visual concepts during adaptation

Abstract:Adapting foundation models for specific purposes has become a standard approach to build machine learning systems for downstream applications. Yet, it is an open question which mechanisms take place during adaptation. Here we develop a new Sparse Autoencoder (SAE) for the CLIP vision transformer, named PatchSAE, to extract interpretable concepts at granular levels (e.g. shape, color, or semantics of an object) and their patch-wise spatial attributions. We explore how these concepts influence the model output in downstream image classification tasks and investigate how recent state-of-the-art prompt-based adaptation techniques change the association of model inputs to these concepts. While activations of concepts slightly change between adapted and non-adapted models, we find that the majority of gains on common adaptation tasks can be explained with the existing concepts already present in the non-adapted foundation model. This work provides a concrete framework to train and use SAEs for Vision Transformers and provides insights into explaining adaptation mechanisms.

* A demo is available at github.com/dynamical-inference/patchsae

Via

Access Paper or Ask Questions

Self-supervised contrastive learning performs non-linear system identification

Oct 18, 2024

Rodrigo González Laiz, Tobias Schmidt, Steffen Schneider

Abstract:Self-supervised learning (SSL) approaches have brought tremendous success across many tasks and domains. It has been argued that these successes can be attributed to a link between SSL and identifiable representation learning: Temporal structure and auxiliary variables ensure that latent representations are related to the true underlying generative factors of the data. Here, we deepen this connection and show that SSL can perform system identification in latent space. We propose DynCL, a framework to uncover linear, switching linear and non-linear dynamics under a non-linear observation model, give theoretical guarantees and validate them empirically.

Via

Access Paper or Ask Questions

RDumb: A simple approach that questions our progress in continual test-time adaptation

Jun 08, 2023

Ori Press, Steffen Schneider, Matthias Kümmerer, Matthias Bethge

Abstract:Test-Time Adaptation (TTA) allows to update pretrained models to changing data distributions at deployment time. While early work tested these algorithms for individual fixed distribution shifts, recent work proposed and applied methods for continual adaptation over long timescales. To examine the reported progress in the field, we propose the Continuously Changing Corruptions (CCC) benchmark to measure asymptotic performance of TTA techniques. We find that eventually all but one state-of-the-art methods collapse and perform worse than a non-adapting model, including models specifically proposed to be robust to performance collapse. In addition, we introduce a simple baseline, "RDumb", that periodically resets the model to its pretrained state. RDumb performs better or on par with the previously proposed state-of-the-art in all considered benchmarks. Our results show that previous TTA approaches are neither effective at regularizing adaptation to avoid collapse nor able to outperform a simplistic resetting strategy.

Via

Access Paper or Ask Questions

Learnable latent embeddings for joint behavioral and neural analysis

Apr 01, 2022

Steffen Schneider, Jin Hwa Lee, Mackenzie Weygandt Mathis

Figure 1 for Learnable latent embeddings for joint behavioral and neural analysis

Figure 2 for Learnable latent embeddings for joint behavioral and neural analysis

Figure 3 for Learnable latent embeddings for joint behavioral and neural analysis

Figure 4 for Learnable latent embeddings for joint behavioral and neural analysis

Abstract:Mapping behavioral actions to neural activity is a fundamental goal of neuroscience. As our ability to record large neural and behavioral data increases, there is growing interest in modeling neural dynamics during adaptive behaviors to probe neural representations. In particular, neural latent embeddings can reveal underlying correlates of behavior, yet, we lack non-linear techniques that can explicitly and flexibly leverage joint behavior and neural data. Here, we fill this gap with a novel method, CEBRA, that jointly uses behavioral and neural data in a hypothesis- or discovery-driven manner to produce consistent, high-performance latent spaces. We validate its accuracy and demonstrate our tool's utility for both calcium and electrophysiology datasets, across sensory and motor tasks, and in simple or complex behaviors across species. It allows for single and multi-session datasets to be leveraged for hypothesis testing or can be used label-free. Lastly, we show that CEBRA can be used for the mapping of space, uncovering complex kinematic features, and rapid, high-accuracy decoding of natural movies from visual cortex.

* Website: cebra.ai

Via

Access Paper or Ask Questions

Unsupervised Object Learning via Common Fate

Oct 13, 2021

Matthias Tangemann, Steffen Schneider, Julius von Kügelgen, Francesco Locatello, Peter Gehler, Thomas Brox, Matthias Kümmerer, Matthias Bethge, Bernhard Schölkopf

Figure 1 for Unsupervised Object Learning via Common Fate

Figure 2 for Unsupervised Object Learning via Common Fate

Figure 3 for Unsupervised Object Learning via Common Fate

Figure 4 for Unsupervised Object Learning via Common Fate

Abstract:Learning generative object models from unlabelled videos is a long standing problem and required for causal scene modeling. We decompose this problem into three easier subtasks, and provide candidate solutions for each of them. Inspired by the Common Fate Principle of Gestalt Psychology, we first extract (noisy) masks of moving objects via unsupervised motion segmentation. Second, generative models are trained on the masks of the background and the moving objects, respectively. Third, background and foreground models are combined in a conditional "dead leaves" scene model to sample novel scene configurations where occlusions and depth layering arise naturally. To evaluate the individual stages, we introduce the Fishbowl dataset positioned between complex real-world scenes and common object-centric benchmarks of simplistic objects. We show that our approach allows learning generative models that generalize beyond the occlusions present in the input videos, and represent scenes in a modular fashion that allows sampling plausible scenes outside the training distribution by permitting, for instance, object numbers or densities not observed in the training set.

Via

Access Paper or Ask Questions

Adapting ImageNet-scale models to complex distribution shifts with self-learning

Apr 28, 2021

Evgenia Rusak, Steffen Schneider, Peter Gehler, Oliver Bringmann, Wieland Brendel, Matthias Bethge

Figure 1 for Adapting ImageNet-scale models to complex distribution shifts with self-learning

Figure 2 for Adapting ImageNet-scale models to complex distribution shifts with self-learning

Figure 3 for Adapting ImageNet-scale models to complex distribution shifts with self-learning

Figure 4 for Adapting ImageNet-scale models to complex distribution shifts with self-learning

Abstract:While self-learning methods are an important component in many recent domain adaptation techniques, they are not yet comprehensively evaluated on ImageNet-scale datasets common in robustness research. In extensive experiments on ResNet and EfficientNet models, we find that three components are crucial for increasing performance with self-learning: (i) using short update times between the teacher and the student network, (ii) fine-tuning only few affine parameters distributed across the network, and (iii) leveraging methods from robust classification to counteract the effect of label noise. We use these insights to obtain drastically improved state-of-the-art results on ImageNet-C (22.0% mCE), ImageNet-R (17.4% error) and ImageNet-A (14.8% error). Our techniques yield further improvements in combination with previously proposed robustification methods. Self-learning is able to reduce the top-1 error to a point where no substantial further progress can be expected. We therefore re-purpose the dataset from the Visual Domain Adaptation Challenge 2019 and use a subset of it as a new robustness benchmark (ImageNet-D) which proves to be a more challenging dataset for all current state-of-the-art models (58.2% error) to guide future research efforts at the intersection of robustness and domain adaptation on ImageNet scale.

* Web: https://domainadaptation.org/selflearning

Via

Access Paper or Ask Questions

Contrastive Learning Inverts the Data Generating Process

Feb 17, 2021

Roland S. Zimmermann, Yash Sharma, Steffen Schneider, Matthias Bethge, Wieland Brendel

Figure 1 for Contrastive Learning Inverts the Data Generating Process

Figure 2 for Contrastive Learning Inverts the Data Generating Process

Figure 3 for Contrastive Learning Inverts the Data Generating Process

Figure 4 for Contrastive Learning Inverts the Data Generating Process

Abstract:Contrastive learning has recently seen tremendous success in self-supervised learning. So far, however, it is largely unclear why the learned representations generalize so effectively to a large variety of downstream tasks. We here prove that feedforward models trained with objectives belonging to the commonly used InfoNCE family learn to implicitly invert the underlying generative model of the observed data. While the proofs make certain statistical assumptions about the generative model, we observe empirically that our findings hold even if these assumptions are severely violated. Our theory highlights a fundamental connection between contrastive learning, generative modeling, and nonlinear independent component analysis, thereby furthering our understanding of the learned representations as well as providing a theoretical foundation to derive more effective contrastive losses.

* The first three authors, as well as the last two authors, contributed equally. Code is available at https://brendel-group.github.io/cl-ica

Via

Access Paper or Ask Questions

A Primer on Motion Capture with Deep Learning: Principles, Pitfalls and Perspectives

Sep 02, 2020

Alexander Mathis, Steffen Schneider, Jessy Lauer, Mackenzie W. Mathis

Figure 1 for A Primer on Motion Capture with Deep Learning: Principles, Pitfalls and Perspectives

Figure 2 for A Primer on Motion Capture with Deep Learning: Principles, Pitfalls and Perspectives

Figure 3 for A Primer on Motion Capture with Deep Learning: Principles, Pitfalls and Perspectives

Figure 4 for A Primer on Motion Capture with Deep Learning: Principles, Pitfalls and Perspectives

Abstract:Extracting behavioral measurements non-invasively from video is stymied by the fact that it is a hard computational problem. Recent advances in deep learning have tremendously advanced predicting posture from videos directly, which quickly impacted neuroscience and biology more broadly. In this primer we review the budding field of motion capture with deep learning. In particular, we will discuss the principles of those novel algorithms, highlight their potential as well as pitfalls for experimentalists, and provide a glimpse into the future.

* Review, 21 pages, 8 figures and 5 boxes

Via

Access Paper or Ask Questions

Improving robustness against common corruptions by covariate shift adaptation

Jun 30, 2020

Steffen Schneider, Evgenia Rusak, Luisa Eck, Oliver Bringmann, Wieland Brendel, Matthias Bethge

Figure 1 for Improving robustness against common corruptions by covariate shift adaptation

Figure 2 for Improving robustness against common corruptions by covariate shift adaptation

Figure 3 for Improving robustness against common corruptions by covariate shift adaptation

Figure 4 for Improving robustness against common corruptions by covariate shift adaptation

Abstract:Today's state-of-the-art machine vision models are vulnerable to image corruptions like blurring or compression artefacts, limiting their performance in many real-world applications. We here argue that popular benchmarks to measure model robustness against common corruptions (like ImageNet-C) underestimate model robustness in many (but not all) application scenarios. The key insight is that in many scenarios, multiple unlabeled examples of the corruptions are available and can be used for unsupervised online adaptation. Replacing the activation statistics estimated by batch normalization on the training set with the statistics of the corrupted images consistently improves the robustness across 25 different popular computer vision models. Using the corrected statistics, ResNet-50 reaches 62.2% mCE on ImageNet-C compared to 76.7% without adaptation. With the more robust AugMix model, we improve the state of the art from 56.5% mCE to 51.0% mCE. Even adapting to a single sample improves robustness for the ResNet-50 and AugMix models, and 32 samples are sufficient to improve the current state of the art for a ResNet-50 architecture. We argue that results with adapted statistics should be included whenever reporting scores in corruption benchmarks and other out-of-distribution generalization settings.

Via

Access Paper or Ask Questions