Sushrut Thorat

Sparks of cognitive flexibility: self-guided context inference for flexible stimulus-response mapping by attentional routing

Feb 21, 2025

Rowan Sommers, Sushrut Thorat, Daniel Anthes, Tim C. Kietzmann

Abstract: Flexible cognition demands discovering hidden rules to quickly adapt stimulus-response mappings. Standard neural networks struggle in tasks requiring rapid, context-driven remapping. Recently, Hummos (2023) introduced a fast-and-slow learning algorithm to mitigate this shortfall, but its scalability to complex, image-computable tasks was unclear. Here, we propose the Wisconsin Neural Network (WiNN), which expands on fast-and-slow learning for real-world tasks demanding flexible rule-based behavior. WiNN employs a pretrained convolutional neural network for vision, coupled with an adjustable "context state" that guides attention to relevant features. If WiNN produces an incorrect response, it first iteratively updates its context state to refocus attention on task-relevant cues, then performs minimal parameter updates to attention and readout layers. This strategy preserves generalizable representations in the sensory network, reducing catastrophic forgetting. We evaluate WiNN on an image-based extension of the Wisconsin Card Sorting Task, revealing several markers of cognitive flexibility: (i) WiNN autonomously infers underlying rules, (ii) requires fewer examples to do so than control models reliant on large-scale parameter updates, (iii) can perform context-based rule inference solely via context-state adjustments, further enhanced by slow updates of attention and readout parameters, and (iv) generalizes to unseen compositional rules through context-state inference alone. By blending fast context inference with targeted attentional guidance, WiNN achieves "sparks" of flexibility. This approach offers a path toward context-sensitive models that retain knowledge while rapidly adapting to complex, rule-based tasks.

* 11 pages, 4 figures
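
The fast-then-slow update order described in the abstract above can be illustrated with a toy sketch. This is not the WiNN implementation: the scalar model, the sigmoid gating, the squared-error loss, and every name and hyperparameter (`adapt`, `fast_steps`, `fast_lr`, `slow_lr`, `tol`) are assumptions chosen only to show the control flow, in which the context state is adjusted first and the readout weights are touched only if that is not enough.

```python
import math


def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


def predict(x, context, weights):
    """Gate each feature by sigmoid(context_i), then apply a linear readout."""
    return sum(sigmoid(c) * w * xi for c, w, xi in zip(context, weights, x))


def adapt(x, target, context, weights,
          fast_steps=200, fast_lr=0.5, slow_lr=0.01, tol=1e-2):
    """Toy fast-then-slow adaptation on a squared-error loss (hypothetical)."""
    context, weights = list(context), list(weights)

    # Fast phase: gradient steps on the context state only; weights stay fixed.
    for _ in range(fast_steps):
        err = predict(x, context, weights) - target
        if abs(err) <= tol:
            return context, weights  # solved by context inference alone
        gates = [sigmoid(c) for c in context]
        context = [c - fast_lr * 2.0 * err * w * xi * g * (1.0 - g)
                   for c, w, xi, g in zip(context, weights, x, gates)]

    # Slow phase: a single small gradient step on the readout weights,
    # taken only if the fast phase did not reach the tolerance.
    err = predict(x, context, weights) - target
    if abs(err) > tol:
        gates = [sigmoid(c) for c in context]
        weights = [w - slow_lr * 2.0 * err * g * xi
                   for w, g, xi in zip(weights, gates, x)]
    return context, weights
```

In this toy setting, a target that the gates can reach leaves `weights` untouched, while an unreachable target falls through to the slow phase.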

Balancing stability and plasticity in continual learning: the readout-decomposition of activation change (RDAC) framework

Oct 10, 2023

Daniel Anthes, Sushrut Thorat, Peter König, Tim C. Kietzmann

Abstract: Continual learning (CL) algorithms strive to acquire new knowledge while preserving prior information. However, this stability-plasticity trade-off remains a central challenge. This paper introduces a framework that dissects this trade-off, offering valuable insights into CL algorithms. The Readout-Decomposition of Activation Change (RDAC) framework first addresses the stability-plasticity dilemma and its relation to catastrophic forgetting. It relates learning-induced activation changes in the range of prior readouts to the degree of stability and changes in the null space to the degree of plasticity. In deep non-linear networks tackling split-CIFAR-110 tasks, the framework clarifies the stability-plasticity trade-offs of the popular regularization algorithms Synaptic Intelligence (SI), Elastic Weight Consolidation (EWC), and Learning without Forgetting (LwF), and of the replay-based algorithms Gradient Episodic Memory (GEM) and data replay. GEM and data replay preserved stability and plasticity, while SI, EWC, and LwF traded off plasticity for stability. The inability of the regularization algorithms to maintain plasticity was linked to them restricting the change of activations in the null space of the prior readout. Additionally, for one-hidden-layer linear neural networks, we derived a gradient decomposition algorithm to restrict activation change only in the range of the prior readouts, to maintain high stability while not further sacrificing plasticity. Results demonstrate that the algorithm maintained stability without significant plasticity loss. The RDAC framework informs the behavior of existing CL algorithms and paves the way for novel CL approaches. Finally, it sheds light on the connection between learning-induced activation/representation changes and the stability-plasticity dilemma, also offering insights into representational drift in biological systems.

* 13 pages, 4 figures
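
The range versus null-space split mentioned in the abstract above can be sketched for a linear readout. This is a minimal illustration, not the paper's code: it assumes the readout is a list of weight rows over hidden units, takes "range" to mean the span of those rows, and uses plain Gram-Schmidt; all function names are hypothetical.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def orthonormal_basis(rows, tol=1e-12):
    """Gram-Schmidt over the readout weight rows; drops dependent rows."""
    basis = []
    for row in rows:
        v = list(row)
        for b in basis:
            c = dot(v, b)
            v = [vi - c * bi for vi, bi in zip(v, b)]
        norm = dot(v, v) ** 0.5
        if norm > tol:
            basis.append([vi / norm for vi in v])
    return basis


def decompose_activation_change(readout_rows, delta):
    """Split an activation change into a part the prior readout can see
    (span of its rows) and a part it cannot (its null space)."""
    in_range = [0.0] * len(delta)
    for b in orthonormal_basis(readout_rows):
        c = dot(delta, b)
        in_range = [p + c * bi for p, bi in zip(in_range, b)]
    in_null = [d - p for d, p in zip(delta, in_range)]
    return in_range, in_null
```

For a readout with rows `[1, 0, 0]` and `[0, 1, 0]`, a change along the third hidden unit lands entirely in the null-space part and leaves the old readout's outputs unchanged, which is the sense in which such changes leave room for plasticity without costing stability.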

Diagnosing Catastrophe: Large parts of accuracy loss in continual learning can be accounted for by readout misalignment

Oct 09, 2023

Daniel Anthes, Sushrut Thorat, Peter König, Tim C. Kietzmann

Abstract: Unlike primates, artificial neural networks trained on changing data distributions rapidly lose performance on old tasks. This phenomenon is commonly referred to as catastrophic forgetting. In this paper, we investigate the representational changes that underlie this performance decrease and identify three distinct processes that together account for the phenomenon. The largest component is a misalignment between hidden representations and readout layers. Misalignment occurs due to learning on additional tasks and causes internal representations to shift. Representational geometry is partially conserved under this misalignment and only a small part of the information is irrecoverably lost. All types of representational changes scale with the dimensionality of hidden representations. These insights have implications for deep learning applications that need to be continuously updated, but may also aid in aligning ANN models with the more robust biological vision.

* 3 pages, 1 figure; published at the 2023 Conference on Cognitive Computational Neuroscience

Characterising representation dynamics in recurrent neural networks for object recognition

Aug 23, 2023

Sushrut Thorat, Adrien Doerig, Tim C. Kietzmann

Abstract: Recurrent neural networks (RNNs) have yielded promising results for both recognizing objects in challenging conditions and modeling aspects of primate vision. However, the representational dynamics of recurrent computations remain poorly understood, especially in large-scale visual models. Here, we studied such dynamics in RNNs trained for object classification on MiniEcoset, a novel subset of ecoset. We report two main insights. First, upon inference, representations continued to evolve after correct classification, suggesting a lack of the notion of being "done with classification". Second, focusing on "readout zones" as a way to characterize the activation trajectories, we observe that misclassified representations exhibit activation patterns with lower L2 norm, and are positioned more peripherally in the readout zones. Such arrangements help the misclassified representations move into the correct zones as time progresses. Our findings generalize to networks with lateral and top-down connections, and include both additive and multiplicative interactions with the bottom-up sweep. The results therefore contribute to a general understanding of RNN dynamics in naturalistic tasks. We hope that the analysis framework will aid future investigations of other types of RNNs, including understanding of representational dynamics in primate vision.

* 8 pages, 6 figures; revision of our Conference on Cognitive Computational Neuroscience (CCN) 2023 paper

Category-orthogonal object features guide information processing in recurrent neural networks trained for object categorization

Nov 15, 2021

Sushrut Thorat, Giacomo Aldegheri, Tim C. Kietzmann

Abstract: Recurrent neural networks (RNNs) have been shown to perform better than feedforward architectures in visual object categorization tasks, especially in challenging conditions such as cluttered images. However, little is known about the exact computational role of recurrent information flow in these conditions. Here we test RNNs trained for object categorization on the hypothesis that recurrence iteratively aids object categorization via the communication of category-orthogonal auxiliary variables (the location, orientation, and scale of the object). Using diagnostic linear readouts, we find that: (a) information about auxiliary variables increases across time in all network layers, (b) this information is indeed present in the recurrent information flow, and (c) its manipulation significantly affects task performance. These observations confirm the hypothesis that category-orthogonal auxiliary variable information is conveyed through recurrent connectivity and is used to optimize category inference in cluttered environments.

* 11 pages, 7 figures, peer-reviewed and accepted at the SVRHM 2021 workshop at NeurIPS

Modulation of early visual processing alleviates capacity limits in solving multiple tasks

Jul 30, 2019

Sushrut Thorat, Giacomo Aldegheri, Marcel A. J. van Gerven, Marius V. Peelen

Abstract: In daily life situations, we have to perform multiple tasks given a visual stimulus, which requires task-relevant information to be transmitted through our visual system. When it is not possible to transmit all the possibly relevant information to higher layers, due to a bottleneck, task-based modulation of early visual processing might be necessary. In this work, we report how the effectiveness of modulating the early processing stage of an artificial neural network depends on the information bottleneck faced by the network. The bottleneck is quantified by the number of tasks the network has to perform and the neural capacity of the later stage of the network. The effectiveness is gauged by the performance on multiple object detection tasks, where the network is trained with a recent multi-task optimization scheme. By associating neural modulations with task-based switching of the state of the network and characterizing when such switching is helpful in early processing, our results provide a functional perspective towards understanding why task-based modulation of early neural processes might be observed in the primate visual cortex.

* 4 pages, 2 figures, accepted at the 2019 Conference on Cognitive Computational Neuroscience

The functional role of cue-driven feature-based feedback in object recognition

Mar 25, 2019

Sushrut Thorat, Marcel van Gerven, Marius Peelen

Abstract: Visual object recognition is not a trivial task, especially when the objects are degraded or surrounded by clutter or presented briefly. External cues (such as verbal cues or visual context) can boost recognition performance in such conditions. In this work, we build an artificial neural network to model the interaction between the object processing stream (OPS) and the cue. We study the effects of varying neural and representational capacities of the OPS on the performance boost provided by cue-driven feature-based feedback in the OPS. We observe that the feedback provides performance boosts only if the category-specific features about the objects cannot be fully represented in the OPS. This representational limit is more dependent on task demands than neural capacity. We also observe that the feedback scheme trained to maximise recognition performance boost is not the same as tuning-based feedback, and actually performs better than tuning-based feedback.

* 4 pages, 4 figures, published at the Conference on Cognitive Computational Neuroscience (CCN) 2018

Implementing a Reverse Dictionary, based on word definitions, using a Node-Graph Architecture

Dec 17, 2016

Sushrut Thorat, Varad Choudhari

Abstract: In this paper, we outline an approach to build graph-based reverse dictionaries using word definitions. A reverse dictionary takes a phrase as an input and outputs a list of words semantically similar to that phrase. It is a solution to the Tip-of-the-Tongue problem. We use a distance-based similarity measure, computed on a graph, to assess the similarity between a word and the input phrase. We compare the performance of our approach with the Onelook Reverse Dictionary and a distributional semantics method based on word2vec, and show that our approach is much better than the distributional semantics method, and as good as Onelook, on a 3k lexicon. This simple approach sets a new performance baseline for reverse dictionaries.

* Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pages 2797-2806, Osaka, Japan, December 11-17 2016
* Included publication information
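
The idea of a distance-based similarity computed on a definition graph, as described in the abstract above, can be sketched as follows. The abstract does not give the actual measure, so everything here is an assumption: edges run from a word to the words in its definition, distance is breadth-first hop count, the score `1 / (1 + distance)` is summed over the phrase words, and the toy lexicon is invented for illustration.

```python
from collections import deque


def bfs_distances(graph, source):
    """Hop counts from `source` along word -> definition-word edges."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for nxt in graph.get(node, ()):
            if nxt not in dist:
                dist[nxt] = dist[node] + 1
                queue.append(nxt)
    return dist


def rank_candidates(graph, phrase_words, candidates):
    """Rank candidate words by a hypothetical inverse-distance score
    to the words of the input phrase (unreachable words add nothing)."""
    scores = {}
    for cand in candidates:
        dist = bfs_distances(graph, cand)
        scores[cand] = sum(1.0 / (1.0 + dist[w])
                           for w in phrase_words if w in dist)
    return sorted(candidates, key=lambda c: scores[c], reverse=True)


# Toy lexicon: each word maps to the words used in its definition.
toy_graph = {
    "kitten": ["young", "cat"],
    "cat": ["small", "feline", "animal"],
    "puppy": ["young", "dog"],
    "dog": ["animal"],
}
ranking = rank_candidates(toy_graph, ["young", "feline"],
                          ["kitten", "puppy", "dog"])
```

With this toy graph, "kitten" reaches both phrase words, "puppy" reaches only "young", and "dog" reaches neither, so the ranking follows that order.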