Picture for Adityanarayanan Radhakrishnan

Adityanarayanan Radhakrishnan

Aggregate and conquer: detecting and steering LLM concepts by combining nonlinear predictors over multiple layers

Add code
Feb 06, 2025
Figure 1 for Aggregate and conquer: detecting and steering LLM concepts by combining nonlinear predictors over multiple layers
Figure 2 for Aggregate and conquer: detecting and steering LLM concepts by combining nonlinear predictors over multiple layers
Figure 3 for Aggregate and conquer: detecting and steering LLM concepts by combining nonlinear predictors over multiple layers
Figure 4 for Aggregate and conquer: detecting and steering LLM concepts by combining nonlinear predictors over multiple layers
Viaarxiv icon

Context-Scaling versus Task-Scaling in In-Context Learning

Add code
Oct 16, 2024
Figure 1 for Context-Scaling versus Task-Scaling in In-Context Learning
Figure 2 for Context-Scaling versus Task-Scaling in In-Context Learning
Figure 3 for Context-Scaling versus Task-Scaling in In-Context Learning
Figure 4 for Context-Scaling versus Task-Scaling in In-Context Learning
Viaarxiv icon

Emergence in non-neural models: grokking modular arithmetic via average gradient outer product

Add code
Jul 29, 2024
Viaarxiv icon

Linear Recursive Feature Machines provably recover low-rank matrices

Add code
Jan 09, 2024
Viaarxiv icon

Mechanism of feature learning in convolutional neural networks

Add code
Sep 01, 2023
Figure 1 for Mechanism of feature learning in convolutional neural networks
Figure 2 for Mechanism of feature learning in convolutional neural networks
Figure 3 for Mechanism of feature learning in convolutional neural networks
Figure 4 for Mechanism of feature learning in convolutional neural networks
Viaarxiv icon

Catapults in SGD: spikes in the training loss and their impact on generalization through feature learning

Add code
Jun 07, 2023
Viaarxiv icon

Feature learning in neural networks and kernel machines that recursively learn features

Add code
Dec 28, 2022
Viaarxiv icon

Transfer Learning with Kernel Methods

Add code
Nov 01, 2022
Viaarxiv icon

Quadratic models for understanding neural network dynamics

Add code
May 24, 2022
Figure 1 for Quadratic models for understanding neural network dynamics
Figure 2 for Quadratic models for understanding neural network dynamics
Figure 3 for Quadratic models for understanding neural network dynamics
Figure 4 for Quadratic models for understanding neural network dynamics
Viaarxiv icon

Wide and Deep Neural Networks Achieve Optimality for Classification

Add code
Apr 29, 2022
Figure 1 for Wide and Deep Neural Networks Achieve Optimality for Classification
Figure 2 for Wide and Deep Neural Networks Achieve Optimality for Classification
Figure 3 for Wide and Deep Neural Networks Achieve Optimality for Classification
Viaarxiv icon