Mikhail Belkin

A Gap Between the Gaussian RKHS and Neural Networks: An Infinite-Center Asymptotic Analysis

Feb 22, 2025
Via arXiv

Task Generalization With AutoRegressive Compositional Structure: Can Learning From $D$ Tasks Generalize to $D^{T}$ Tasks?

Feb 13, 2025
Via arXiv

Aggregate and conquer: detecting and steering LLM concepts by combining nonlinear predictors over multiple layers

Feb 06, 2025
Via arXiv

Fast training of large kernel models with delayed projections

Nov 25, 2024
Via arXiv

Mirror Descent on Reproducing Kernel Banach Spaces

Nov 18, 2024
Via arXiv

Context-Scaling versus Task-Scaling in In-Context Learning

Oct 16, 2024
Via arXiv

Emergence in non-neural models: grokking modular arithmetic via average gradient outer product

Jul 29, 2024
Via arXiv

Average gradient outer product as a mechanism for deep neural collapse

Feb 21, 2024
Via arXiv

Unmemorization in Large Language Models via Self-Distillation and Deliberate Imagination

Feb 15, 2024
Via arXiv

Linear Recursive Feature Machines provably recover low-rank matrices

Jan 09, 2024
Via arXiv