Picture for Ivan Oseledets

Ivan Oseledets

AIRI, Skolkovo Institute of Science and Technology

OrtSAE: Orthogonal Sparse Autoencoders Uncover Atomic Features

Add code
Sep 26, 2025
Viaarxiv icon

The Rogue Scalpel: Activation Steering Compromises LLM Safety

Add code
Sep 26, 2025
Viaarxiv icon

Low-rank surrogate modeling and stochastic zero-order optimization for training of neural networks with black-box layers

Add code
Sep 18, 2025
Viaarxiv icon

Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models

Add code
Jun 07, 2025
Viaarxiv icon

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Add code
Jun 05, 2025
Viaarxiv icon

Geological Field Restoration through the Lens of Image Inpainting

Add code
Jun 05, 2025
Viaarxiv icon

Diagonal Batching Unlocks Parallelism in Recurrent Memory Transformers for Long Contexts

Add code
Jun 05, 2025
Viaarxiv icon

Curse of Slicing: Why Sliced Mutual Information is a Deceptive Measure of Statistical Dependence

Add code
Jun 04, 2025
Viaarxiv icon

One Task Vector is not Enough: A Large-Scale Study for In-Context Learning

Add code
May 29, 2025
Viaarxiv icon

Exploring the Latent Capacity of LLMs for One-Step Text Generation

Add code
May 27, 2025
Viaarxiv icon