Mahdi Soltanolkotabi

ATHENA: Adaptive Test-Time Steering for Improving Count Fidelity in Diffusion Models

Mar 20, 2026

Learning to Recall with Transformers Beyond Orthogonal Embeddings

Mar 16, 2026

Training Dynamics of Softmax Self-Attention: Fast Global Convergence via Preconditioning

Mar 02, 2026

CrispEdit: Low-Curvature Projections for Scalable Non-Destructive LLM Editing

Feb 17, 2026

Asymmetric Prompt Weighting for Reinforcement Learning with Verifiable Rewards

Feb 11, 2026

Full-Batch Gradient Descent Outperforms One-Pass SGD: Sample Complexity Separation in Single-Index Learning

Feb 02, 2026

Hyperphantasia: A Benchmark for Evaluating the Mental Visualization Capabilities of Multimodal LLMs

Jul 16, 2025

The Rich and the Simple: On the Implicit Bias of Adam and SGD

May 29, 2025

Emergence and Evolution of Interpretable Concepts in Diffusion Models

Apr 21, 2025

Test-Time Training Provably Improves Transformers as In-context Learners

Mar 14, 2025