Picture for Halil Alperen Gozeten

Halil Alperen Gozeten

Test-Time Training Provably Improves Transformers as In-context Learners

Add code
Mar 14, 2025
Viaarxiv icon

High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws

Add code
Oct 24, 2024
Viaarxiv icon