Boris Nazarov

Attention Condensation via Sparsity Induced Regularized Training

Mar 03, 2025
Via arXiv

Rethinking Data: Towards Better Performing Domain-Specific Small Language Models

Mar 03, 2025
Via arXiv