Picture for Darya Frolova

Darya Frolova

Attention Condensation via Sparsity Induced Regularized Training

Add code
Mar 03, 2025
Viaarxiv icon

Rethinking Data: Towards Better Performing Domain-Specific Small Language Models

Add code
Mar 03, 2025
Viaarxiv icon