Eugene Belilovsky

MILA

Efficient Refusal Ablation in LLM through Optimal Transport

Mar 04, 2026
Via arXiv

Celo2: Towards Learned Optimization Free Lunch

Feb 22, 2026
Via arXiv

Stabilizing Native Low-Rank LLM Pretraining

Feb 12, 2026
Via arXiv

Understanding and Exploiting Weight Update Sparsity for Communication-Efficient Distributed RL

Feb 03, 2026
Via arXiv

Heterogeneous Low-Bandwidth Pre-Training of LLMs

Jan 05, 2026
Via arXiv

When Data Falls Short: Grokking Below the Critical Threshold

Nov 06, 2025
Via arXiv

Warming Up for Zeroth-Order Federated Pre-Training with Low Resource Clients

Sep 03, 2025
Via arXiv

Less is More: Undertraining Experts Improves Model Upcycling

Jun 17, 2025
Via arXiv

PyLO: Towards Accessible Learned Optimizers in PyTorch

Jun 12, 2025
Via arXiv

MuLoCo: Muon is a practical inner optimizer for DiLoCo

May 29, 2025
Via arXiv