Picture for Edouard Oyallon

Edouard Oyallon

MLIA

Unifying Local Communications and Local Updates for LLM Pretraining

Add code
Jun 09, 2026
Viaarxiv icon

Learned Subspace Compression for Communication-Efficient Pipeline Parallelism

Add code
Jun 03, 2026
Viaarxiv icon

Unbiased Approximate Vector-Jacobian Products for Efficient Backpropagation

Add code
Feb 16, 2026
Viaarxiv icon

Stabilizing Native Low-Rank LLM Pretraining

Add code
Feb 12, 2026
Viaarxiv icon

Test-time Generalization for Physics through Neural Operator Splitting

Add code
Jan 31, 2026
Viaarxiv icon

DISCO: learning to DISCover an evolution Operator for multi-physics-agnostic prediction

Add code
Apr 28, 2025
Viaarxiv icon

PETRA: Parallel End-to-end Training with Reversible Architectures

Add code
Jun 04, 2024
Figure 1 for PETRA: Parallel End-to-end Training with Reversible Architectures
Figure 2 for PETRA: Parallel End-to-end Training with Reversible Architectures
Figure 3 for PETRA: Parallel End-to-end Training with Reversible Architectures
Figure 4 for PETRA: Parallel End-to-end Training with Reversible Architectures
Viaarxiv icon

ACCO: Accumulate while you Communicate, Hiding Communications in Distributed LLM Training

Add code
Jun 03, 2024
Figure 1 for ACCO: Accumulate while you Communicate, Hiding Communications in Distributed LLM Training
Figure 2 for ACCO: Accumulate while you Communicate, Hiding Communications in Distributed LLM Training
Figure 3 for ACCO: Accumulate while you Communicate, Hiding Communications in Distributed LLM Training
Figure 4 for ACCO: Accumulate while you Communicate, Hiding Communications in Distributed LLM Training
Viaarxiv icon

$μ$LO: Compute-Efficient Meta-Generalization of Learned Optimizers

Add code
May 31, 2024
Figure 1 for $μ$LO: Compute-Efficient Meta-Generalization of Learned Optimizers
Figure 2 for $μ$LO: Compute-Efficient Meta-Generalization of Learned Optimizers
Figure 3 for $μ$LO: Compute-Efficient Meta-Generalization of Learned Optimizers
Figure 4 for $μ$LO: Compute-Efficient Meta-Generalization of Learned Optimizers
Viaarxiv icon

WASH: Train your Ensemble with Communication-Efficient Weight Shuffling, then Average

Add code
May 27, 2024
Figure 1 for WASH: Train your Ensemble with Communication-Efficient Weight Shuffling, then Average
Figure 2 for WASH: Train your Ensemble with Communication-Efficient Weight Shuffling, then Average
Figure 3 for WASH: Train your Ensemble with Communication-Efficient Weight Shuffling, then Average
Figure 4 for WASH: Train your Ensemble with Communication-Efficient Weight Shuffling, then Average
Viaarxiv icon