Picture for Edouard Oyallon

Edouard Oyallon

MLIA, CNRS, ISIR, SU

PETRA: Parallel End-to-end Training with Reversible Architectures

Add code
Jun 04, 2024
Viaarxiv icon

ACCO: Accumulate while you Communicate, Hiding Communications in Distributed LLM Training

Add code
Jun 03, 2024
Viaarxiv icon

$μ$LO: Compute-Efficient Meta-Generalization of Learned Optimizers

Add code
May 31, 2024
Viaarxiv icon

WASH: Train your Ensemble with Communication-Efficient Weight Shuffling, then Average

Add code
May 27, 2024
Viaarxiv icon

Cyclic Data Parallelism for Efficient Parallelism of Deep Neural Networks

Add code
Mar 13, 2024
Viaarxiv icon

Vectorizing string entries for data processing on tables: when are larger language models better?

Add code
Dec 15, 2023
Figure 1 for Vectorizing string entries for data processing on tables: when are larger language models better?
Figure 2 for Vectorizing string entries for data processing on tables: when are larger language models better?
Figure 3 for Vectorizing string entries for data processing on tables: when are larger language models better?
Figure 4 for Vectorizing string entries for data processing on tables: when are larger language models better?
Viaarxiv icon

$\textbf{A}^2\textbf{CiD}^2$: Accelerating Asynchronous Communication in Decentralized Deep Learning

Add code
Jun 14, 2023
Viaarxiv icon

Can Forward Gradient Match Backpropagation?

Add code
Jun 12, 2023
Viaarxiv icon

Guiding The Last Layer in Federated Learning with Pre-Trained Models

Add code
Jun 06, 2023
Viaarxiv icon

DADAO: Decoupled Accelerated Decentralized Asynchronous Optimization for Time-Varying Gossips

Add code
Jul 26, 2022
Figure 1 for DADAO: Decoupled Accelerated Decentralized Asynchronous Optimization for Time-Varying Gossips
Figure 2 for DADAO: Decoupled Accelerated Decentralized Asynchronous Optimization for Time-Varying Gossips
Figure 3 for DADAO: Decoupled Accelerated Decentralized Asynchronous Optimization for Time-Varying Gossips
Figure 4 for DADAO: Decoupled Accelerated Decentralized Asynchronous Optimization for Time-Varying Gossips
Viaarxiv icon