Picture for Marco Cuturi

Marco Cuturi

CREST, ENSAE ParisTech

The Design Space of Tri-Modal Masked Diffusion Models

Add code
Feb 25, 2026
Viaarxiv icon

LaCy: What Small Language Models Can and Should Learn is Not Just a Question of Loss

Add code
Feb 13, 2026
Viaarxiv icon

Completed Hyperparameter Transfer across Modules, Width, Depth, Batch and Duration

Add code
Dec 26, 2025
Viaarxiv icon

Learning Unmasking Policies for Diffusion Language Models

Add code
Dec 12, 2025
Viaarxiv icon

The Data-Quality Illusion: Rethinking Classifier-Based Quality Filtering for LLM Pretraining

Add code
Oct 02, 2025
Figure 1 for The Data-Quality Illusion: Rethinking Classifier-Based Quality Filtering for LLM Pretraining
Figure 2 for The Data-Quality Illusion: Rethinking Classifier-Based Quality Filtering for LLM Pretraining
Figure 3 for The Data-Quality Illusion: Rethinking Classifier-Based Quality Filtering for LLM Pretraining
Figure 4 for The Data-Quality Illusion: Rethinking Classifier-Based Quality Filtering for LLM Pretraining
Viaarxiv icon

The Geometries of Truth Are Orthogonal Across Tasks

Add code
Jun 10, 2025
Viaarxiv icon

On Fitting Flow Models with Large Sinkhorn Couplings

Add code
Jun 05, 2025
Viaarxiv icon

Sample and Map from a Single Convex Potential: Generation using Conjugate Moment Measures

Add code
Mar 13, 2025
Viaarxiv icon

Scaling Laws for Forgetting during Finetuning with Pretraining Data Injection

Add code
Feb 09, 2025
Viaarxiv icon

Multivariate Conformal Prediction using Optimal Transport

Add code
Feb 05, 2025
Figure 1 for Multivariate Conformal Prediction using Optimal Transport
Figure 2 for Multivariate Conformal Prediction using Optimal Transport
Figure 3 for Multivariate Conformal Prediction using Optimal Transport
Figure 4 for Multivariate Conformal Prediction using Optimal Transport
Viaarxiv icon