Picture for Matthieu Wyart

Matthieu Wyart

Sampling Data with Chains of Forward-Backward Diffusion Steps

Add code
May 26, 2026
Viaarxiv icon

Learn from your own latents and not from tokens: A sample-complexity theory

Add code
May 26, 2026
Viaarxiv icon

Hierarchical Concept Geometry in Language Models Emerges from Word Co-occurrence

Add code
May 22, 2026
Viaarxiv icon

Symmetry in language statistics shapes the geometry of model representations

Add code
Feb 16, 2026
Viaarxiv icon

Deep networks learn to parse uniform-depth context-free languages from local statistics

Add code
Feb 09, 2026
Viaarxiv icon

Deriving Neural Scaling Laws from the statistics of natural language

Add code
Feb 07, 2026
Viaarxiv icon

On the Emergence of Linear Analogies in Word Embeddings

Add code
May 24, 2025
Viaarxiv icon

Bigger Isn't Always Memorizing: Early Stopping Overparameterized Diffusion Models

Add code
May 22, 2025
Figure 1 for Bigger Isn't Always Memorizing: Early Stopping Overparameterized Diffusion Models
Figure 2 for Bigger Isn't Always Memorizing: Early Stopping Overparameterized Diffusion Models
Figure 3 for Bigger Isn't Always Memorizing: Early Stopping Overparameterized Diffusion Models
Figure 4 for Bigger Isn't Always Memorizing: Early Stopping Overparameterized Diffusion Models
Viaarxiv icon

Scaling Laws and Representation Learning in Simple Hierarchical Languages: Transformers vs. Convolutional Architectures

Add code
May 11, 2025
Figure 1 for Scaling Laws and Representation Learning in Simple Hierarchical Languages: Transformers vs. Convolutional Architectures
Figure 2 for Scaling Laws and Representation Learning in Simple Hierarchical Languages: Transformers vs. Convolutional Architectures
Figure 3 for Scaling Laws and Representation Learning in Simple Hierarchical Languages: Transformers vs. Convolutional Architectures
Figure 4 for Scaling Laws and Representation Learning in Simple Hierarchical Languages: Transformers vs. Convolutional Architectures
Viaarxiv icon

Learning curves theory for hierarchically compositional data with power-law distributed features

Add code
May 11, 2025
Viaarxiv icon