Picture for David Grangier

David Grangier

Assessing the Role of Data Quality in Training Bilingual Language Models

Add code
Jun 15, 2025
Viaarxiv icon

Scaling Laws for Forgetting during Finetuning with Pretraining Data Injection

Add code
Feb 09, 2025
Viaarxiv icon

Soup-of-Experts: Pretraining Specialist Models via Parameters Averaging

Add code
Feb 03, 2025
Viaarxiv icon

Training Bilingual LMs with Data Constraints in the Targeted Language

Add code
Nov 20, 2024
Figure 1 for Training Bilingual LMs with Data Constraints in the Targeted Language
Figure 2 for Training Bilingual LMs with Data Constraints in the Targeted Language
Figure 3 for Training Bilingual LMs with Data Constraints in the Targeted Language
Figure 4 for Training Bilingual LMs with Data Constraints in the Targeted Language
Viaarxiv icon

Aggregate-and-Adapt Natural Language Prompts for Downstream Generalization of CLIP

Add code
Oct 31, 2024
Viaarxiv icon

No Need to Talk: Asynchronous Mixture of Language Models

Add code
Oct 04, 2024
Figure 1 for No Need to Talk: Asynchronous Mixture of Language Models
Figure 2 for No Need to Talk: Asynchronous Mixture of Language Models
Figure 3 for No Need to Talk: Asynchronous Mixture of Language Models
Figure 4 for No Need to Talk: Asynchronous Mixture of Language Models
Viaarxiv icon

Dynamic Gradient Alignment for Online Data Mixing

Add code
Oct 03, 2024
Viaarxiv icon

The AdEMAMix Optimizer: Better, Faster, Older

Add code
Sep 05, 2024
Figure 1 for The AdEMAMix Optimizer: Better, Faster, Older
Figure 2 for The AdEMAMix Optimizer: Better, Faster, Older
Figure 3 for The AdEMAMix Optimizer: Better, Faster, Older
Figure 4 for The AdEMAMix Optimizer: Better, Faster, Older
Viaarxiv icon

Specialized Language Models with Cheap Inference from Limited Domain Data

Add code
Feb 02, 2024
Figure 1 for Specialized Language Models with Cheap Inference from Limited Domain Data
Figure 2 for Specialized Language Models with Cheap Inference from Limited Domain Data
Figure 3 for Specialized Language Models with Cheap Inference from Limited Domain Data
Figure 4 for Specialized Language Models with Cheap Inference from Limited Domain Data
Viaarxiv icon

Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling

Add code
Jan 29, 2024
Figure 1 for Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling
Figure 2 for Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling
Figure 3 for Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling
Figure 4 for Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling
Viaarxiv icon