Picture for David Grangier

David Grangier

Training Bilingual LMs with Data Constraints in the Targeted Language

Add code
Nov 20, 2024
Viaarxiv icon

Aggregate-and-Adapt Natural Language Prompts for Downstream Generalization of CLIP

Add code
Oct 31, 2024
Viaarxiv icon

No Need to Talk: Asynchronous Mixture of Language Models

Add code
Oct 04, 2024
Figure 1 for No Need to Talk: Asynchronous Mixture of Language Models
Figure 2 for No Need to Talk: Asynchronous Mixture of Language Models
Figure 3 for No Need to Talk: Asynchronous Mixture of Language Models
Figure 4 for No Need to Talk: Asynchronous Mixture of Language Models
Viaarxiv icon

Dynamic Gradient Alignment for Online Data Mixing

Add code
Oct 03, 2024
Viaarxiv icon

The AdEMAMix Optimizer: Better, Faster, Older

Add code
Sep 05, 2024
Figure 1 for The AdEMAMix Optimizer: Better, Faster, Older
Figure 2 for The AdEMAMix Optimizer: Better, Faster, Older
Figure 3 for The AdEMAMix Optimizer: Better, Faster, Older
Figure 4 for The AdEMAMix Optimizer: Better, Faster, Older
Viaarxiv icon

Specialized Language Models with Cheap Inference from Limited Domain Data

Add code
Feb 02, 2024
Viaarxiv icon

Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling

Add code
Jan 29, 2024
Figure 1 for Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling
Figure 2 for Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling
Figure 3 for Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling
Figure 4 for Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling
Viaarxiv icon

Adaptive Training Distributions with Scalable Online Bilevel Optimization

Add code
Nov 20, 2023
Figure 1 for Adaptive Training Distributions with Scalable Online Bilevel Optimization
Figure 2 for Adaptive Training Distributions with Scalable Online Bilevel Optimization
Figure 3 for Adaptive Training Distributions with Scalable Online Bilevel Optimization
Figure 4 for Adaptive Training Distributions with Scalable Online Bilevel Optimization
Viaarxiv icon

Transfer Learning for Structured Pruning under Limited Task Data

Add code
Nov 10, 2023
Viaarxiv icon

High-Resource Methodological Bias in Low-Resource Investigations

Add code
Nov 14, 2022
Viaarxiv icon