Jean Kaddour

Attention Is All You Need But You Don't Need All Of It For Inference of Large Language Models

Jul 22, 2024

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Jun 26, 2024

Are We Done with MMLU?

Jun 07, 2024

Text Data Augmentation in Low-Resource Settings via Fine-Tuning of Large Language Models

Oct 02, 2023

No Train No Gain: Revisiting Efficient Training Algorithms For Transformer-based Language Models

Jul 26, 2023

Challenges and Applications of Large Language Models

Jul 19, 2023

Understanding the Effectiveness of Early Weight Averaging for Training Large Language Models

Jun 05, 2023

TTIDA: Controllable Generative Data Augmentation via Text-to-Text and Text-to-Image Models

Apr 18, 2023

The MiniPile Challenge for Data-Efficient Language Models

Apr 17, 2023

Spawrious: A Benchmark for Fine Control of Spurious Correlation Biases

Mar 09, 2023