Picture for Alexander Wettig

Alexander Wettig

Establishing Task Scaling Laws via Compute-Efficient Model Ladders

Add code
Dec 05, 2024
Viaarxiv icon

How to Train Long-Context Language Models (Effectively)

Add code
Oct 03, 2024
Viaarxiv icon

OLMoE: Open Mixture-of-Experts Language Models

Add code
Sep 03, 2024
Figure 1 for OLMoE: Open Mixture-of-Experts Language Models
Figure 2 for OLMoE: Open Mixture-of-Experts Language Models
Figure 3 for OLMoE: Open Mixture-of-Experts Language Models
Figure 4 for OLMoE: Open Mixture-of-Experts Language Models
Viaarxiv icon

Finding Transformer Circuits with Edge Pruning

Add code
Jun 24, 2024
Figure 1 for Finding Transformer Circuits with Edge Pruning
Figure 2 for Finding Transformer Circuits with Edge Pruning
Figure 3 for Finding Transformer Circuits with Edge Pruning
Figure 4 for Finding Transformer Circuits with Edge Pruning
Viaarxiv icon

Language Models as Science Tutors

Add code
Feb 16, 2024
Viaarxiv icon

QuRating: Selecting High-Quality Data for Training Language Models

Add code
Feb 15, 2024
Viaarxiv icon

Poisoning Retrieval Corpora by Injecting Adversarial Passages

Add code
Oct 29, 2023
Viaarxiv icon

SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

Add code
Oct 10, 2023
Viaarxiv icon

Learning Transformer Programs

Add code
Jun 01, 2023
Viaarxiv icon

Adapting Language Models to Compress Contexts

Add code
May 24, 2023
Viaarxiv icon