Picture for Alexander Wettig

Alexander Wettig

Metadata Conditioning Accelerates Language Model Pre-training

Add code
Jan 03, 2025
Figure 1 for Metadata Conditioning Accelerates Language Model Pre-training
Figure 2 for Metadata Conditioning Accelerates Language Model Pre-training
Figure 3 for Metadata Conditioning Accelerates Language Model Pre-training
Figure 4 for Metadata Conditioning Accelerates Language Model Pre-training
Viaarxiv icon

Establishing Task Scaling Laws via Compute-Efficient Model Ladders

Add code
Dec 05, 2024
Viaarxiv icon

How to Train Long-Context Language Models (Effectively)

Add code
Oct 03, 2024
Viaarxiv icon

OLMoE: Open Mixture-of-Experts Language Models

Add code
Sep 03, 2024
Figure 1 for OLMoE: Open Mixture-of-Experts Language Models
Figure 2 for OLMoE: Open Mixture-of-Experts Language Models
Figure 3 for OLMoE: Open Mixture-of-Experts Language Models
Figure 4 for OLMoE: Open Mixture-of-Experts Language Models
Viaarxiv icon

Finding Transformer Circuits with Edge Pruning

Add code
Jun 24, 2024
Figure 1 for Finding Transformer Circuits with Edge Pruning
Figure 2 for Finding Transformer Circuits with Edge Pruning
Figure 3 for Finding Transformer Circuits with Edge Pruning
Figure 4 for Finding Transformer Circuits with Edge Pruning
Viaarxiv icon

Language Models as Science Tutors

Add code
Feb 16, 2024
Figure 1 for Language Models as Science Tutors
Figure 2 for Language Models as Science Tutors
Figure 3 for Language Models as Science Tutors
Figure 4 for Language Models as Science Tutors
Viaarxiv icon

QuRating: Selecting High-Quality Data for Training Language Models

Add code
Feb 15, 2024
Viaarxiv icon

Poisoning Retrieval Corpora by Injecting Adversarial Passages

Add code
Oct 29, 2023
Viaarxiv icon

SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

Add code
Oct 10, 2023
Viaarxiv icon

Learning Transformer Programs

Add code
Jun 01, 2023
Viaarxiv icon