Picture for Tyler A. Chang

Tyler A. Chang

Scalable Influence and Fact Tracing for Large Language Model Pretraining

Add code
Oct 22, 2024
Viaarxiv icon

Goldfish: Monolingual Language Models for 350 Languages

Add code
Aug 19, 2024
Viaarxiv icon

Different Tokenization Schemes Lead to Comparable Performance in Spanish Number Agreement

Add code
Mar 20, 2024
Figure 1 for Different Tokenization Schemes Lead to Comparable Performance in Spanish Number Agreement
Figure 2 for Different Tokenization Schemes Lead to Comparable Performance in Spanish Number Agreement
Figure 3 for Different Tokenization Schemes Lead to Comparable Performance in Spanish Number Agreement
Figure 4 for Different Tokenization Schemes Lead to Comparable Performance in Spanish Number Agreement
Viaarxiv icon

Detecting Hallucination and Coverage Errors in Retrieval Augmented Generation for Controversial Topics

Add code
Mar 13, 2024
Viaarxiv icon

A Bit of a Problem: Measurement Disparities in Dataset Sizes Across Languages

Add code
Mar 01, 2024
Viaarxiv icon

Structural Priming Demonstrates Abstract Grammatical Representations in Multilingual Language Models

Add code
Nov 15, 2023
Viaarxiv icon

When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages

Add code
Nov 15, 2023
Figure 1 for When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages
Figure 2 for When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages
Figure 3 for When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages
Figure 4 for When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages
Viaarxiv icon

Crosslingual Structural Priming and the Pre-Training Dynamics of Bilingual Language Models

Add code
Oct 11, 2023
Viaarxiv icon

Characterizing Learning Curves During Language Model Pre-Training: Learning, Forgetting, and Stability

Add code
Aug 29, 2023
Viaarxiv icon

Characterizing and Measuring Linguistic Dataset Drift

Add code
May 26, 2023
Viaarxiv icon