Alisa Liu

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Nov 22, 2024
Via arXiv

Does Liking Yellow Imply Driving a School Bus? Semantic Leakage in Language Models

Aug 12, 2024
Via arXiv

Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data?

Jul 24, 2024
Via arXiv

A Taxonomy of Ambiguity Types for NLP

Mar 21, 2024
Via arXiv

Tuning Language Models by Proxy

Jan 16, 2024
Via arXiv

That was the last straw, we need more: Are Translation Systems Sensitive to Disambiguating Context?

Oct 23, 2023
Via arXiv

Inverse Scaling: When Bigger Isn't Better

Jun 15, 2023
Via arXiv

How Language Model Hallucinations Can Snowball

May 22, 2023
Via arXiv

We're Afraid Language Models Aren't Modeling Ambiguity

Apr 27, 2023
Via arXiv

Detoxifying Text with MaRCo: Controllable Revision with Experts and Anti-Experts

Dec 20, 2022
Via arXiv