Picture for Daphne Ippolito

Daphne Ippolito

Shammie

Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation

Add code
Dec 04, 2024
Viaarxiv icon

Measuring Non-Adversarial Reproduction of Training Data in Large Language Models

Add code
Nov 15, 2024
Viaarxiv icon

Chasing Random: Instruction Selection Strategies Fail to Generalize

Add code
Oct 19, 2024
Viaarxiv icon

Persistent Pre-Training Poisoning of LLMs

Add code
Oct 17, 2024
Viaarxiv icon

Human-aligned Chess with a Bit of Search

Add code
Oct 04, 2024
Figure 1 for Human-aligned Chess with a Bit of Search
Figure 2 for Human-aligned Chess with a Bit of Search
Figure 3 for Human-aligned Chess with a Bit of Search
Figure 4 for Human-aligned Chess with a Bit of Search
Viaarxiv icon

Customizing Large Language Model Generation Style using Parameter-Efficient Finetuning

Add code
Sep 06, 2024
Figure 1 for Customizing Large Language Model Generation Style using Parameter-Efficient Finetuning
Figure 2 for Customizing Large Language Model Generation Style using Parameter-Efficient Finetuning
Figure 3 for Customizing Large Language Model Generation Style using Parameter-Efficient Finetuning
Figure 4 for Customizing Large Language Model Generation Style using Parameter-Efficient Finetuning
Viaarxiv icon

Consent in Crisis: The Rapid Decline of the AI Data Commons

Add code
Jul 24, 2024
Figure 1 for Consent in Crisis: The Rapid Decline of the AI Data Commons
Figure 2 for Consent in Crisis: The Rapid Decline of the AI Data Commons
Figure 3 for Consent in Crisis: The Rapid Decline of the AI Data Commons
Figure 4 for Consent in Crisis: The Rapid Decline of the AI Data Commons
Viaarxiv icon

RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors

Add code
May 13, 2024
Figure 1 for RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
Figure 2 for RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
Figure 3 for RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
Figure 4 for RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
Viaarxiv icon

LMD3: Language Model Data Density Dependence

Add code
May 10, 2024
Viaarxiv icon

Forcing Diffuse Distributions out of Language Models

Add code
Apr 16, 2024
Figure 1 for Forcing Diffuse Distributions out of Language Models
Figure 2 for Forcing Diffuse Distributions out of Language Models
Figure 3 for Forcing Diffuse Distributions out of Language Models
Figure 4 for Forcing Diffuse Distributions out of Language Models
Viaarxiv icon