Xiaosen Zheng

Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates

Oct 09, 2024
Via arXiv

RegMix: Data Mixture as Regression for Language Model Pre-training

Jul 01, 2024
Via arXiv

Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses

Jun 03, 2024
Via arXiv

Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast

Feb 13, 2024
Via arXiv

Intriguing Properties of Data Attribution on Diffusion Models

Nov 01, 2023
Via arXiv

An Empirical Study of Memorization in NLP

Mar 23, 2022
Via arXiv