Picture for Xiang Lisa Li

Xiang Lisa Li

Auditing Prompt Caching in Language Model APIs

Add code
Feb 11, 2025
Viaarxiv icon

Eliciting Language Model Behaviors with Investigator Agents

Add code
Feb 03, 2025
Viaarxiv icon

s1: Simple test-time scaling

Add code
Jan 31, 2025
Figure 1 for s1: Simple test-time scaling
Figure 2 for s1: Simple test-time scaling
Figure 3 for s1: Simple test-time scaling
Figure 4 for s1: Simple test-time scaling
Viaarxiv icon

AutoBencher: Creating Salient, Novel, Difficult Datasets for Language Models

Add code
Jul 11, 2024
Viaarxiv icon

Few-Shot Recalibration of Language Models

Add code
Mar 27, 2024
Viaarxiv icon

On the Learnability of Watermarks for Language Models

Add code
Dec 07, 2023
Viaarxiv icon

Benchmarking and Improving Generator-Validator Consistency of Language Models

Add code
Oct 03, 2023
Viaarxiv icon

Learning to Compress Prompts with Gist Tokens

Add code
Apr 17, 2023
Viaarxiv icon

Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP

Add code
Dec 28, 2022
Viaarxiv icon

Evaluating Human-Language Model Interaction

Add code
Dec 20, 2022
Viaarxiv icon