Picture for Lucas Dixon

Lucas Dixon

Scalable Influence and Fact Tracing for Large Language Model Pretraining

Add code
Oct 22, 2024
Viaarxiv icon

Gemma 2: Improving Open Language Models at a Practical Size

Add code
Aug 02, 2024
Figure 1 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 2 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 3 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 4 for Gemma 2: Improving Open Language Models at a Practical Size
Viaarxiv icon

Who's asking? User personas and the mechanics of latent misalignment

Add code
Jun 17, 2024
Viaarxiv icon

Interactive Prompt Debugging with Sequence Salience

Add code
Apr 11, 2024
Viaarxiv icon

PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Add code
Mar 15, 2024
Figure 1 for PERL: Parameter Efficient Reinforcement Learning from Human Feedback
Figure 2 for PERL: Parameter Efficient Reinforcement Learning from Human Feedback
Figure 3 for PERL: Parameter Efficient Reinforcement Learning from Human Feedback
Figure 4 for PERL: Parameter Efficient Reinforcement Learning from Human Feedback
Viaarxiv icon

Detecting Hallucination and Coverage Errors in Retrieval Augmented Generation for Controversial Topics

Add code
Mar 13, 2024
Viaarxiv icon

Gemma: Open Models Based on Gemini Research and Technology

Add code
Mar 13, 2024
Figure 1 for Gemma: Open Models Based on Gemini Research and Technology
Figure 2 for Gemma: Open Models Based on Gemini Research and Technology
Figure 3 for Gemma: Open Models Based on Gemini Research and Technology
Figure 4 for Gemma: Open Models Based on Gemini Research and Technology
Viaarxiv icon

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

LLM Comparator: Visual Analytics for Side-by-Side Evaluation of Large Language Models

Add code
Feb 16, 2024
Viaarxiv icon

Decoding-time Realignment of Language Models

Add code
Feb 05, 2024
Viaarxiv icon