Picture for Lucas Weber

Lucas Weber

From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions

Add code
Feb 19, 2025
Viaarxiv icon

Interpretability of Language Models via Task Spaces

Add code
Jun 10, 2024
Figure 1 for Interpretability of Language Models via Task Spaces
Figure 2 for Interpretability of Language Models via Task Spaces
Figure 3 for Interpretability of Language Models via Task Spaces
Figure 4 for Interpretability of Language Models via Task Spaces
Viaarxiv icon

Reinforcement Learning and Regret Bounds for Admission Control

Add code
Jun 07, 2024
Viaarxiv icon

Efficient multi-prompt evaluation of LLMs

Add code
May 27, 2024
Viaarxiv icon

tinyBenchmarks: evaluating LLMs with fewer examples

Add code
Feb 22, 2024
Figure 1 for tinyBenchmarks: evaluating LLMs with fewer examples
Figure 2 for tinyBenchmarks: evaluating LLMs with fewer examples
Figure 3 for tinyBenchmarks: evaluating LLMs with fewer examples
Figure 4 for tinyBenchmarks: evaluating LLMs with fewer examples
Viaarxiv icon

The ICL Consistency Test

Add code
Dec 08, 2023
Viaarxiv icon

Mind the instructions: a holistic evaluation of consistency and interactions in prompt-based learning

Add code
Oct 20, 2023
Viaarxiv icon

Curriculum Learning with Adam: The Devil Is in the Wrong Details

Add code
Aug 23, 2023
Viaarxiv icon

Language Modelling as a Multi-Task Problem

Add code
Jan 27, 2021
Figure 1 for Language Modelling as a Multi-Task Problem
Figure 2 for Language Modelling as a Multi-Task Problem
Figure 3 for Language Modelling as a Multi-Task Problem
Figure 4 for Language Modelling as a Multi-Task Problem
Viaarxiv icon