Picture for Carolin Lawrence

Carolin Lawrence

Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators

Add code
Mar 25, 2025
Viaarxiv icon

Not-Just-Scaling Laws: Towards a Better Understanding of the Downstream Impact of Language Model Design Decisions

Add code
Mar 05, 2025
Viaarxiv icon

MEDDxAgent: A Unified Modular Agent Framework for Explainable Automatic Differential Diagnosis

Add code
Feb 26, 2025
Viaarxiv icon

Evaluating Language Models as Synthetic Data Generators

Add code
Dec 04, 2024
Figure 1 for Evaluating Language Models as Synthetic Data Generators
Figure 2 for Evaluating Language Models as Synthetic Data Generators
Figure 3 for Evaluating Language Models as Synthetic Data Generators
Figure 4 for Evaluating Language Models as Synthetic Data Generators
Viaarxiv icon

AgentQuest: A Modular Benchmark Framework to Measure Progress and Improve LLM Agents

Add code
Apr 09, 2024
Viaarxiv icon

Walking a Tightrope -- Evaluating Large Language Models in High-Risk Domains

Add code
Nov 25, 2023
Figure 1 for Walking a Tightrope -- Evaluating Large Language Models in High-Risk Domains
Figure 2 for Walking a Tightrope -- Evaluating Large Language Models in High-Risk Domains
Figure 3 for Walking a Tightrope -- Evaluating Large Language Models in High-Risk Domains
Figure 4 for Walking a Tightrope -- Evaluating Large Language Models in High-Risk Domains
Viaarxiv icon

Linking Surface Facts to Large-Scale Knowledge Graphs

Add code
Oct 23, 2023
Figure 1 for Linking Surface Facts to Large-Scale Knowledge Graphs
Figure 2 for Linking Surface Facts to Large-Scale Knowledge Graphs
Figure 3 for Linking Surface Facts to Large-Scale Knowledge Graphs
Figure 4 for Linking Surface Facts to Large-Scale Knowledge Graphs
Viaarxiv icon

Large Language Models Enable Few-Shot Clustering

Add code
Jul 02, 2023
Viaarxiv icon

Uncertainty Propagation in Node Classification

Add code
Apr 03, 2023
Viaarxiv icon

State-Regularized Recurrent Neural Networks to Extract Automata and Explain Predictions

Add code
Dec 10, 2022
Viaarxiv icon