Picture for Carolin Lawrence

Carolin Lawrence

AgentQuest: A Modular Benchmark Framework to Measure Progress and Improve LLM Agents

Add code
Apr 09, 2024
Viaarxiv icon

Walking a Tightrope -- Evaluating Large Language Models in High-Risk Domains

Add code
Nov 25, 2023
Viaarxiv icon

Linking Surface Facts to Large-Scale Knowledge Graphs

Add code
Oct 23, 2023
Viaarxiv icon

Large Language Models Enable Few-Shot Clustering

Add code
Jul 02, 2023
Viaarxiv icon

Uncertainty Propagation in Node Classification

Add code
Apr 03, 2023
Viaarxiv icon

State-Regularized Recurrent Neural Networks to Extract Automata and Explain Predictions

Add code
Dec 10, 2022
Viaarxiv icon

Multi-Source Survival Domain Adaptation

Add code
Dec 01, 2022
Viaarxiv icon

KGxBoard: Explainable and Interactive Leaderboard for Evaluation of Knowledge Graph Completion Models

Add code
Aug 23, 2022
Figure 1 for KGxBoard: Explainable and Interactive Leaderboard for Evaluation of Knowledge Graph Completion Models
Figure 2 for KGxBoard: Explainable and Interactive Leaderboard for Evaluation of Knowledge Graph Completion Models
Figure 3 for KGxBoard: Explainable and Interactive Leaderboard for Evaluation of Knowledge Graph Completion Models
Figure 4 for KGxBoard: Explainable and Interactive Leaderboard for Evaluation of Knowledge Graph Completion Models
Viaarxiv icon

Human-Centric Research for NLP: Towards a Definition and Guiding Questions

Add code
Jul 10, 2022
Figure 1 for Human-Centric Research for NLP: Towards a Definition and Guiding Questions
Figure 2 for Human-Centric Research for NLP: Towards a Definition and Guiding Questions
Figure 3 for Human-Centric Research for NLP: Towards a Definition and Guiding Questions
Figure 4 for Human-Centric Research for NLP: Towards a Definition and Guiding Questions
Viaarxiv icon

A Human-Centric Assessment Framework for AI

Add code
May 25, 2022
Figure 1 for A Human-Centric Assessment Framework for AI
Figure 2 for A Human-Centric Assessment Framework for AI
Viaarxiv icon