Picture for Elizabeth Daly

Elizabeth Daly

BenchmarkCards: Large Language Model and Risk Reporting

Add code
Oct 16, 2024
Figure 1 for BenchmarkCards: Large Language Model and Risk Reporting
Figure 2 for BenchmarkCards: Large Language Model and Risk Reporting
Figure 3 for BenchmarkCards: Large Language Model and Risk Reporting
Viaarxiv icon

WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia

Add code
Jun 19, 2024
Figure 1 for WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia
Figure 2 for WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia
Figure 3 for WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia
Figure 4 for WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia
Viaarxiv icon

Ranking Large Language Models without Ground Truth

Add code
Feb 21, 2024
Figure 1 for Ranking Large Language Models without Ground Truth
Figure 2 for Ranking Large Language Models without Ground Truth
Figure 3 for Ranking Large Language Models without Ground Truth
Figure 4 for Ranking Large Language Models without Ground Truth
Viaarxiv icon

Explaining Knock-on Effects of Bias Mitigation

Add code
Dec 01, 2023
Viaarxiv icon

Iterative Reward Shaping using Human Feedback for Correcting Reward Misspecification

Add code
Aug 30, 2023
Viaarxiv icon

Leveraging Explanations in Interactive Machine Learning: An Overview

Add code
Jul 29, 2022
Figure 1 for Leveraging Explanations in Interactive Machine Learning: An Overview
Figure 2 for Leveraging Explanations in Interactive Machine Learning: An Overview
Figure 3 for Leveraging Explanations in Interactive Machine Learning: An Overview
Figure 4 for Leveraging Explanations in Interactive Machine Learning: An Overview
Viaarxiv icon

Boolean Decision Rules for Reinforcement Learning Policy Summarisation

Add code
Jul 18, 2022
Figure 1 for Boolean Decision Rules for Reinforcement Learning Policy Summarisation
Figure 2 for Boolean Decision Rules for Reinforcement Learning Policy Summarisation
Figure 3 for Boolean Decision Rules for Reinforcement Learning Policy Summarisation
Figure 4 for Boolean Decision Rules for Reinforcement Learning Policy Summarisation
Viaarxiv icon

Contrastive Explanations for Comparing Preferences of Reinforcement Learning Agents

Add code
Dec 17, 2021
Figure 1 for Contrastive Explanations for Comparing Preferences of Reinforcement Learning Agents
Figure 2 for Contrastive Explanations for Comparing Preferences of Reinforcement Learning Agents
Figure 3 for Contrastive Explanations for Comparing Preferences of Reinforcement Learning Agents
Figure 4 for Contrastive Explanations for Comparing Preferences of Reinforcement Learning Agents
Viaarxiv icon

Designing Machine Learning Pipeline Toolkit for AutoML Surrogate Modeling Optimization

Add code
Jul 14, 2021
Figure 1 for Designing Machine Learning Pipeline Toolkit for AutoML Surrogate Modeling Optimization
Figure 2 for Designing Machine Learning Pipeline Toolkit for AutoML Surrogate Modeling Optimization
Figure 3 for Designing Machine Learning Pipeline Toolkit for AutoML Surrogate Modeling Optimization
Figure 4 for Designing Machine Learning Pipeline Toolkit for AutoML Surrogate Modeling Optimization
Viaarxiv icon

Generating Dialogue Agents via Automated Planning

Add code
Feb 02, 2019
Figure 1 for Generating Dialogue Agents via Automated Planning
Figure 2 for Generating Dialogue Agents via Automated Planning
Figure 3 for Generating Dialogue Agents via Automated Planning
Figure 4 for Generating Dialogue Agents via Automated Planning
Viaarxiv icon