Picture for Elizabeth Daly

Elizabeth Daly

BenchmarkCards: Large Language Model and Risk Reporting

Add code
Oct 16, 2024
Figure 1 for BenchmarkCards: Large Language Model and Risk Reporting
Figure 2 for BenchmarkCards: Large Language Model and Risk Reporting
Figure 3 for BenchmarkCards: Large Language Model and Risk Reporting
Viaarxiv icon

WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia

Add code
Jun 19, 2024
Figure 1 for WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia
Figure 2 for WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia
Figure 3 for WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia
Figure 4 for WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia
Viaarxiv icon

Ranking Large Language Models without Ground Truth

Add code
Feb 21, 2024
Figure 1 for Ranking Large Language Models without Ground Truth
Figure 2 for Ranking Large Language Models without Ground Truth
Figure 3 for Ranking Large Language Models without Ground Truth
Figure 4 for Ranking Large Language Models without Ground Truth
Viaarxiv icon

Explaining Knock-on Effects of Bias Mitigation

Add code
Dec 01, 2023
Viaarxiv icon

Iterative Reward Shaping using Human Feedback for Correcting Reward Misspecification

Add code
Aug 30, 2023
Figure 1 for Iterative Reward Shaping using Human Feedback for Correcting Reward Misspecification
Figure 2 for Iterative Reward Shaping using Human Feedback for Correcting Reward Misspecification
Figure 3 for Iterative Reward Shaping using Human Feedback for Correcting Reward Misspecification
Figure 4 for Iterative Reward Shaping using Human Feedback for Correcting Reward Misspecification
Viaarxiv icon

Leveraging Explanations in Interactive Machine Learning: An Overview

Add code
Jul 29, 2022
Figure 1 for Leveraging Explanations in Interactive Machine Learning: An Overview
Figure 2 for Leveraging Explanations in Interactive Machine Learning: An Overview
Figure 3 for Leveraging Explanations in Interactive Machine Learning: An Overview
Figure 4 for Leveraging Explanations in Interactive Machine Learning: An Overview
Viaarxiv icon

Boolean Decision Rules for Reinforcement Learning Policy Summarisation

Add code
Jul 18, 2022
Figure 1 for Boolean Decision Rules for Reinforcement Learning Policy Summarisation
Figure 2 for Boolean Decision Rules for Reinforcement Learning Policy Summarisation
Figure 3 for Boolean Decision Rules for Reinforcement Learning Policy Summarisation
Figure 4 for Boolean Decision Rules for Reinforcement Learning Policy Summarisation
Viaarxiv icon

Contrastive Explanations for Comparing Preferences of Reinforcement Learning Agents

Add code
Dec 17, 2021
Figure 1 for Contrastive Explanations for Comparing Preferences of Reinforcement Learning Agents
Figure 2 for Contrastive Explanations for Comparing Preferences of Reinforcement Learning Agents
Figure 3 for Contrastive Explanations for Comparing Preferences of Reinforcement Learning Agents
Figure 4 for Contrastive Explanations for Comparing Preferences of Reinforcement Learning Agents
Viaarxiv icon

Designing Machine Learning Pipeline Toolkit for AutoML Surrogate Modeling Optimization

Add code
Jul 14, 2021
Figure 1 for Designing Machine Learning Pipeline Toolkit for AutoML Surrogate Modeling Optimization
Figure 2 for Designing Machine Learning Pipeline Toolkit for AutoML Surrogate Modeling Optimization
Figure 3 for Designing Machine Learning Pipeline Toolkit for AutoML Surrogate Modeling Optimization
Figure 4 for Designing Machine Learning Pipeline Toolkit for AutoML Surrogate Modeling Optimization
Viaarxiv icon

Generating Dialogue Agents via Automated Planning

Add code
Feb 02, 2019
Figure 1 for Generating Dialogue Agents via Automated Planning
Figure 2 for Generating Dialogue Agents via Automated Planning
Figure 3 for Generating Dialogue Agents via Automated Planning
Figure 4 for Generating Dialogue Agents via Automated Planning
Viaarxiv icon