Picture for Florian E. Dorner

Florian E. Dorner

Limits to scalable evaluation at the frontier: LLM as Judge won't beat twice the data

Add code
Oct 17, 2024
Figure 1 for Limits to scalable evaluation at the frontier: LLM as Judge won't beat twice the data
Figure 2 for Limits to scalable evaluation at the frontier: LLM as Judge won't beat twice the data
Figure 3 for Limits to scalable evaluation at the frontier: LLM as Judge won't beat twice the data
Figure 4 for Limits to scalable evaluation at the frontier: LLM as Judge won't beat twice the data
Viaarxiv icon

Training on the Test Task Confounds Evaluation and Emergence

Add code
Jul 10, 2024
Figure 1 for Training on the Test Task Confounds Evaluation and Emergence
Figure 2 for Training on the Test Task Confounds Evaluation and Emergence
Figure 3 for Training on the Test Task Confounds Evaluation and Emergence
Figure 4 for Training on the Test Task Confounds Evaluation and Emergence
Viaarxiv icon

Whose Preferences? Differences in Fairness Preferences and Their Impact on the Fairness of AI Utilizing Human Feedback

Add code
Jun 09, 2024
Figure 1 for Whose Preferences? Differences in Fairness Preferences and Their Impact on the Fairness of AI Utilizing Human Feedback
Figure 2 for Whose Preferences? Differences in Fairness Preferences and Their Impact on the Fairness of AI Utilizing Human Feedback
Figure 3 for Whose Preferences? Differences in Fairness Preferences and Their Impact on the Fairness of AI Utilizing Human Feedback
Figure 4 for Whose Preferences? Differences in Fairness Preferences and Their Impact on the Fairness of AI Utilizing Human Feedback
Viaarxiv icon

Don't Label Twice: Quantity Beats Quality when Comparing Binary Classifiers on a Budget

Add code
Feb 03, 2024
Viaarxiv icon

Do personality tests generalize to Large Language Models?

Add code
Nov 09, 2023
Viaarxiv icon

Incentivizing Honesty among Competitors in Collaborative Learning and Optimization

Add code
May 25, 2023
Viaarxiv icon

Human-Guided Fair Classification for Natural Language Processing

Add code
Dec 20, 2022
Figure 1 for Human-Guided Fair Classification for Natural Language Processing
Figure 2 for Human-Guided Fair Classification for Natural Language Processing
Figure 3 for Human-Guided Fair Classification for Natural Language Processing
Figure 4 for Human-Guided Fair Classification for Natural Language Processing
Viaarxiv icon

Algorithmic collusion: A critical review

Add code
Oct 10, 2021
Figure 1 for Algorithmic collusion: A critical review
Viaarxiv icon

Measuring Progress in Deep Reinforcement Learning Sample Efficiency

Add code
Feb 09, 2021
Figure 1 for Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Figure 2 for Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Figure 3 for Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Figure 4 for Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Viaarxiv icon