Picture for Giulio Starace

Giulio Starace

Tony

GPT-4o System Card

Add code
Oct 25, 2024
Viaarxiv icon

MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering

Add code
Oct 09, 2024
Figure 1 for MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering
Figure 2 for MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering
Figure 3 for MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering
Figure 4 for MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering
Viaarxiv icon

Probing LLMs for Joint Encoding of Linguistic Categories

Add code
Oct 28, 2023
Viaarxiv icon

[Re] Badder Seeds: Reproducing the Evaluation of Lexical Methods for Bias Measurement

Add code
Jun 03, 2022
Figure 1 for [Re] Badder Seeds: Reproducing the Evaluation of Lexical Methods for Bias Measurement
Figure 2 for [Re] Badder Seeds: Reproducing the Evaluation of Lexical Methods for Bias Measurement
Figure 3 for [Re] Badder Seeds: Reproducing the Evaluation of Lexical Methods for Bias Measurement
Figure 4 for [Re] Badder Seeds: Reproducing the Evaluation of Lexical Methods for Bias Measurement
Viaarxiv icon