Picture for Dane Sherburn

Dane Sherburn

Tony

GPT-4o System Card

Add code
Oct 25, 2024
Viaarxiv icon

MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering

Add code
Oct 09, 2024
Figure 1 for MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering
Figure 2 for MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering
Figure 3 for MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering
Figure 4 for MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering
Viaarxiv icon

Can Language Models Explain Their Own Classification Behavior?

Add code
May 13, 2024
Figure 1 for Can Language Models Explain Their Own Classification Behavior?
Figure 2 for Can Language Models Explain Their Own Classification Behavior?
Figure 3 for Can Language Models Explain Their Own Classification Behavior?
Figure 4 for Can Language Models Explain Their Own Classification Behavior?
Viaarxiv icon

Relational Graph Attention Networks

Add code
Apr 11, 2019
Figure 1 for Relational Graph Attention Networks
Figure 2 for Relational Graph Attention Networks
Figure 3 for Relational Graph Attention Networks
Figure 4 for Relational Graph Attention Networks
Viaarxiv icon