Picture for Fernando Martínez-Plumed

Fernando Martínez-Plumed

Shammie

General Scales Unlock AI Evaluation with Explanatory and Predictive Power

Add code
Mar 09, 2025
Viaarxiv icon

PredictaBoard: Benchmarking LLM Score Predictability

Add code
Feb 20, 2025
Viaarxiv icon

Predictable Artificial Intelligence

Add code
Oct 09, 2023
Figure 1 for Predictable Artificial Intelligence
Figure 2 for Predictable Artificial Intelligence
Figure 3 for Predictable Artificial Intelligence
Figure 4 for Predictable Artificial Intelligence
Viaarxiv icon

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Add code
Jun 10, 2022
Viaarxiv icon

Compute and Energy Consumption Trends in Deep Learning Inference

Add code
Sep 12, 2021
Figure 1 for Compute and Energy Consumption Trends in Deep Learning Inference
Figure 2 for Compute and Energy Consumption Trends in Deep Learning Inference
Figure 3 for Compute and Energy Consumption Trends in Deep Learning Inference
Figure 4 for Compute and Energy Consumption Trends in Deep Learning Inference
Viaarxiv icon

Fairness and Missing Values

Add code
May 29, 2019
Figure 1 for Fairness and Missing Values
Figure 2 for Fairness and Missing Values
Figure 3 for Fairness and Missing Values
Figure 4 for Fairness and Missing Values
Viaarxiv icon

Analysing Results from AI Benchmarks: Key Indicators and How to Obtain Them

Add code
Nov 20, 2018
Figure 1 for Analysing Results from AI Benchmarks: Key Indicators and How to Obtain Them
Figure 2 for Analysing Results from AI Benchmarks: Key Indicators and How to Obtain Them
Figure 3 for Analysing Results from AI Benchmarks: Key Indicators and How to Obtain Them
Figure 4 for Analysing Results from AI Benchmarks: Key Indicators and How to Obtain Them
Viaarxiv icon

General-purpose Declarative Inductive Programming with Domain-Specific Background Knowledge for Data Wrangling Automation

Add code
Sep 26, 2018
Figure 1 for General-purpose Declarative Inductive Programming with Domain-Specific Background Knowledge for Data Wrangling Automation
Figure 2 for General-purpose Declarative Inductive Programming with Domain-Specific Background Knowledge for Data Wrangling Automation
Figure 3 for General-purpose Declarative Inductive Programming with Domain-Specific Background Knowledge for Data Wrangling Automation
Figure 4 for General-purpose Declarative Inductive Programming with Domain-Specific Background Knowledge for Data Wrangling Automation
Viaarxiv icon

A multidisciplinary task-based perspective for evaluating the impact of AI autonomy and generality on the future of work

Add code
Jul 06, 2018
Figure 1 for A multidisciplinary task-based perspective for evaluating the impact of AI autonomy and generality on the future of work
Viaarxiv icon

Accounting for the Neglected Dimensions of AI Progress

Add code
Jun 02, 2018
Figure 1 for Accounting for the Neglected Dimensions of AI Progress
Figure 2 for Accounting for the Neglected Dimensions of AI Progress
Figure 3 for Accounting for the Neglected Dimensions of AI Progress
Figure 4 for Accounting for the Neglected Dimensions of AI Progress
Viaarxiv icon