Picture for Riccardo Fogliato

Riccardo Fogliato

Improving LLM Group Fairness on Tabular Data via In-Context Learning

Add code
Dec 05, 2024
Viaarxiv icon

Precise Model Benchmarking with Only a Few Observations

Add code
Oct 07, 2024
Figure 1 for Precise Model Benchmarking with Only a Few Observations
Figure 2 for Precise Model Benchmarking with Only a Few Observations
Figure 3 for Precise Model Benchmarking with Only a Few Observations
Figure 4 for Precise Model Benchmarking with Only a Few Observations
Viaarxiv icon

A Framework for Efficient Model Evaluation through Stratification, Sampling, and Estimation

Add code
Jun 11, 2024
Figure 1 for A Framework for Efficient Model Evaluation through Stratification, Sampling, and Estimation
Figure 2 for A Framework for Efficient Model Evaluation through Stratification, Sampling, and Estimation
Figure 3 for A Framework for Efficient Model Evaluation through Stratification, Sampling, and Estimation
Figure 4 for A Framework for Efficient Model Evaluation through Stratification, Sampling, and Estimation
Viaarxiv icon

Multicalibration for Confidence Scoring in LLMs

Add code
Apr 06, 2024
Viaarxiv icon

Confidence Intervals for Error Rates in Matching Tasks: Critical Review and Recommendations

Add code
Jun 01, 2023
Viaarxiv icon

Who Goes First? Influences of Human-AI Workflow on Decision Making in Clinical Imaging

Add code
May 19, 2022
Figure 1 for Who Goes First? Influences of Human-AI Workflow on Decision Making in Clinical Imaging
Figure 2 for Who Goes First? Influences of Human-AI Workflow on Decision Making in Clinical Imaging
Figure 3 for Who Goes First? Influences of Human-AI Workflow on Decision Making in Clinical Imaging
Figure 4 for Who Goes First? Influences of Human-AI Workflow on Decision Making in Clinical Imaging
Viaarxiv icon

The Impact of Algorithmic Risk Assessments on Human Predictions and its Analysis via Crowdsourcing Studies

Add code
Sep 03, 2021
Figure 1 for The Impact of Algorithmic Risk Assessments on Human Predictions and its Analysis via Crowdsourcing Studies
Figure 2 for The Impact of Algorithmic Risk Assessments on Human Predictions and its Analysis via Crowdsourcing Studies
Figure 3 for The Impact of Algorithmic Risk Assessments on Human Predictions and its Analysis via Crowdsourcing Studies
Figure 4 for The Impact of Algorithmic Risk Assessments on Human Predictions and its Analysis via Crowdsourcing Studies
Viaarxiv icon

Uncertainty as a Form of Transparency: Measuring, Communicating, and Using Uncertainty

Add code
Nov 15, 2020
Figure 1 for Uncertainty as a Form of Transparency: Measuring, Communicating, and Using Uncertainty
Figure 2 for Uncertainty as a Form of Transparency: Measuring, Communicating, and Using Uncertainty
Figure 3 for Uncertainty as a Form of Transparency: Measuring, Communicating, and Using Uncertainty
Figure 4 for Uncertainty as a Form of Transparency: Measuring, Communicating, and Using Uncertainty
Viaarxiv icon