Picture for Carsten T. Lüth

Carsten T. Lüth

SURE-VQA: Systematic Understanding of Robustness Evaluation in Medical VQA Tasks

Add code
Nov 29, 2024
Viaarxiv icon

Navigating the Maze of Explainable AI: A Systematic Approach to Evaluating Methods and Metrics

Add code
Sep 25, 2024
Viaarxiv icon

Overcoming Common Flaws in the Evaluation of Selective Classification Systems

Add code
Jul 01, 2024
Viaarxiv icon

ValUES: A Framework for Systematic Validation of Uncertainty Estimation in Semantic Segmentation

Add code
Jan 16, 2024
Viaarxiv icon

cOOpD: Reformulating COPD classification on chest CT scans as anomaly detection using contrastive representations

Add code
Jul 14, 2023
Viaarxiv icon

Toward Realistic Evaluation of Deep Active Learning Algorithms in Image Classification

Add code
Jan 25, 2023
Viaarxiv icon

CRADL: Contrastive Representations for Unsupervised Anomaly Detection and Localization

Add code
Jan 05, 2023
Viaarxiv icon

A Call to Reflect on Evaluation Practices for Failure Detection in Image Classification

Add code
Nov 28, 2022
Viaarxiv icon