Picture for Jasper Dekoninck

Jasper Dekoninck

A Unified Approach to Routing and Cascading for LLMs

Add code
Oct 14, 2024
Viaarxiv icon

Polyrating: A Cost-Effective and Bias-Aware Rating System for LLM Evaluation

Add code
Sep 01, 2024
Figure 1 for Polyrating: A Cost-Effective and Bias-Aware Rating System for LLM Evaluation
Figure 2 for Polyrating: A Cost-Effective and Bias-Aware Rating System for LLM Evaluation
Figure 3 for Polyrating: A Cost-Effective and Bias-Aware Rating System for LLM Evaluation
Figure 4 for Polyrating: A Cost-Effective and Bias-Aware Rating System for LLM Evaluation
Viaarxiv icon

ConStat: Performance-Based Contamination Detection in Large Language Models

Add code
May 25, 2024
Figure 1 for ConStat: Performance-Based Contamination Detection in Large Language Models
Figure 2 for ConStat: Performance-Based Contamination Detection in Large Language Models
Figure 3 for ConStat: Performance-Based Contamination Detection in Large Language Models
Figure 4 for ConStat: Performance-Based Contamination Detection in Large Language Models
Viaarxiv icon

Evading Data Contamination Detection for Language Models is (too) Easy

Add code
Feb 12, 2024
Viaarxiv icon

Controlled Text Generation via Language Model Arithmetic

Add code
Nov 24, 2023
Viaarxiv icon