Picture for Jean-Francois Ton

Jean-Francois Ton

Rethinking Bradley-Terry Models in Preference-Based Reward Modeling: Foundations, Theory, and Alternatives

Add code
Nov 07, 2024
Viaarxiv icon

ACC-Debate: An Actor-Critic Approach to Multi-Agent Debate

Add code
Nov 04, 2024
Viaarxiv icon

Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation

Add code
Mar 08, 2024
Viaarxiv icon

Dataset Fairness: Achievable Fairness on Your Data With Utility Guarantees

Add code
Feb 27, 2024
Viaarxiv icon

Measuring and Reducing LLM Hallucination without Gold-Standard Answers via Expertise-Weighting

Add code
Feb 16, 2024
Viaarxiv icon

Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits

Add code
Dec 03, 2023
Viaarxiv icon

Deep Concept Removal

Add code
Oct 09, 2023
Viaarxiv icon

Invariant Learning via Probability of Sufficient and Necessary Causes

Add code
Sep 22, 2023
Viaarxiv icon

Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment

Add code
Aug 10, 2023
Figure 1 for Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment
Figure 2 for Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment
Figure 3 for Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment
Figure 4 for Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment
Viaarxiv icon

Conformal Off-Policy Prediction in Contextual Bandits

Add code
Jun 09, 2022
Figure 1 for Conformal Off-Policy Prediction in Contextual Bandits
Figure 2 for Conformal Off-Policy Prediction in Contextual Bandits
Figure 3 for Conformal Off-Policy Prediction in Contextual Bandits
Figure 4 for Conformal Off-Policy Prediction in Contextual Bandits
Viaarxiv icon