Picture for Alex J. Chan

Alex J. Chan

Discovering Preference Optimization Algorithms with and for Large Language Models

Add code
Jun 12, 2024
Figure 1 for Discovering Preference Optimization Algorithms with and for Large Language Models
Figure 2 for Discovering Preference Optimization Algorithms with and for Large Language Models
Figure 3 for Discovering Preference Optimization Algorithms with and for Large Language Models
Figure 4 for Discovering Preference Optimization Algorithms with and for Large Language Models
Viaarxiv icon

Dense Reward for Free in Reinforcement Learning from Human Feedback

Add code
Feb 01, 2024
Figure 1 for Dense Reward for Free in Reinforcement Learning from Human Feedback
Figure 2 for Dense Reward for Free in Reinforcement Learning from Human Feedback
Figure 3 for Dense Reward for Free in Reinforcement Learning from Human Feedback
Figure 4 for Dense Reward for Free in Reinforcement Learning from Human Feedback
Viaarxiv icon

Harmonizing Global Voices: Culturally-Aware Models for Enhanced Content Moderation

Add code
Dec 05, 2023
Viaarxiv icon

When is Off-Policy Evaluation Useful? A Data-Centric Perspective

Add code
Nov 23, 2023
Viaarxiv icon

Optimising Human-AI Collaboration by Learning Convincing Explanations

Add code
Nov 13, 2023
Viaarxiv icon

How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions

Add code
Sep 26, 2023
Viaarxiv icon

Practical Approaches for Fair Learning with Multitype and Multivariate Sensitive Attributes

Add code
Nov 11, 2022
Viaarxiv icon

Synthetic Model Combination: An Instance-wise Approach to Unsupervised Ensemble Learning

Add code
Oct 11, 2022
Figure 1 for Synthetic Model Combination: An Instance-wise Approach to Unsupervised Ensemble Learning
Figure 2 for Synthetic Model Combination: An Instance-wise Approach to Unsupervised Ensemble Learning
Figure 3 for Synthetic Model Combination: An Instance-wise Approach to Unsupervised Ensemble Learning
Figure 4 for Synthetic Model Combination: An Instance-wise Approach to Unsupervised Ensemble Learning
Viaarxiv icon

POETREE: Interpretable Policy Learning with Adaptive Decision Trees

Add code
Mar 15, 2022
Figure 1 for POETREE: Interpretable Policy Learning with Adaptive Decision Trees
Figure 2 for POETREE: Interpretable Policy Learning with Adaptive Decision Trees
Figure 3 for POETREE: Interpretable Policy Learning with Adaptive Decision Trees
Figure 4 for POETREE: Interpretable Policy Learning with Adaptive Decision Trees
Viaarxiv icon

Inverse Online Learning: Understanding Non-Stationary and Reactionary Policies

Add code
Mar 14, 2022
Figure 1 for Inverse Online Learning: Understanding Non-Stationary and Reactionary Policies
Figure 2 for Inverse Online Learning: Understanding Non-Stationary and Reactionary Policies
Figure 3 for Inverse Online Learning: Understanding Non-Stationary and Reactionary Policies
Figure 4 for Inverse Online Learning: Understanding Non-Stationary and Reactionary Policies
Viaarxiv icon