Alex J. Chan

LM2: Large Memory Models

Feb 09, 2025
Via arXiv

Discovering Preference Optimization Algorithms with and for Large Language Models

Jun 12, 2024
Via arXiv

Dense Reward for Free in Reinforcement Learning from Human Feedback

Feb 01, 2024
Via arXiv

Harmonizing Global Voices: Culturally-Aware Models for Enhanced Content Moderation

Dec 05, 2023
Via arXiv

When is Off-Policy Evaluation Useful? A Data-Centric Perspective

Nov 23, 2023
Via arXiv

Optimising Human-AI Collaboration by Learning Convincing Explanations

Nov 13, 2023
Via arXiv

How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions

Sep 26, 2023
Via arXiv

Practical Approaches for Fair Learning with Multitype and Multivariate Sensitive Attributes

Nov 11, 2022
Via arXiv

Synthetic Model Combination: An Instance-wise Approach to Unsupervised Ensemble Learning

Oct 11, 2022
Via arXiv

POETREE: Interpretable Policy Learning with Adaptive Decision Trees

Mar 15, 2022
Via arXiv