Branislav Kveton

Multi-Objective Alignment of Large Language Models Through Hypervolume Maximization

Dec 06, 2024

Language-Model Prior Overcomes Cold-Start Items

Nov 13, 2024

OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models

Oct 31, 2024

Personalization of Large Language Models: A Survey

Oct 29, 2024

Online Posterior Sampling with a Diffusion Prior

Oct 04, 2024

Off-Policy Evaluation from Logged Human Feedback

Jun 14, 2024

Cross-Validated Off-Policy Evaluation

May 27, 2024

Optimal Design for Human Feedback

Apr 22, 2024

Experimental Design for Active Transductive Inference in Large Language Models

Apr 12, 2024

MADA: Meta-Adaptive Optimizers through hyper-gradient Descent

Jan 17, 2024