Picture for Michal Valko

Michal Valko

Sid

Optimal Design for Reward Modeling in RLHF

Add code
Oct 23, 2024
Figure 1 for Optimal Design for Reward Modeling in RLHF
Figure 2 for Optimal Design for Reward Modeling in RLHF
Viaarxiv icon

Preference Optimization with Multi-Sample Comparisons

Add code
Oct 16, 2024
Viaarxiv icon

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon

Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving

Add code
May 20, 2024
Figure 1 for Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving
Figure 2 for Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving
Figure 3 for Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving
Figure 4 for Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving
Viaarxiv icon

Understanding the performance gap between online and offline alignment algorithms

Add code
May 14, 2024
Figure 1 for Understanding the performance gap between online and offline alignment algorithms
Figure 2 for Understanding the performance gap between online and offline alignment algorithms
Figure 3 for Understanding the performance gap between online and offline alignment algorithms
Figure 4 for Understanding the performance gap between online and offline alignment algorithms
Viaarxiv icon

Human Alignment of Large Language Models through Online Preference Optimisation

Add code
Mar 13, 2024
Figure 1 for Human Alignment of Large Language Models through Online Preference Optimisation
Figure 2 for Human Alignment of Large Language Models through Online Preference Optimisation
Figure 3 for Human Alignment of Large Language Models through Online Preference Optimisation
Figure 4 for Human Alignment of Large Language Models through Online Preference Optimisation
Viaarxiv icon

Generalized Preference Optimization: A Unified Approach to Offline Alignment

Add code
Feb 08, 2024
Viaarxiv icon

Decoding-time Realignment of Language Models

Add code
Feb 05, 2024
Viaarxiv icon

Nash Learning from Human Feedback

Add code
Dec 06, 2023
Figure 1 for Nash Learning from Human Feedback
Figure 2 for Nash Learning from Human Feedback
Figure 3 for Nash Learning from Human Feedback
Figure 4 for Nash Learning from Human Feedback
Viaarxiv icon

Model-free Posterior Sampling via Learning Rate Randomization

Add code
Oct 27, 2023
Viaarxiv icon