Thomas Kleine Buening

MAVRL: Learning Reward Functions from Multiple Feedback Types with Amortized Variational Inference

Feb 16, 2026
Via arXiv

Reinforcement Learning via Self-Distillation

Jan 28, 2026
Via arXiv

Stackelberg Learning from Human Feedback: Preference Optimization as a Sequential Game

Dec 18, 2025
Via arXiv

Strategyproof Reinforcement Learning from Human Feedback

Mar 12, 2025
Via arXiv

A Unifying Framework for Causal Imitation Learning with Hidden Confounders

Feb 11, 2025
Via arXiv

A Minimax Approach to Ad Hoc Teamwork

Feb 04, 2025
Via arXiv

Strategic Linear Contextual Bandits

Jun 01, 2024
Via arXiv

Bandits Meet Mechanism Design to Combat Clickbait in Online Recommendation

Nov 27, 2023
Via arXiv

Minimax-Bayes Reinforcement Learning

Feb 21, 2023
Via arXiv

Environment Design for Inverse Reinforcement Learning

Oct 26, 2022
Via arXiv