Johannes Kirschner

Confidence Estimation via Sequential Likelihood Mixing

Feb 20, 2025

Regret Minimization via Saddle Point Optimization

Mar 15, 2024

Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning

Feb 08, 2023

Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and Applications

Feb 07, 2023

Near-optimal Policy Identification in Active Reinforcement Learning

Dec 19, 2022

Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off

Dec 17, 2022

Tuning Particle Accelerators with Safety Constraints using Bayesian Optimization

Mar 29, 2022

Bias-Robust Bayesian Optimization via Dueling Bandits

Jun 09, 2021

Efficient Pure Exploration for Combinatorial Bandits with Semi-Bandit Feedback

Jan 21, 2021

Asymptotically Optimal Information-Directed Sampling

Nov 11, 2020