Nadav Merlis

Stable Matching with Ties: Approximation Ratios and Learning

Nov 05, 2024

Improved Algorithms for Contextual Dynamic Pricing

Jun 17, 2024

Reinforcement Learning with Lookahead Information

Jun 04, 2024

On Bits and Bandits: Quantifying the Regret-Information Trade-off

May 26, 2024

The Value of Reward Lookahead in Reinforcement Learning

Mar 18, 2024

Reinforcement Learning with History-Dependent Dynamic Contexts

Feb 04, 2023

Reinforcement Learning with a Terminator

May 30, 2022

Dare not to Ask: Problem-Dependent Guarantees for Budgeted Bandits

Oct 12, 2021

Ensemble Bootstrapping for Q-Learning

Feb 28, 2021

Confidence-Budget Matching for Sequential Budgeted Learning

Feb 05, 2021