Ren Kishimoto

Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits

Aug 20, 2024

SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation

Dec 04, 2023

Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation

Dec 04, 2023