Haruka Kiyohara

Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits

Aug 20, 2024

Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction

Feb 03, 2024

Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation

Dec 04, 2023

SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation

Dec 04, 2023

Off-Policy Evaluation of Ranking Policies under Diverse User Behavior

Jun 26, 2023

Policy-Adaptive Estimator Selection for Off-Policy Evaluation

Nov 25, 2022

Future-Dependent Value-Based Off-Policy Evaluation in POMDPs

Jul 26, 2022

Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model

Feb 03, 2022

Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation

Sep 17, 2021

Evaluating the Robustness of Off-Policy Evaluation

Aug 31, 2021