Yuta Saito

A Best-of-Both Approach to Improve Match Predictions and Reciprocal Recommendations for Job Search

Sep 18, 2024
Via arXiv

Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits

Aug 20, 2024
Via arXiv

Long-term Off-Policy Evaluation and Learning

Apr 24, 2024
Via arXiv

Hyperparameter Optimization Can Even be Harmful in Off-Policy Learning and How to Deal with It

Apr 23, 2024
Via arXiv

Scalable and Provably Fair Exposure Control for Large-Scale Recommender Systems

Feb 22, 2024
Via arXiv

POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition

Feb 09, 2024
Via arXiv

Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction

Feb 03, 2024
Via arXiv

Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation

Dec 04, 2023
Via arXiv

SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation

Dec 04, 2023
Via arXiv

Off-Policy Evaluation of Ranking Policies under Diverse User Behavior

Jun 26, 2023
Via arXiv