Yuta Saito

A Best-of-Both Approach to Improve Match Predictions and Reciprocal Recommendations for Job Search

Sep 18, 2024
Via arXiv

Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits

Aug 20, 2024
Via arXiv

Long-term Off-Policy Evaluation and Learning

Apr 24, 2024
Via arXiv

Hyperparameter Optimization Can Even be Harmful in Off-Policy Learning and How to Deal with It

Apr 23, 2024
Via arXiv

Scalable and Provably Fair Exposure Control for Large-Scale Recommender Systems

Feb 22, 2024
Via arXiv

POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition

Feb 09, 2024
Via arXiv

Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction

Feb 03, 2024
Via arXiv

SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation

Dec 04, 2023
Via arXiv

Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation

Dec 04, 2023
Via arXiv

Off-Policy Evaluation of Ranking Policies under Diverse User Behavior

Jun 26, 2023
Via arXiv