Otmane Sakhi

Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning

May 23, 2024
Via arXiv

Fast Slate Policy Optimization: Going Beyond Plackett-Luce

Aug 03, 2023
Via arXiv

PAC-Bayesian Offline Contextual Bandits With Guarantees

Oct 24, 2022
Via arXiv

Offline Evaluation of Reward-Optimizing Recommender Systems: The Case of Simulation

Sep 18, 2022
Via arXiv

Fast Offline Policy Optimization for Large Scale Recommendation

Aug 11, 2022
Via arXiv

A Scalable Probabilistic Model for Reward Optimizing Slate Recommendation

Aug 10, 2022
Via arXiv

Improving Offline Contextual Bandits with Distributional Robustness

Nov 13, 2020
Via arXiv

BLOB : A Probabilistic Model for Recommendation that Combines Organic and Bandit Signals

Aug 28, 2020
Via arXiv

Reconsidering Analytical Variational Bounds for Output Layers of Deep Networks

Oct 03, 2019
Via arXiv