Picture for Alessandro Lazaric

Alessandro Lazaric

INRIA Lille - Nord Europe

System-2 Recommenders: Disentangling Utility and Engagement in Recommendation Systems via Temporal Point-Processes

Add code
May 29, 2024
Figure 1 for System-2 Recommenders: Disentangling Utility and Engagement in Recommendation Systems via Temporal Point-Processes
Figure 2 for System-2 Recommenders: Disentangling Utility and Engagement in Recommendation Systems via Temporal Point-Processes
Viaarxiv icon

Reinforcement Learning with Options and State Representation

Add code
Mar 25, 2024
Figure 1 for Reinforcement Learning with Options and State Representation
Figure 2 for Reinforcement Learning with Options and State Representation
Figure 3 for Reinforcement Learning with Options and State Representation
Figure 4 for Reinforcement Learning with Options and State Representation
Viaarxiv icon

Simple Ingredients for Offline Reinforcement Learning

Add code
Mar 19, 2024
Viaarxiv icon

Layered State Discovery for Incremental Autonomous Exploration

Add code
Feb 07, 2023
Viaarxiv icon

Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping

Add code
Jan 05, 2023
Figure 1 for Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping
Figure 2 for Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping
Figure 3 for Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping
Figure 4 for Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping
Viaarxiv icon

On the Complexity of Representation Learning in Contextual Linear Bandits

Add code
Dec 19, 2022
Viaarxiv icon

Improved Adaptive Algorithm for Scalable Active Learning with Weak Labeler

Add code
Nov 04, 2022
Viaarxiv icon

Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees

Add code
Oct 24, 2022
Viaarxiv icon

Contextual bandits with concave rewards, and an application to fair ranking

Add code
Oct 18, 2022
Figure 1 for Contextual bandits with concave rewards, and an application to fair ranking
Figure 2 for Contextual bandits with concave rewards, and an application to fair ranking
Figure 3 for Contextual bandits with concave rewards, and an application to fair ranking
Figure 4 for Contextual bandits with concave rewards, and an application to fair ranking
Viaarxiv icon

Reaching Goals is Hard: Settling the Sample Complexity of the Stochastic Shortest Path

Add code
Oct 10, 2022
Figure 1 for Reaching Goals is Hard: Settling the Sample Complexity of the Stochastic Shortest Path
Figure 2 for Reaching Goals is Hard: Settling the Sample Complexity of the Stochastic Shortest Path
Figure 3 for Reaching Goals is Hard: Settling the Sample Complexity of the Stochastic Shortest Path
Viaarxiv icon