Picture for Alex Nikulkov

Alex Nikulkov

Pearl: A Production-ready Reinforcement Learning Agent

Add code
Dec 06, 2023
Viaarxiv icon

Offline Reinforcement Learning for Optimizing Production Bidding Policies

Add code
Oct 13, 2023
Viaarxiv icon

Optimizing Long-term Value for Auction-Based Recommender Systems via On-Policy Reinforcement Learning

Add code
May 24, 2023
Viaarxiv icon

Optimism Based Exploration in Large-Scale Recommender Systems

Add code
Apr 05, 2023
Viaarxiv icon