Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Deep Reinforcement Learning for Sponsored Search Real-time Bidding

Mar 01, 2018

Jun Zhao, Guang Qiu, Ziyu Guan, Wei Zhao, Xiaofei He

Figure 1 for Deep Reinforcement Learning for Sponsored Search Real-time Bidding

Figure 2 for Deep Reinforcement Learning for Sponsored Search Real-time Bidding

Figure 3 for Deep Reinforcement Learning for Sponsored Search Real-time Bidding

Figure 4 for Deep Reinforcement Learning for Sponsored Search Real-time Bidding

Share this with someone who'll enjoy it:

Abstract:Bidding optimization is one of the most critical problems in online advertising. Sponsored search (SS) auction, due to the randomness of user query behavior and platform nature, usually adopts keyword-level bidding strategies. In contrast, the display advertising (DA), as a relatively simpler scenario for auction, has taken advantage of real-time bidding (RTB) to boost the performance for advertisers. In this paper, we consider the RTB problem in sponsored search auction, named SS-RTB. SS-RTB has a much more complex dynamic environment, due to stochastic user query behavior and more complex bidding policies based on multiple keywords of an ad. Most previous methods for DA cannot be applied. We propose a reinforcement learning (RL) solution for handling the complex dynamic environment. Although some RL methods have been proposed for online advertising, they all fail to address the "environment changing" problem: the state transition probabilities vary between two days. Motivated by the observation that auction sequences of two days share similar transition patterns at a proper aggregation level, we formulate a robust MDP model at hour-aggregation level of the auction data and propose a control-by-model framework for SS-RTB. Rather than generating bid prices directly, we decide a bidding model for impressions of each hour and perform real-time bidding accordingly. We also extend the method to handle the multi-agent problem. We deployed the SS-RTB system in the e-commerce search auction platform of Alibaba. Empirical experiments of offline evaluation and online A/B test demonstrate the effectiveness of our method.

View paper on

Share this with someone who'll enjoy it:

Title:Deep Reinforcement Learning for Sponsored Search Real-time Bidding

Paper and Code