Title: Local Differentially Private Regret Minimization in Reinforcement Learning

Oct 15, 2020

Evrard Garcelon, Vianney Perchet, Ciara Pike-Burke, Matteo Pirotta

Abstract: Reinforcement learning algorithms are widely used in domains where it is desirable to provide a personalized service. In these domains, user data often contains sensitive information that needs to be protected from third parties. Motivated by this, we study privacy in the context of finite-horizon Markov Decision Processes (MDPs) by requiring information to be obfuscated on the user side. We formulate this notion of privacy for RL by leveraging the local differential privacy (LDP) framework. We present an optimistic algorithm that simultaneously satisfies LDP requirements and achieves sublinear regret. We also establish a lower bound for regret minimization in finite-horizon MDPs with LDP guarantees. These results show that while LDP is appealing in practical applications, the setting is inherently more complex. In particular, our results demonstrate that the cost of privacy is multiplicative when compared to non-private settings.
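To make "obfuscated on the user side" concrete, here is a minimal sketch of one standard local randomizer, the Laplace mechanism, applied to a single reward before it leaves the user's device. This is an illustration only and is not taken from the paper: the function name `privatize_reward`, the assumption that rewards lie in [0, 1], and the choice of the Laplace mechanism are all assumptions made for this example, and the paper's algorithm may privatize different quantities in a different way.

```python
import random


def privatize_reward(reward, epsilon, rng=random):
    """Hypothetical user-side randomizer: add Laplace noise to one reward.

    Assumes reward is in [0, 1], so any two possible inputs differ by at
    most 1 (sensitivity 1). Laplace noise with scale 1 / epsilon then
    gives epsilon-local differential privacy for this single value.
    """
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    if not 0.0 <= reward <= 1.0:
        raise ValueError("reward must lie in [0, 1]")
    scale = 1.0 / epsilon
    # The difference of two i.i.d. exponentials with mean `scale`
    # is Laplace-distributed with location 0 and that scale.
    noise = rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)
    return reward + noise


# The learner only ever sees the noisy values, yet their average
# still concentrates around the true reward because the noise is zero-mean.
rng = random.Random(0)
noisy = [privatize_reward(0.7, epsilon=1.0, rng=rng) for _ in range(20000)]
mean_estimate = sum(noisy) / len(noisy)
```

Smaller values of `epsilon` mean stronger privacy but larger noise, so more samples are needed for the same accuracy. That trade-off gives some intuition for why privacy carries a cost in regret, although the multiplicative cost stated in the abstract is a result of the paper's own analysis, not of this sketch.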
