Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Adrien Banse

A Cantor-Kantorovich Metric Between Markov Decision Processes with Application to Transfer Learning

Jul 11, 2024

Adrien Banse, Venkatraman Renganathan, Raphaël M. Jungers

Figure 1 for A Cantor-Kantorovich Metric Between Markov Decision Processes with Application to Transfer Learning

Figure 2 for A Cantor-Kantorovich Metric Between Markov Decision Processes with Application to Transfer Learning

Abstract:We extend the notion of Cantor-Kantorovich distance between Markov chains introduced by (Banse et al., 2023) in the context of Markov Decision Processes (MDPs). The proposed metric is well-defined and can be efficiently approximated given a finite horizon. Then, we provide numerical evidences that the latter metric can lead to interesting applications in the field of reinforcement learning. In particular, we show that it could be used for forecasting the performance of transfer learning algorithms.

* Presented at the 26th International Symposium on Mathematical Theory of Networks and Systems (Cambridge, UK)

Via

Access Paper or Ask Questions

Federated Learning with Differential Privacy

Feb 03, 2024

Adrien Banse, Jan Kreischer, Xavier Oliva i Jürgens

Abstract:Federated learning (FL), as a type of distributed machine learning, is capable of significantly preserving client's private data from being shared among different parties. Nevertheless, private information can still be divulged by analyzing uploaded parameter weights from clients. In this report, we showcase our empirical benchmark of the effect of the number of clients and the addition of differential privacy (DP) mechanisms on the performance of the model on different types of data. Our results show that non-i.i.d and small datasets have the highest decrease in performance in a distributed and differentially private setting.

* Machine Learning (ML) & Federated Learning (FL); 4 pages, 3 figures

Via

Access Paper or Ask Questions

Data-driven abstractions via adaptive refinements and a Kantorovich metric [extended version]

Apr 03, 2023

Adrien Banse, Licio Romao, Alessandro Abate, Raphaël M. Jungers

Figure 1 for Data-driven abstractions via adaptive refinements and a Kantorovich metric [extended version]

Figure 2 for Data-driven abstractions via adaptive refinements and a Kantorovich metric [extended version]

Figure 3 for Data-driven abstractions via adaptive refinements and a Kantorovich metric [extended version]

Figure 4 for Data-driven abstractions via adaptive refinements and a Kantorovich metric [extended version]

Abstract:We introduce an adaptive refinement procedure for smart, and scalable abstraction of dynamical systems. Our technique relies on partitioning the state space depending on the observation of future outputs. However, this knowledge is dynamically constructed in an adaptive, asymmetric way. In order to learn the optimal structure, we define a Kantorovich-inspired metric between Markov chains, and we use it as a loss function. Our technique is prone to data-driven frameworks, but not restricted to. We also study properties of the above mentioned metric between Markov chains, which we believe could be of application for wider purpose. We propose an algorithm to approximate it, and we show that our method yields a much better computational complexity than using classical linear programming techniques.

* This paper is an extended version of a CDC2023 submission

Via

Access Paper or Ask Questions