Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tal Kachman

Modelling Mean-Field Games with Neural Ordinary Differential Equations

Apr 17, 2025

Anna C. M. Thöni, Yoram Bachrach, Tal Kachman

Abstract:Mean-field game theory relies on approximating games that would otherwise have been intractable to model. While the games can be solved analytically via the associated system of partial derivatives, this approach is not model-free, can lead to the loss of the existence or uniqueness of solutions and may suffer from modelling bias. To reduce the dependency between the model and the game, we combine mean-field game theory with deep learning in the form of neural ordinary differential equations. The resulting model is data-driven, lightweight and can learn extensive strategic interactions that are hard to capture using mean-field theory alone. In addition, the model is based on automatic differentiation, making it more robust and objective than approaches based on finite differences. We highlight the efficiency and flexibility of our approach by solving three mean-field games that vary in their complexity, observability and the presence of noise. Using these results, we show that the model is flexible, lightweight and requires few observations to learn the distribution underlying the data.

Via

Access Paper or Ask Questions

InfluenceNet: AI Models for Banzhaf and Shapley Value Prediction

Mar 11, 2025

Benjamin Kempinski, Tal Kachman

Abstract:Power indices are essential in assessing the contribution and influence of individual agents in multi-agent systems, providing crucial insights into collaborative dynamics and decision-making processes. While invaluable, traditional computational methods for exact or estimated power indices values require significant time and computational constraints, especially for large $(n\ge10)$ coalitions. These constraints have historically limited researchers' ability to analyse complex multi-agent interactions comprehensively. To address this limitation, we introduce a novel Neural Networks-based approach that efficiently estimates power indices for voting games, demonstrating comparable and often superiour performance to existing tools in terms of both speed and accuracy. This method not only addresses existing computational bottlenecks, but also enables rapid analysis of large coalitions, opening new avenues for multi-agent system research by overcoming previous computational limitations and providing researchers with a more accessible, scalable analytical tool.This increased efficiency will allow for the analysis of more complex and realistic multi-agent scenarios.

* 20 pages main text + 6 pages appendix, 11 figures. Accepted to IntelliSys 2025

Via

Access Paper or Ask Questions

Using Cooperative Game Theory to Prune Neural Networks

Nov 17, 2023

Mauricio Diaz-Ortiz Jr, Benjamin Kempinski, Daphne Cornelisse, Yoram Bachrach, Tal Kachman

Abstract:We show how solution concepts from cooperative game theory can be used to tackle the problem of pruning neural networks. The ever-growing size of deep neural networks (DNNs) increases their performance, but also their computational requirements. We introduce a method called Game Theory Assisted Pruning (GTAP), which reduces the neural network's size while preserving its predictive accuracy. GTAP is based on eliminating neurons in the network based on an estimation of their joint impact on the prediction quality through game theoretic solutions. Specifically, we use a power index akin to the Shapley value or Banzhaf index, tailored using a procedure similar to Dropout (commonly used to tackle overfitting problems in machine learning). Empirical evaluation of both feedforward networks and convolutional neural networks shows that this method outperforms existing approaches in the achieved tradeoff between the number of parameters and model accuracy.

Via

Access Paper or Ask Questions

Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees

Sep 18, 2023

Alexia Jolicoeur-Martineau, Kilian Fatras, Tal Kachman

Abstract:Tabular data is hard to acquire and is subject to missing values. This paper proposes a novel approach to generate and impute mixed-type (continuous and categorical) tabular data using score-based diffusion and conditional flow matching. Contrary to previous work that relies on neural networks as function approximators, we instead utilize XGBoost, a popular Gradient-Boosted Tree (GBT) method. In addition to being elegant, we empirically show on various datasets that our method i) generates highly realistic synthetic data when the training dataset is either clean or tainted by missing data and ii) generates diverse plausible data imputations. Our method often outperforms deep-learning generation methods and can trained in parallel using CPUs without the need for a GPU. To make it easily accessible, we release our code through a Python library on PyPI and an R package on CRAN.

Via

Access Paper or Ask Questions

Explainability Techniques for Chemical Language Models

May 25, 2023

Stefan Hödl, William Robinson, Yoram Bachrach, Wilhelm Huck, Tal Kachman

Figure 1 for Explainability Techniques for Chemical Language Models

Figure 2 for Explainability Techniques for Chemical Language Models

Figure 3 for Explainability Techniques for Chemical Language Models

Figure 4 for Explainability Techniques for Chemical Language Models

Abstract:Explainability techniques are crucial in gaining insights into the reasons behind the predictions of deep learning models, which have not yet been applied to chemical language models. We propose an explainable AI technique that attributes the importance of individual atoms towards the predictions made by these models. Our method backpropagates the relevance information towards the chemical input string and visualizes the importance of individual atoms. We focus on self-attention Transformers operating on molecular string representations and leverage a pretrained encoder for finetuning. We showcase the method by predicting and visualizing solubility in water and organic solvents. We achieve competitive model performance while obtaining interpretable predictions, which we use to inspect the pretrained model.

Via

Access Paper or Ask Questions

Charting the Topography of the Neural Network Landscape with Thermal-Like Noise

Apr 18, 2023

Theo Jules, Gal Brener, Tal Kachman, Noam Levi, Yohai Bar-Sinai

Abstract:The training of neural networks is a complex, high-dimensional, non-convex and noisy optimization problem whose theoretical understanding is interesting both from an applicative perspective and for fundamental reasons. A core challenge is to understand the geometry and topography of the landscape that guides the optimization. In this work, we employ standard Statistical Mechanics methods, namely, phase-space exploration using Langevin dynamics, to study this landscape for an over-parameterized fully connected network performing a classification task on random data. Analyzing the fluctuation statistics, in analogy to thermal dynamics at a constant temperature, we infer a clear geometric description of the low-loss region. We find that it is a low-dimensional manifold whose dimension can be readily obtained from the fluctuations. Furthermore, this dimension is controlled by the number of data points that reside near the classification decision boundary. Importantly, we find that a quadratic approximation of the loss near the minimum is fundamentally inadequate due to the exponential nature of the decision boundary and the flatness of the low-loss region. This causes the dynamics to sample regions with higher curvature at higher temperatures, while producing quadratic-like statistics at any given temperature. We explain this behavior by a simplified loss model which is analytically tractable and reproduces the observed fluctuation statistics.

* 7 pages, 4 figures

Via

Access Paper or Ask Questions

Diffusion models with location-scale noise

Apr 12, 2023

Alexia Jolicoeur-Martineau, Kilian Fatras, Ke Li, Tal Kachman

Figure 1 for Diffusion models with location-scale noise

Abstract:Diffusion Models (DMs) are powerful generative models that add Gaussian noise to the data and learn to remove it. We wanted to determine which noise distribution (Gaussian or non-Gaussian) led to better generated data in DMs. Since DMs do not work by design with non-Gaussian noise, we built a framework that allows reversing a diffusion process with non-Gaussian location-scale noise. We use that framework to show that the Gaussian distribution performs the best over a wide range of other distributions (Laplace, Uniform, t, Generalized-Gaussian).

Via

Access Paper or Ask Questions

Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation

Oct 02, 2022

Yannick Hogewind, Thiago D. Simao, Tal Kachman, Nils Jansen

Figure 1 for Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation

Figure 2 for Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation

Figure 3 for Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation

Figure 4 for Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation

Abstract:We address the problem of safe reinforcement learning from pixel observations. Inherent challenges in such settings are (1) a trade-off between reward optimization and adhering to safety constraints, (2) partial observability, and (3) high-dimensional observations. We formalize the problem in a constrained, partially observable Markov decision process framework, where an agent obtains distinct reward and safety signals. To address the curse of dimensionality, we employ a novel safety critic using the stochastic latent actor-critic (SLAC) approach. The latent variable model predicts rewards and safety violations, and we use the safety critic to train safe policies. Using well-known benchmark environments, we demonstrate competitive performance over existing approaches with respects to computational requirements, final reward return, and satisfying the safety constraints.

Via

Access Paper or Ask Questions

Neural Payoff Machines: Predicting Fair and Stable Payoff Allocations Among Team Members

Aug 18, 2022

Daphne Cornelisse, Thomas Rood, Mateusz Malinowski, Yoram Bachrach, Tal Kachman

Figure 1 for Neural Payoff Machines: Predicting Fair and Stable Payoff Allocations Among Team Members

Figure 2 for Neural Payoff Machines: Predicting Fair and Stable Payoff Allocations Among Team Members

Figure 3 for Neural Payoff Machines: Predicting Fair and Stable Payoff Allocations Among Team Members

Figure 4 for Neural Payoff Machines: Predicting Fair and Stable Payoff Allocations Among Team Members

Abstract:In many multi-agent settings, participants can form teams to achieve collective outcomes that may far surpass their individual capabilities. Measuring the relative contributions of agents and allocating them shares of the reward that promote long-lasting cooperation are difficult tasks. Cooperative game theory offers solution concepts identifying distribution schemes, such as the Shapley value, that fairly reflect the contribution of individuals to the performance of the team or the Core, which reduces the incentive of agents to abandon their team. Applications of such methods include identifying influential features and sharing the costs of joint ventures or team formation. Unfortunately, using these solutions requires tackling a computational barrier as they are hard to compute, even in restricted settings. In this work, we show how cooperative game-theoretic solutions can be distilled into a learned model by training neural networks to propose fair and stable payoff allocations. We show that our approach creates models that can generalize to games far from the training distribution and can predict solutions for more players than observed during training. An important application of our framework is Explainable AI: our approach can be used to speed-up Shapley value computations on many instances.

Via

Access Paper or Ask Questions

Lyapunov Exponents for Diversity in Differentiable Games

Dec 24, 2021

Jonathan Lorraine, Paul Vicol, Jack Parker-Holder, Tal Kachman, Luke Metz, Jakob Foerster

Figure 1 for Lyapunov Exponents for Diversity in Differentiable Games

Figure 2 for Lyapunov Exponents for Diversity in Differentiable Games

Figure 3 for Lyapunov Exponents for Diversity in Differentiable Games

Figure 4 for Lyapunov Exponents for Diversity in Differentiable Games

Abstract:Ridge Rider (RR) is an algorithm for finding diverse solutions to optimization problems by following eigenvectors of the Hessian ("ridges"). RR is designed for conservative gradient systems (i.e., settings involving a single loss function), where it branches at saddles - easy-to-find bifurcation points. We generalize this idea to non-conservative, multi-agent gradient systems by proposing a method - denoted Generalized Ridge Rider (GRR) - for finding arbitrary bifurcation points. We give theoretical motivation for our method by leveraging machinery from the field of dynamical systems. We construct novel toy problems where we can visualize new phenomena while giving insight into high-dimensional problems of interest. Finally, we empirically evaluate our method by finding diverse solutions in the iterated prisoners' dilemma and relevant machine learning problems including generative adversarial networks.

* AAMAS2022, 24 pages

Via

Access Paper or Ask Questions