CRIStAL, CNRS, SPIRALS
Abstract: Graph neural networks (GNNs) are the predominant architectures for a variety of learning tasks on graphs. We present a new angle on the expressive power of GNNs by studying how the predictions of a GNN probabilistic classifier evolve as we apply it to larger graphs drawn from some random graph model. We show that the output converges to a constant function, which upper-bounds what these classifiers can express uniformly. This convergence phenomenon applies to a very wide class of GNNs, including state-of-the-art models, with aggregates including mean and the attention-based mechanism of graph transformers. Our results apply to a broad class of random graph models, including the (sparse) Erdős–Rényi model and the stochastic block model. We empirically validate these findings, observing that the convergence phenomenon already manifests itself on graphs of relatively modest size.
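As a rough illustration of this convergence phenomenon (not the paper's actual experimental setup), the following minimal sketch applies a single randomly initialised mean-aggregation GNN layer with a sigmoid readout to Erdős–Rényi graphs of increasing size; all function and variable names are ours.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 8
W_self = rng.normal(size=(d, d)) / np.sqrt(d)  # random, untrained layer weights
W_agg = rng.normal(size=(d, d)) / np.sqrt(d)
w_out = rng.normal(size=d)

def mean_gnn_output(n, p=0.1):
    """One mean-aggregation GNN layer on an Erdos-Renyi graph G(n, p),
    followed by a mean readout and a sigmoid 'probabilistic classifier'."""
    A = np.triu((rng.random((n, n)) < p).astype(float), 1)
    A = A + A.T                                      # undirected, no self-loops
    X = rng.normal(size=(n, d))                      # i.i.d. random node features
    deg = np.maximum(A.sum(axis=1, keepdims=True), 1.0)
    H = np.tanh(X @ W_self + (A @ X) / deg @ W_agg)  # mean over neighbours
    return 1.0 / (1.0 + np.exp(-(H.mean(axis=0) @ w_out)))

for n in [50, 100, 500, 1000, 5000]:
    print(n, mean_gnn_output(n))   # values concentrate around a constant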
Abstract: We consider learning a probabilistic classifier from partially-labelled supervision (each input is annotated with a set of possible labels) using standard neural architectures with a softmax as the final layer. We identify a bias phenomenon that can arise from the softmax layer in even simple architectures and that prevents proper exploration of alternative options, making the dynamics of gradient descent overly sensitive to initialisation. We introduce a novel loss function that allows for unbiased exploration within the space of alternative outputs. We give a theoretical justification for our loss function, and provide an extensive evaluation of its impact on synthetic data, on standard partially-labelled benchmarks, and on a novel contributed benchmark related to an existing rule learning challenge.
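The abstract does not spell out the paper's loss, but the bias it refers to can already be seen in the standard partial-label objective, the negative log of the summed softmax probabilities over the candidate set. In this minimal sketch (names are ours), the gradient pushes each candidate logit up in proportion to its current probability, so whichever candidate the initialisation happens to favour is reinforced rather than explored.

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def partial_label_grad(z, candidates):
    """Gradient w.r.t. the logits z of the standard partial-label loss
    L = -log(sum of softmax probabilities over the candidate set)."""
    p = softmax(z)
    q = p[candidates].sum()
    g = p.copy()                        # non-candidate logits are pushed down
    g[candidates] -= p[candidates] / q  # candidates: p_j * (1 - 1/q) < 0
    return g

z = np.array([0.2, 0.1, -0.3, 0.0])  # logits; classes 0 and 1 are candidates
g = partial_label_grad(z, [0, 1])
print(g[[0, 1]])  # both negative (their logits increase under gradient
                  # descent), but in proportion to the current probability:
                  # the initially favoured class 0 is reinforced the most.
```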
Abstract: We consider entailment problems involving powerful constraint languages such as frontier-guarded existential rules, in which we impose additional semantic restrictions on a set of distinguished relations. We consider restricting a relation to be transitive, restricting a relation to be the transitive closure of another relation, and restricting a relation to be a linear order. We give natural variants of guardedness that allow inference to remain decidable in each case, and isolate the complexity of the corresponding decision problems. Finally, we show that slight changes in these conditions lead to undecidability.
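To fix intuition, here is a small illustrative example of the setting (the rules are ours, not drawn from the paper): a frontier-guarded existential rule whose frontier variable is covered by a body atom, together with a semantic restriction declaring a distinguished relation transitive.

```latex
% Frontier-guarded existential rule: the frontier variable y (shared
% between body and head) occurs in the guard atom Emp(y).
\forall x\,\forall y\,\bigl(\mathrm{Manages}(x,y)\land\mathrm{Emp}(y)
  \rightarrow \exists z\,\mathrm{ReportsTo}(y,z)\bigr)

% Semantic restriction on a distinguished relation: ReportsTo is transitive.
\forall x\,\forall y\,\forall z\,\bigl(\mathrm{ReportsTo}(x,y)\land
  \mathrm{ReportsTo}(y,z)\rightarrow\mathrm{ReportsTo}(x,z)\bigr)
```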
Abstract: Data integration systems allow users to access data residing in multiple sources by means of queries over a global schema, related to the sources via mappings. Data sources often contain sensitive information, so an analysis is needed to verify that a schema satisfies a privacy policy, given as a set of queries whose answers should not be accessible to users. Such an analysis should take into account not only the knowledge that an attacker may have about the mappings, but also what they may know about the semantics of the sources. In this paper, we show that source constraints can have a dramatic impact on disclosure analysis. We study the problem of determining whether a given data integration system discloses a source query to an attacker in the presence of constraints, providing both lower and upper bounds on source-aware disclosure analysis.
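A toy example of the phenomenon (our own, purely illustrative): a mapping that exposes only clinic visits can still disclose a policy query about diagnoses once a source constraint links the two relations.

```latex
% Mapping: the global schema exposes who visited the clinic.
V(p) \leftarrow \mathrm{Visit}(p)

% Policy query to protect: whether a person has some diagnosis.
Q(p) \;=\; \exists d\,\mathrm{Diag}(p,d)

% Source constraint: every visit results in a diagnosis.
\forall p\,\bigl(\mathrm{Visit}(p)\rightarrow\exists d\,\mathrm{Diag}(p,d)\bigr)

% Without the constraint, the view V reveals nothing about Diag; with it,
% an attacker who sees V(alice) learns that Q(alice) holds.
```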
Abstract: Inspired by the magic sets for Datalog, we present a novel goal-driven approach for answering queries over terminating existential rules with equality (a.k.a. TGDs and EGDs). Our technique improves the performance of query answering by pruning the consequences that are not relevant to the query. This is challenging in our setting because equalities can potentially affect all predicates in a dataset. We address this problem by combining the existing singularization technique with two new ingredients: an algorithm for identifying the rules relevant to a query and a new magic sets algorithm. We show empirically that our technique can significantly improve the performance of query answering, and that it can mean the difference between answering a query in a few seconds and not being able to process the query at all.
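Schematically, singularization turns the built-in equality generated by EGDs into an ordinary predicate so that goal-driven pruning can trace its consequences. The following is a simplified sketch with our own relation names; the actual transformation rewrites variable occurrences and adds replacement axioms more carefully than shown here.

```latex
% An EGD: two tuples sharing a social security number denote the same person.
\mathrm{SSN}(x,s)\land\mathrm{SSN}(y,s)\rightarrow x = y

% After singularization, equality becomes an explicit predicate Eq with
% replacement axioms, so a magic-sets rewriting can restrict its use to
% the predicates actually relevant to the query:
\mathrm{SSN}(x,s)\land\mathrm{SSN}(y,s)\rightarrow\mathrm{Eq}(x,y)
\mathrm{Eq}(x,y)\land\mathrm{SSN}(x,s)\rightarrow\mathrm{SSN}(y,s)
```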