Sandra Zilles

Common Benchmarks Undervalue the Generalization Power of Programmatic Policies

Jun 17, 2025

Amirhossein Rajabpour, Kiarash Aghakasiri, Sandra Zilles, Levi H. S. Lelis

Abstract:Algorithms for learning programmatic representations for sequential decision-making problems are often evaluated on out-of-distribution (OOD) problems, with the common conclusion that programmatic policies generalize better than neural policies on OOD problems. In this position paper, we argue that commonly used benchmarks undervalue the generalization capabilities of programmatic representations. We analyze the experiments of four papers from the literature and show that neural policies, which were shown not to generalize, can generalize as effectively as programmatic policies on OOD problems. This is achieved with simple changes in the neural policies' training pipeline. Namely, we show that simpler neural architectures with the same type of sparse observation used with programmatic policies can help attain OOD generalization. Another modification we have shown to be effective is the use of reward functions that allow for safer policies (e.g., agents that drive slowly can generalize better). Also, we argue for creating benchmark problems highlighting concepts needed for OOD generalization that may challenge neural policies but align with programmatic representations, such as tasks requiring algorithmic constructs like stacks.

* 17 pages, 5 figures

Approximation Algorithms for Preference Aggregation Using CP-Nets

Dec 15, 2023

Abu Mohammad Hammad Ali, Boting Yang, Sandra Zilles

Abstract:This paper studies the design and analysis of approximation algorithms for aggregating preferences over combinatorial domains, represented using Conditional Preference Networks (CP-nets). Its focus is on aggregating preferences over so-called swaps, for which optimal solutions in general are already known to be of exponential size. We first analyze a trivial 2-approximation algorithm that simply outputs the best of the given input preferences, and establish a structural condition under which the approximation ratio of this algorithm is improved to $4/3$. We then propose a polynomial-time approximation algorithm whose outputs are provably no worse than those of the trivial algorithm, but often substantially better. A family of problem instances is presented for which our improved algorithm produces optimal solutions, while, for any $\varepsilon$, the trivial algorithm cannot attain a $(2-\varepsilon)$-approximation. These results may lead to the first polynomial-time approximation algorithm that solves the CP-net aggregation problem for swaps with an approximation ratio substantially better than $2$.

* 11 pages, main body and appendix. Full version of a paper accepted at the 38th Annual AAAI Conference on Artificial Intelligence
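
The trivial algorithm mentioned in the abstract ("output the best of the given input preferences") can be sketched as follows. This is a minimal illustration and not the paper's implementation: it assumes each input preference is encoded as a tuple of 0/1 entries, one per swap, and that the cost of a candidate is the total number of swaps on which it disagrees with the inputs. The encoding and the names `disagreement`, `total_cost`, and `best_of_inputs` are hypothetical.

```python
from typing import Sequence, Tuple

# Assumed encoding: one 0/1 entry per swap, saying which outcome is preferred.
Preference = Tuple[int, ...]


def disagreement(a: Preference, b: Preference) -> int:
    """Number of swaps on which two preferences order the outcomes differently."""
    return sum(x != y for x, y in zip(a, b))


def total_cost(candidate: Preference, inputs: Sequence[Preference]) -> int:
    """Sum of disagreements between a candidate and every input preference."""
    return sum(disagreement(candidate, p) for p in inputs)


def best_of_inputs(inputs: Sequence[Preference]) -> Preference:
    """Trivial algorithm: return the input preference with the smallest total cost."""
    return min(inputs, key=lambda p: total_cost(p, inputs))


inputs = [(0, 0, 1), (0, 1, 1), (1, 1, 1)]
print(best_of_inputs(inputs))  # (0, 1, 1)
```

Under this Hamming-style cost, disagreement counts satisfy the triangle inequality, so the best input costs at most twice the optimum; that is one way to see where the factor 2 comes from.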

A Labelled Sample Compression Scheme of Size at Most Quadratic in the VC Dimension

Dec 28, 2022

Farnam Mansouri, Sandra Zilles

Abstract:This paper presents a construction of a proper and stable labelled sample compression scheme of size $O(\mathrm{VCD}^2)$ for any finite concept class, where $\mathrm{VCD}$ denotes the Vapnik-Chervonenkis Dimension. The construction is based on a well-known model of machine teaching, referred to as recursive teaching dimension. This substantially improves on the currently best known bound on the size of sample compression schemes (due to Moran and Yehudayoff), which is exponential in $\mathrm{VCD}$. The long-standing open question whether the smallest size of a sample compression scheme is in $O(\mathrm{VCD})$ remains unresolved, but our results show that research on machine teaching is a promising avenue for the study of this open problem. As further evidence of the strong connections between machine teaching and sample compression, we prove that the model of no-clash teaching, introduced by Kirkpatrick et al., can be used to define a non-trivial lower bound on the size of stable sample compression schemes.

* Our main claim is wrong. Our construction for a labelled compression scheme does not have a relationship with RTD^* and consequently is not O(VCD^2). It has a scientific error.

Actively Learning Deep Neural Networks with Uncertainty Sampling Based on Sum-Product Networks

Jun 20, 2022

Mohamadsadegh Khosravani, Sandra Zilles

Abstract:Active learning is a popular approach for reducing the amount of data needed to train deep neural network models. Its success hinges on the choice of an effective acquisition function, which ranks not yet labeled data points according to their expected informativeness. In uncertainty sampling, the uncertainty that the current model has about a point's class label is the main criterion for this type of ranking. This paper proposes a new approach to uncertainty sampling in training a Convolutional Neural Network (CNN). The main idea is to use the feature representation extracted by the CNN as data for training a Sum-Product Network (SPN). Since SPNs are typically used for estimating the distribution of a dataset, they are well suited to the task of estimating class probabilities that can be used directly by standard acquisition functions such as max entropy and variational ratio. Moreover, we enhance these acquisition functions by weights calculated with the help of the SPN model; these weights make the acquisition function more sensitive to the diversity of conceivable class labels for data points. The effectiveness of our method is demonstrated in an experimental study on the MNIST, Fashion-MNIST and CIFAR-10 datasets, where we compare it to the state-of-the-art methods MC Dropout and Bayesian Batch.

* 15 pages, 9 figures, 4 tables
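
The two standard acquisition functions named in the abstract can be illustrated with a short sketch. It assumes class probabilities are already available as plain lists (in the paper they come from an SPN; here they are made-up numbers), and it takes the variation ratio to be one minus the largest class probability. The SPN-based weighting from the paper is not reproduced, and the function names are hypothetical.

```python
import math
from typing import Callable, List, Sequence


def max_entropy(probs: Sequence[float]) -> float:
    """Shannon entropy of the predicted class distribution (higher means more uncertain)."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def variation_ratio(probs: Sequence[float]) -> float:
    """One minus the probability of the most likely class."""
    return 1.0 - max(probs)


def rank_by_uncertainty(
    pool: Sequence[Sequence[float]],
    score: Callable[[Sequence[float]], float],
) -> List[int]:
    """Indices of unlabeled points, most uncertain first."""
    return sorted(range(len(pool)), key=lambda i: score(pool[i]), reverse=True)


# Illustrative class probabilities for three unlabeled points (not real data).
pool = [[0.9, 0.05, 0.05], [0.4, 0.3, 0.3], [0.6, 0.3, 0.1]]
print(rank_by_uncertainty(pool, max_entropy))      # [1, 2, 0]
print(rank_by_uncertainty(pool, variation_ratio))  # [1, 2, 0]
```

In an active learning loop, the top-ranked indices would be the points sent for labeling in the next round.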

Inferring Symbolic Automata

Nov 12, 2020

Dana Fisman, Hadar Frenkel, Sandra Zilles

Abstract:We study the learnability of symbolic finite state automata (SFAs), a model shown useful in many applications in software verification. The state-of-the-art literature on this topic follows the query learning paradigm, and so far all obtained results are positive. We provide a necessary condition for efficient learnability of SFAs in this paradigm, from which we obtain the first negative result. Most of this work studies learnability of SFAs under the paradigm of identification in the limit using polynomial time and data. We provide a sufficient condition for efficient learnability of SFAs in this paradigm, as well as a necessary condition, and provide several positive and negative results.

Optimal Collusion-Free Teaching

Mar 10, 2019

David Kirkpatrick, Hans U. Simon, Sandra Zilles

Abstract:Formal models of learning from teachers need to respect certain criteria to avoid collusion. The most commonly accepted notion of collusion-freeness was proposed by Goldman and Mathias (1996), and various teaching models obeying their criterion have been studied. For each model $M$ and each concept class $\mathcal{C}$, a parameter $M$-$\mathrm{TD}(\mathcal{C})$ refers to the teaching dimension of concept class $\mathcal{C}$ in model $M$, defined to be the number of examples required for teaching a concept, in the worst case over all concepts in $\mathcal{C}$. This paper introduces a new model of teaching, called no-clash teaching, together with the corresponding parameter $\mathrm{NCTD}(\mathcal{C})$. No-clash teaching is provably optimal in the strong sense that, given any concept class $\mathcal{C}$ and any model $M$ obeying Goldman and Mathias's collusion-freeness criterion, one obtains $\mathrm{NCTD}(\mathcal{C})\le M$-$\mathrm{TD}(\mathcal{C})$. We also study a corresponding notion $\mathrm{NCTD}^+$ for the case of learning from positive data only, establish useful bounds on $\mathrm{NCTD}$ and $\mathrm{NCTD}^+$, and discuss relations of these parameters to the VC-dimension and to sample compression. In addition to formulating an optimal model of collusion-free teaching, our main results are on the computational complexity of deciding whether $\mathrm{NCTD}^+(\mathcal{C})=k$ (or $\mathrm{NCTD}(\mathcal{C})=k$) for given $\mathcal{C}$ and $k$. We show some such decision problems to be equivalent to the existence question for certain constrained matchings in bipartite graphs. Our NP-hardness results for the latter are of independent interest in the study of constrained graph matchings.

* 26 pages and 6 figures. This is an expanded version of a similarly titled paper to appear in Proceedings of Machine Learning Research (ALT 2019), vol. 98, 2019

The Complexity of Learning Acyclic Conditional Preference Networks

Aug 25, 2018

Eisa Alanazi, Malek Mouhoub, Sandra Zilles

Abstract:Learning of user preferences, as represented by, for example, Conditional Preference Networks (CP-nets), has become a core issue in AI research. Recent studies investigate learning of CP-nets from randomly chosen examples or from membership and equivalence queries. To assess the optimality of learning algorithms as well as to better understand the combinatorial structure of classes of CP-nets, it is helpful to calculate certain learning-theoretic information complexity parameters. This article focuses on the frequently studied case of learning from so-called swap examples, which express preferences among objects that differ in only one attribute. It presents bounds on or exact values of some well-studied information complexity parameters, namely the VC dimension, the teaching dimension, and the recursive teaching dimension, for classes of acyclic CP-nets. We further provide algorithms that learn tree-structured and general acyclic CP-nets from membership queries. Using our results on complexity parameters, we assess the optimality of our algorithms as well as that of another query learning algorithm for acyclic CP-nets presented in the literature. Our algorithms are near-optimal, and can, under certain assumptions, be adapted to the case when the membership oracle is faulty.

* 64 pages

An Overview of Machine Teaching

Jan 18, 2018

Xiaojin Zhu, Adish Singla, Sandra Zilles, Anna N. Rafferty

Abstract:In this paper we try to organize machine teaching as a coherent set of ideas. Each idea is presented as varying along a dimension. The collection of dimensions then forms the problem space of machine teaching, such that existing teaching problems can be characterized in this space. We hope this organization allows us to gain deeper understanding of individual teaching problems, discover connections among them, and identify gaps in the field.

* A tutorial document grown out of NIPS 2017 Workshop on Teaching Machines, Robots, and Humans

An Empirical Study of the Effects of Spurious Transitions on Abstraction-based Heuristics

Nov 14, 2017

Mehdi Sadeqi, Robert C. Holte, Sandra Zilles

Abstract:The efficient solution of state space search problems is often attempted by guiding search algorithms with heuristics (estimates of the distance from any state to the goal). A popular way of creating heuristic functions is by using an abstract version of the state space. However, the quality of abstraction-based heuristic functions, and thus the speed of search, can suffer from spurious transitions, i.e., state transitions in the abstract state space for which no corresponding transitions in the reachable component of the original state space exist. Our first contribution is a quantitative study demonstrating that the harmful effects of spurious transitions on heuristic functions can be substantial, in terms of both the increase in the number of abstract states and the decrease in the heuristic values, which may slow down search. Our second contribution is an empirical study on the benefits of removing a certain kind of spurious transition, namely those that involve states with a pair of mutually exclusive (mutex) variable-value assignments. In the context of state space planning, a mutex pair is a pair of variable-value assignments that does not occur in any reachable state. Detecting mutex pairs is a problem that has been addressed frequently in the planning literature. Our study shows that there are cases in which mutex detection helps to eliminate harmful spurious transitions to a large extent and thus to speed up search substantially.

* 38 pages, 9 figures, appendix with 5 figures
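
The definition of a mutex pair given in the abstract (a pair of variable-value assignments that does not occur in any reachable state) can be made concrete with a brute-force sketch. It assumes the reachable states are few enough to enumerate explicitly and are encoded as dictionaries from variable names to values; this is for illustration only and is not one of the detection methods from the planning literature. The names `mutex_pairs` and `contains_mutex` are hypothetical.

```python
from itertools import combinations, product
from typing import Dict, FrozenSet, Iterable, List, Set, Tuple

Assignment = Tuple[str, int]   # (variable, value)
State = Dict[str, int]         # assumed encoding: variable name -> value


def mutex_pairs(
    reachable: Iterable[State], domains: Dict[str, List[int]]
) -> Set[FrozenSet[Assignment]]:
    """Pairs of assignments to different variables that co-occur in no reachable state."""
    seen: Set[FrozenSet[Assignment]] = set()
    for state in reachable:
        for a, b in combinations(sorted(state.items()), 2):
            seen.add(frozenset((a, b)))
    all_pairs: Set[FrozenSet[Assignment]] = set()
    for v1, v2 in combinations(sorted(domains), 2):
        for x, y in product(domains[v1], domains[v2]):
            all_pairs.add(frozenset(((v1, x), (v2, y))))
    return all_pairs - seen


def contains_mutex(state: State, mutexes: Set[FrozenSet[Assignment]]) -> bool:
    """True if a (possibly abstract, partial) state includes some mutex pair."""
    return any(
        frozenset(pair) in mutexes
        for pair in combinations(sorted(state.items()), 2)
    )


domains = {"a": [0, 1], "b": [0, 1]}
reachable = [{"a": 0, "b": 0}, {"a": 1, "b": 0}, {"a": 0, "b": 1}]
mutexes = mutex_pairs(reachable, domains)
print(contains_mutex({"a": 1, "b": 1}, mutexes))  # True
```

An abstract state flagged by `contains_mutex` corresponds to no reachable original state, so transitions through it are candidates for the spurious transitions the abstract describes.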

Front-to-End Bidirectional Heuristic Search with Near-Optimal Node Expansions

May 23, 2017

Jingwei Chen, Robert C. Holte, Sandra Zilles, Nathan R. Sturtevant

Abstract:It is well-known that any admissible unidirectional heuristic search algorithm must expand all states whose $f$-value is smaller than the optimal solution cost when using a consistent heuristic. Such states are called "surely expanded" (s.e.). A recent study characterized s.e. pairs of states for bidirectional search with consistent heuristics: if a pair of states is s.e. then at least one of the two states must be expanded. This paper derives a lower bound, VC, on the minimum number of expansions required to cover all s.e. pairs, and presents a new admissible front-to-end bidirectional heuristic search algorithm, Near-Optimal Bidirectional Search (NBS), that is guaranteed to do no more than 2VC expansions. We further prove that no admissible front-to-end algorithm has a worst case better than 2VC. Experimental results show that NBS competes with or outperforms existing bidirectional search algorithms, and often outperforms A* as well.

* Accepted to IJCAI 2017. Camera ready version with new timing results
