Nathanaël Fijalkow

GPU accelerated program synthesis: Enumerate semantics, not syntax!

Apr 26, 2025

Martin Berger, Nathanaël Fijalkow, Mojtaba Valizadeh

Abstract: Program synthesis is an umbrella term for generating programs and logical formulae from specifications. With the remarkable performance improvements that GPUs enable for deep learning, a natural question arose: can we also implement a search-based program synthesiser on GPUs to achieve similar performance improvements? In this article we discuss our insights on this question, based on recent works. The goal is to build a synthesiser running on GPUs which takes as input positive and negative example traces and returns a logical formula accepting the positive and rejecting the negative traces. With GPU-friendly programming techniques, which use the semantics of formulae to minimise data movement and reduce data-dependent branching, our synthesiser scales to significantly larger synthesis problems and operates much faster than the previous CPU-based state of the art. We believe the insights that make our approach GPU-friendly have wide potential for enhancing the performance of other formal methods (FM) workloads.

* 10 pages
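
The idea of enumerating semantics rather than syntax can be illustrated with a small CPU-only sketch. It is not the authors' GPU synthesiser: it assumes a simplified propositional setting in which each example is a single truth assignment instead of a trace, and the function name `synthesise` and its parameters are hypothetical. Each formula is represented by a bitvector recording which examples it accepts, and a new formula is kept only if its bitvector has not been seen before.

```python
from itertools import product


def synthesise(variables, positives, negatives, max_rounds=4):
    """Bottom-up enumeration that deduplicates formulas by their semantics.

    Bit i of a formula's semantics is 1 iff the formula accepts example i.
    Positives come first, so the target has exactly the low n_pos bits set.
    """
    examples = positives + negatives
    full = (1 << len(examples)) - 1
    target = (1 << len(positives)) - 1

    # Map: semantics (bitvector) -> first formula found with that semantics.
    seen = {}
    for v in variables:
        sem = sum(1 << i for i, ex in enumerate(examples) if ex[v])
        seen.setdefault(sem, v)

    for _ in range(max_rounds):
        if target in seen:
            return seen[target]
        current = list(seen.items())
        # Combining semantics is pure bitwise arithmetic, no syntax involved.
        for sem, f in current:
            seen.setdefault(~sem & full, f"!({f})")
        for (s1, f1), (s2, f2) in product(current, repeat=2):
            seen.setdefault(s1 & s2, f"({f1} & {f2})")
            seen.setdefault(s1 | s2, f"({f1} | {f2})")
    return seen.get(target)


formula = synthesise(
    ["p", "q"],
    positives=[{"p": True, "q": False}],
    negatives=[{"p": True, "q": True}, {"p": False, "q": False}],
)
```

Because at most one formula is stored per distinct bitvector, the table size is bounded by the number of distinct behaviours on the examples rather than by the number of syntactically different formulas.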

EcoSearch: A Constant-Delay Best-First Search Algorithm for Program Synthesis

Dec 23, 2024

Théo Matricon, Nathanaël Fijalkow, Guillaume Lagarde

Abstract: Many approaches to program synthesis perform a combinatorial search within a large space of programs to find one that satisfies a given specification. To tame the search space blowup, previous works introduced probabilistic and neural approaches to guide this combinatorial search by inducing heuristic cost functions. Best-first search algorithms guarantee that programs are explored in the exact order induced by the cost function, significantly reducing the portion of the program space to be explored. We present a new best-first search algorithm called EcoSearch, which is the first constant-delay algorithm for pre-generation cost functions: the amount of compute required between outputting two programs is constant, and in particular does not increase over time. This key property yields important speedups: we observe that EcoSearch outperforms its predecessors on two classic domains.

* Extended version of AAAI 2025
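
For orientation, here is a minimal sketch of what best-first enumeration means, using a plain priority queue. It is a naive baseline and not EcoSearch: the delay between two outputs grows with the size of the heap, which is exactly the behaviour a constant-delay algorithm avoids. The toy grammar, its integer rule costs, and the name `best_first` are assumptions made for illustration.

```python
import heapq
from itertools import count, islice


def best_first(grammar, start):
    """Yield (cost, program) pairs in nondecreasing cost order.

    grammar maps a nonterminal to a list of (rule_cost, right_hand_side),
    with non-negative costs. A program's cost is the sum of its rule costs.
    """
    tie = count()  # unique tie-breaker so tuples of symbols are never compared
    heap = [(0, next(tie), (start,))]
    while heap:
        cost, _, form = heapq.heappop(heap)
        # Find the leftmost nonterminal, if any.
        idx = next((i for i, s in enumerate(form) if s in grammar), None)
        if idx is None:
            yield cost, " ".join(form)  # complete program
            continue
        for rule_cost, rhs in grammar[form[idx]]:
            new_form = form[:idx] + rhs + form[idx + 1:]
            heapq.heappush(heap, (cost + rule_cost, next(tie), new_form))


# Toy grammar in prefix notation: E -> x | 1 | add E E
toy_grammar = {"E": [(1, ("x",)), (2, ("1",)), (3, ("add", "E", "E"))]}
first_ten = list(islice(best_first(toy_grammar, "E"), 10))
```

Since rule costs are non-negative, any completion of a partial program costs at least as much as the partial program itself, so popping the cheapest entry first yields complete programs in cost order.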

Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives

Dec 16, 2024

Marius Belly, Nathanaël Fijalkow, Hugo Gimbert, Florian Horn, Guillermo A. Pérez, Pierre Vandenhove

Abstract: Partially observable Markov decision processes (POMDPs) form a prominent model for uncertainty in sequential decision making. We are interested in constructing algorithms with theoretical guarantees to determine whether the agent has a strategy ensuring a given specification with probability 1. This well-studied problem is known to be undecidable already for very simple omega-regular objectives, because of the difficulty of reasoning about uncertain events. We introduce a revelation mechanism which restricts information loss by requiring that almost surely the agent eventually has full information about the current state. Our main technical results are to construct exact algorithms for two classes of POMDPs called weakly and strongly revealing. Importantly, the decidable cases reduce to the analysis of a finite belief-support Markov decision process. This yields a conceptually simple and exact algorithm for a large class of POMDPs.

* Extended version of paper accepted to AAAI 2025. 26 pages, 10 figures
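
The belief support mentioned in the abstract is the set of states the agent considers possible. A minimal sketch of the standard support update follows, assuming that each state emits a single deterministic observation and that transitions are stored in a dictionary; the names `support_update`, `transitions`, and `observe`, as well as the toy model, are hypothetical and not taken from the paper.

```python
def support_update(support, action, observation, transitions, observe):
    """Return the set of states that are possible after playing `action`
    from some state in `support` and then seeing `observation`.

    transitions[(state, action)] maps successor states to probabilities.
    observe[state] is the observation emitted in that state (assumed
    deterministic in this sketch).
    """
    return frozenset(
        nxt
        for state in support
        for nxt, prob in transitions[(state, action)].items()
        if prob > 0 and observe[nxt] == observation
    )


# Toy model: from s0 the agent moves to s1 or s2, which look the same.
transitions = {
    ("s0", "a"): {"s1": 0.5, "s2": 0.5},
    ("s1", "a"): {"s1": 0.5, "s3": 0.5},
    ("s2", "a"): {"s2": 1.0},
    ("s3", "a"): {"s3": 1.0},
}
observe = {"s0": "start", "s1": "dark", "s2": "dark", "s3": "light"}

after_one = support_update(frozenset({"s0"}), "a", "dark", transitions, observe)
after_two = support_update(after_one, "a", "light", transitions, observe)
```

Only the support is tracked, not the exact probabilities, so the number of reachable supports is finite (at most the number of subsets of the state space). In the toy model, the observation `light` reveals the current state exactly.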

LTL learning on GPUs

Feb 19, 2024

Mojtaba Valizadeh, Nathanaël Fijalkow, Martin Berger

Abstract: Linear temporal logic (LTL) is widely used in industrial verification. LTL formulae can be learned from traces. Scaling LTL formula learning is an open problem. We implement the first GPU-based LTL learner using a novel form of enumerative program synthesis. The learner is sound and complete. Our benchmarks indicate that it handles at least 2048 times more traces, and is on average at least 46 times faster, than existing state-of-the-art learners. This is achieved with, among other techniques, a novel branch-free LTL semantics that has $O(\log n)$ time complexity, where $n$ is trace length, while previous implementations are $O(n^2)$ or worse (assuming bitwise boolean operations and shifts by powers of 2 have unit cost, a realistic assumption on modern processors).

* 27 pages
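
To give a feel for how shifts by powers of 2 can evaluate temporal operators in a logarithmic number of steps, here is a small sketch on Python integers. It is an illustration under stated assumptions, not the paper's implementation: a trace of length `n` is encoded as a bitvector whose bit `i` says whether a subformula holds at position `i`, finite-trace semantics with a strong next operator is assumed, and the function names are hypothetical.

```python
def next_(bits: int) -> int:
    """X phi: position i takes the value of phi at position i + 1.
    The last position receives 0 (strong next on finite traces)."""
    return bits >> 1


def eventually(bits: int, n: int) -> int:
    """F phi: bit i is set iff some bit j >= i of phi is set.
    Uses about log2(n) shift-and-or steps; the loop depends only on n,
    never on the data in `bits`."""
    shift = 1
    while shift < n:
        bits |= bits >> shift
        shift *= 2
    return bits


def always(bits: int, n: int) -> int:
    """G phi, obtained from F by duality: G phi = not F not phi."""
    mask = (1 << n) - 1
    return ~eventually(~bits & mask, n) & mask


n = 8
p = 0b00010000            # p holds only at position 4
f_p = eventually(p, n)    # holds at positions 0..4
g_q = always(0b11110000, n)  # q holds from position 4 onwards
```

After the step with shift `k`, each bit summarises a window of `2k` positions, which is why doubling the shift covers the whole trace in logarithmically many steps.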

Theoretical foundations for programmatic reinforcement learning

Feb 18, 2024

Guruprerana Shabadi, Nathanaël Fijalkow, Théo Matricon

Abstract: The field of Reinforcement Learning (RL) is concerned with algorithms for learning optimal policies in unknown stochastic environments. Programmatic RL studies representations of policies as programs, that is, policies involving higher-order constructs such as control loops. Despite attracting a lot of attention at the intersection of the machine learning and formal methods communities, very little is known on the theoretical front about programmatic RL: what are good classes of programmatic policies? How large are optimal programmatic policies? How can we learn them? The goal of this paper is to give first answers to these questions, initiating a theoretical study of programmatic RL.

Learning temporal formulas from examples is hard

Dec 26, 2023

Corto Mascle, Nathanaël Fijalkow, Guillaume Lagarde

Abstract: We study the problem of learning linear temporal logic (LTL) formulas from examples, as a first step towards expressing a property separating positive and negative instances in a way that is comprehensible to humans. In this paper we initiate the study of the computational complexity of the problem. Our main results are hardness results: we show that the LTL learning problem is NP-complete, both for the full logic and for almost all of its fragments. This motivates the search for efficient heuristics, and highlights the complexity of expressing separating properties in concise natural language.

* This article is a long version of the article arXiv:2102.00876 presented in the International Conference on Grammatical Inference (ICGI) in 2021. It includes much stronger and more general results than the extended abstract. Submitted to a journal

WikiCoder: Learning to Write Knowledge-Powered Code

Mar 15, 2023

Théo Matricon, Nathanaël Fijalkow, Gaëtan Margueritte

Abstract: We tackle the problem of automatic generation of computer programs from a few pairs of input-output examples. The starting point of this work is the observation that in many applications a solution program must use external knowledge not present in the examples: we call such programs knowledge-powered since they can refer to information collected from a knowledge graph such as Wikipedia. This paper makes a first step towards knowledge-powered program synthesis. We present WikiCoder, a system building upon state-of-the-art machine-learned program synthesizers and integrating knowledge graphs. We evaluate it to show its wide applicability over different domains and discuss its limitations. WikiCoder solves tasks that no program synthesizer was able to solve before thanks to the use of knowledge graphs, while integrating with recent developments in the field to operate at scale.

* Published in the proceedings of SPIN 2023

Scalable Anytime Algorithms for Learning Formulas in Linear Temporal Logic

Oct 27, 2021

Ritam Raha, Rajarshi Roy, Nathanaël Fijalkow, Daniel Neider

Abstract: Linear temporal logic (LTL) is a specification language for finite sequences (called traces) widely used in program verification, motion planning in robotics, process mining, and many other areas. We consider the problem of learning LTL formulas for classifying traces; despite a growing interest of the research community, existing solutions suffer from two limitations: they do not scale beyond small formulas, and they may exhaust computational resources without returning any result. We introduce a new algorithm addressing both issues: our algorithm is able to construct formulas an order of magnitude larger than previous methods, and it is anytime, meaning that in most cases it successfully outputs a formula, albeit possibly not of minimal size. We evaluate the performance of our algorithm using an open source implementation against publicly available benchmarks.
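
To make the classification task concrete, the sketch below evaluates an LTL formula on finite traces and checks whether it separates positive from negative traces. It is a plain reference evaluator, not the algorithm of the paper. The tuple encoding of formulas, the strong interpretation of the next operator at the end of a trace, and the names `holds` and `separates` are assumptions made for this illustration.

```python
def holds(formula, trace, i=0):
    """Does `formula` hold at position i of the non-empty finite `trace`?

    A trace is a list of sets of atomic propositions. Formulas are nested
    tuples such as ("F", ("and", ("ap", "p"), ("X", ("ap", "q")))).
    """
    op = formula[0]
    n = len(trace)
    if op == "ap":
        return formula[1] in trace[i]
    if op == "not":
        return not holds(formula[1], trace, i)
    if op == "and":
        return holds(formula[1], trace, i) and holds(formula[2], trace, i)
    if op == "or":
        return holds(formula[1], trace, i) or holds(formula[2], trace, i)
    if op == "X":  # strong next: false at the last position
        return i + 1 < n and holds(formula[1], trace, i + 1)
    if op == "F":
        return any(holds(formula[1], trace, j) for j in range(i, n))
    if op == "G":
        return all(holds(formula[1], trace, j) for j in range(i, n))
    if op == "U":
        return any(
            holds(formula[2], trace, j)
            and all(holds(formula[1], trace, k) for k in range(i, j))
            for j in range(i, n)
        )
    raise ValueError(f"unknown operator: {op}")


def separates(formula, positives, negatives):
    """True iff the formula accepts every positive and rejects every negative."""
    return all(holds(formula, t) for t in positives) and not any(
        holds(formula, t) for t in negatives
    )


# "Eventually p is immediately followed by q."
phi = ("F", ("and", ("ap", "p"), ("X", ("ap", "q"))))
ok = separates(phi, positives=[[{"p"}, {"q"}]], negatives=[[{"q"}, {"p"}]])
```

A learner searches for a formula on which `separates` returns true; checking a candidate is the easy part, while finding a small one is where the difficulty lies.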

Scaling Neural Program Synthesis with Distribution-based Search

Oct 24, 2021

Nathanaël Fijalkow, Guillaume Lagarde, Théo Matricon, Kevin Ellis, Pierre Ohlmann, Akarsh Potta

Abstract: We consider the problem of automatically constructing computer programs from input-output examples. We investigate how to augment probabilistic and neural program synthesis methods with new search algorithms, proposing a framework called distribution-based search. Within this framework, we introduce two new search algorithms: Heap Search, an enumerative method, and SQRT Sampling, a probabilistic method. We prove certain optimality guarantees for both methods, show how they integrate with probabilistic and neural techniques, and demonstrate how they can operate at scale across parallel compute environments. Collectively, these findings offer theoretical and applied studies of search algorithms for program synthesis that integrate with recent developments in machine-learned program synthesizers.

* Attached repository: https://github.com/nathanael-fijalkow/DeepSynth/

The Complexity of Learning Linear Temporal Formulas from Examples

Feb 01, 2021

Nathanaël Fijalkow, Guillaume Lagarde

Abstract: In this paper we initiate the study of the computational complexity of learning linear temporal logic (LTL) formulas from examples. We construct approximation algorithms for fragments of LTL and prove hardness results; in particular, we obtain tight bounds for approximation of the fragment containing only the next operator and conjunctions, and prove NP-completeness results for many fragments.
