Abstract:We present an approach using Monte Carlo Tree Search (MCTS) to guide Large Language Models (LLMs) to generate verified programs in Dafny, Lean and Coq. Our method, which we call VMCTS, leverages the verifier inside the search algorithm by checking partial programs at each step. In combination with the LLM prior, the verifier feedback raises the synthesis capabilities of open source models. On a set of five verified programming problems, we find that for four problems that the base model cannot solve even when re-sampling solutions for one hour, VMCTS solves them within 6 minutes. The base model with VMCTS is even competitive with ChatGPT4 augmented with plugins and multiple re-tries on these problems. Our code and benchmarks are available at https://github.com/namin/llm-verified-with-monte-carlo-tree-search .
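To make the search loop concrete, here is a minimal sketch of the idea rather than the authors' implementation: an MCTS-style loop whose expansion step asks an LLM for the next chunk of a partial program and whose evaluation step scores that partial program with the verifier. `sample_continuation` and `verify_partial` are hypothetical stand-ins for the LLM call and the Dafny/Lean/Coq checker, and the reward scale and search budget are illustrative.

```python
import math
import random

def sample_continuation(partial_program):
    # Hypothetical stand-in for the LLM: return one candidate next chunk.
    return partial_program + "<next line>\n"

def verify_partial(partial_program):
    # Hypothetical stand-in for the verifier: +1 if the partial program
    # checks, -1 if it is already broken, 0 if the verdict is still open.
    return random.choice([-1, 0, 1])

class Node:
    def __init__(self, program, parent=None):
        self.program = program
        self.parent = parent
        self.children = []
        self.visits = 0
        self.value = 0.0

def ucb(node, c=1.4):
    # Standard UCT score balancing verifier-derived value and exploration.
    if node.visits == 0:
        return float("inf")
    return node.value / node.visits + c * math.sqrt(
        math.log(node.parent.visits) / node.visits)

def vmcts_step(root):
    # Selection: descend by UCT until reaching a leaf.
    node = root
    while node.children:
        node = max(node.children, key=ucb)
    # Expansion: sample a continuation of the partial program from the LLM.
    child = Node(sample_continuation(node.program), parent=node)
    node.children.append(child)
    # Evaluation: score the new partial program with the verifier.
    reward = verify_partial(child.program)
    # Backpropagation: push the verifier feedback up the tree.
    while child is not None:
        child.visits += 1
        child.value += reward
        child = child.parent

root = Node("")          # start from the empty partial program
for _ in range(100):     # fixed search budget for the sketch
    vmcts_step(root)
```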
Abstract:Existing approaches to amortized inference in probabilistic programs with unbounded loops can produce estimators with infinite variance. An instance of this is importance sampling inference in programs that explicitly include rejection sampling as part of the user-programmed generative procedure. In this paper we develop a new and efficient amortized importance sampling estimator. We prove finite variance of our estimator, empirically demonstrate our method's correctness and efficiency compared to existing alternatives on generative programs containing rejection sampling loops, and discuss how to implement our method in a generic probabilistic programming framework.
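For readers unfamiliar with the setting, the toy program below illustrates the class of models the abstract refers to, not the paper's estimator: a generative program whose latent variable is produced by an explicit, user-written rejection-sampling loop, paired with a baseline self-normalized importance sampler that uses the prior as proposal. The truncated-normal model, the likelihood, and the observation are invented for illustration; the variance issue the paper addresses arises when a learned proposal must account for the random choices made inside such loops.

```python
import math
import random

def normal_logpdf(x, mu, sigma):
    return -0.5 * ((x - mu) / sigma) ** 2 - math.log(sigma * math.sqrt(2 * math.pi))

def prior_sample():
    # Rejection-sampling loop written inside the generative program:
    # keep drawing from a standard normal until the constraint x > 0 holds,
    # i.e. the latent is a truncated normal expressed via rejection.
    while True:
        x = random.gauss(0.0, 1.0)
        if x > 0.0:
            return x

def posterior_mean_snis(y_obs, n=10_000):
    # Baseline self-normalized importance sampler: run the whole program
    # forward (rejection loop included) and weight by the likelihood.
    total_w = 0.0
    total_wx = 0.0
    for _ in range(n):
        x = prior_sample()
        w = math.exp(normal_logpdf(y_obs, x, 0.5))   # y_obs ~ Normal(x, 0.5)
        total_w += w
        total_wx += w * x
    return total_wx / total_w

print(posterior_mean_snis(1.0))
```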
Abstract:Inference amortization methods share information across multiple posterior-inference problems, allowing each to be carried out more efficiently. Generally, they require the inversion of the dependency structure in the generative model, as the modeller must learn a mapping from observations to distributions approximating the posterior. Previous approaches have involved inverting the dependency structure in a heuristic way that fails to capture these dependencies correctly, thereby limiting the achievable accuracy of the resulting approximations. We introduce an algorithm for faithfully, and minimally, inverting the graphical model structure of any generative model. Such inverses have two crucial properties: (a) they do not encode any independence assertions that are absent from the model; and (b) they are local maxima for the number of true independencies encoded. We prove the correctness of our approach and empirically show that the resulting minimally faithful inverses lead to better inference amortization than existing heuristic approaches.
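The failure mode of heuristic inversion can be seen on the smallest possible example, sketched below with plain Python sets rather than the paper's algorithm: in the v-structure a -> c <- b, simply reversing the edges yields an inverse with no edge between a and b, even though conditioning on the observation c makes a and b dependent. The moralization step shown here, which is only one ingredient of a full inversion procedure, is where that edge gets recovered.

```python
# Generative model a -> c <- b given as parent lists: a and b are
# independent causes of the observed variable c.
parents = {"a": [], "b": [], "c": ["a", "b"]}

def heuristic_inverse_edges(parents):
    # Heuristic inversion: just flip every edge of the model.
    return {frozenset((child, p)) for child, ps in parents.items() for p in ps}

def moral_edges(parents):
    # Moral graph: keep every edge (undirected) and "marry" the co-parents
    # of each node, which is where the a-b dependence gets recorded.
    edges = set()
    for child, ps in parents.items():
        for p in ps:
            edges.add(frozenset((p, child)))
        for i in range(len(ps)):
            for j in range(i + 1, len(ps)):
                edges.add(frozenset((ps[i], ps[j])))
    return edges

print(frozenset(("a", "b")) in heuristic_inverse_edges(parents))  # False: dependence dropped
print(frozenset(("a", "b")) in moral_edges(parents))              # True: co-parents married
```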
Abstract:Probabilistic inference procedures are usually coded painstakingly from scratch, for each target model and each inference algorithm. We reduce this effort by generating inference procedures from models automatically. We make this code generation modular by decomposing inference algorithms into reusable program-to-program transformations. These transformations perform exact inference as well as generate probabilistic programs that compute expectations, densities, and MCMC samples. The resulting inference procedures are about as accurate and fast as other probabilistic programming systems on real-world problems.
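As a toy illustration of treating inference as a transformation on probabilistic programs (far simpler than the system described, and closer to an interpreter than to generated code), the sketch below defines a three-constructor language of discrete programs and an `expectation` transformation that turns such a program into an exact expected value by enumerating its choices. The constructors and the example program are invented for illustration.

```python
from dataclasses import dataclass
from typing import Any, Callable

@dataclass
class Return:            # deterministic program: yields `value` with probability 1
    value: Any

@dataclass
class Bernoulli:         # random coin with success probability p
    p: float

@dataclass
class Bind:              # sequencing: run `prog`, feed its result to `k`
    prog: Any
    k: Callable[[Any], Any]

def expectation(prog, f=lambda x: x):
    # Transformation from a program denoting a distribution to a number:
    # E[f(X)] computed by exhaustive enumeration of the discrete choices.
    if isinstance(prog, Return):
        return f(prog.value)
    if isinstance(prog, Bernoulli):
        return prog.p * f(1) + (1 - prog.p) * f(0)
    if isinstance(prog, Bind):
        return expectation(prog.prog, lambda x: expectation(prog.k(x), f))
    raise TypeError(prog)

# Example: flip two coins and sum them; the expectation is 0.3 + 0.5 = 0.8.
two_coins = Bind(Bernoulli(0.3),
                 lambda a: Bind(Bernoulli(0.5),
                                lambda b: Return(a + b)))
print(expectation(two_coins))   # 0.8
```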
Abstract:We draw a formal connection between using synthetic training data to optimize neural network parameters and approximate, Bayesian, model-based reasoning. In particular, training a neural network using synthetic data can be viewed as learning a proposal distribution generator for approximate inference in the synthetic-data generative model. We demonstrate this connection in a recognition task where we develop a novel Captcha-breaking architecture and train it using synthetic data, demonstrating both state-of-the-art performance and a way of computing task-specific posterior uncertainty. Using a neural network trained this way, we also demonstrate successful breaking of real-world Captchas currently used by Facebook and Wikipedia. Reasoning from these empirical results and drawing connections with Bayesian modeling, we discuss the robustness of synthetic data results and suggest important considerations for ensuring good neural network generalization when training with synthetic data.
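The connection the abstract draws can be seen in a miniature, fully synthetic example, sketched below under simplified assumptions: pairs of latent labels and observations are sampled from a hand-written generative model, and a logistic-regression "recognizer" is fit to them by maximum likelihood, which drives it toward the model's true posterior. The paper's setting replaces this toy model with a Captcha renderer and the regression with a neural network; the model, learning rate, and iteration count here are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

def generative_model(n):
    # Latent class z ~ Bernoulli(0.5); observation x ~ Normal(2z - 1, 1).
    z = rng.integers(0, 2, size=n)
    x = rng.normal(2.0 * z - 1.0, 1.0)
    return x, z

# Synthetic training pairs drawn from the generative model.
x_train, z_train = generative_model(5000)

# Fit a recognizer q(z=1 | x) = sigmoid(w*x + b) by gradient ascent on the
# log-likelihood of the synthetic labels (an amortized posterior approximation).
w, b, lr = 0.0, 0.0, 0.5
for _ in range(500):
    p = 1.0 / (1.0 + np.exp(-(w * x_train + b)))
    w += lr * np.mean((z_train - p) * x_train)
    b += lr * np.mean(z_train - p)

# For this model the exact posterior is sigmoid(2x), so the trained
# recognizer should land close to it.
x0 = 0.5
q_approx = 1.0 / (1.0 + np.exp(-(w * x0 + b)))
true_post = 1.0 / (1.0 + np.exp(-2.0 * x0))
print(q_approx, true_post)
```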