Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Botond Cseke

Guided Decoding for Robot Motion Generation and Adaption

Mar 22, 2024

Nutan Chen, Elie Aljalbout, Botond Cseke, Patrick van der Smagt

Abstract:We address motion generation for high-DoF robot arms in complex settings with obstacles, via points, etc. A significant advancement in this domain is achieved by integrating Learning from Demonstration (LfD) into the motion generation process. This integration facilitates rapid adaptation to new tasks and optimizes the utilization of accumulated expertise by allowing robots to learn and generalize from demonstrated trajectories. We train a transformer architecture on a large dataset of simulated trajectories. This architecture, based on a conditional variational autoencoder transformer, learns essential motion generation skills and adapts these to meet auxiliary tasks and constraints. Our auto-regressive approach enables real-time integration of feedback from the physical system, enhancing the adaptability and efficiency of motion generation. We show that our model can generate motion from initial and target points, but also that it can adapt trajectories in navigating complex tasks, including obstacle avoidance, via points, and meeting velocity and acceleration constraints, across platforms.

* 7 pages

Via

Access Paper or Ask Questions

Local distance preserving auto-encoders using Continuous k-Nearest Neighbours graphs

Jun 13, 2022

Nutan Chen, Patrick van der Smagt, Botond Cseke

Figure 1 for Local distance preserving auto-encoders using Continuous k-Nearest Neighbours graphs

Figure 2 for Local distance preserving auto-encoders using Continuous k-Nearest Neighbours graphs

Figure 3 for Local distance preserving auto-encoders using Continuous k-Nearest Neighbours graphs

Figure 4 for Local distance preserving auto-encoders using Continuous k-Nearest Neighbours graphs

Abstract:Auto-encoder models that preserve similarities in the data are a popular tool in representation learning. In this paper we introduce several auto-encoder models that preserve local distances when mapping from the data space to the latent space. We use a local distance preserving loss that is based on the continuous k-nearest neighbours graph which is known to capture topological features at all scales simultaneously. To improve training performance, we formulate learning as a constraint optimisation problem with local distance preservation as the main objective and reconstruction accuracy as a constraint. We generalise this approach to hierarchical variational auto-encoders thus learning generative models with geometrically consistent latent and data spaces. Our method provides state-of-the-art performance across several standard datasets and evaluation metrics.

Via

Access Paper or Ask Questions

Constrained Probabilistic Movement Primitives for Robot Trajectory Adaptation

Jan 29, 2021

Felix Frank, Alexandros Paraschos, Patrick van der Smagt, Botond Cseke

Figure 1 for Constrained Probabilistic Movement Primitives for Robot Trajectory Adaptation

Figure 2 for Constrained Probabilistic Movement Primitives for Robot Trajectory Adaptation

Figure 3 for Constrained Probabilistic Movement Primitives for Robot Trajectory Adaptation

Figure 4 for Constrained Probabilistic Movement Primitives for Robot Trajectory Adaptation

Abstract:Versatile movement representations allow robots to learn new tasks and rapidly adapt them to environmental changes, e.g. introduction of obstacles, placing additional robots in the workspace, modification of the joint range due to faults or range of motion constraints due to tool manipulation. Probabilistic movement primitives (ProMP) model robot movements as a distribution over trajectories and they are an important tool due to their analytical tractability and ability to learn and generalise from a small number of demonstrations. Current approaches solve specific adaptation problems, e.g. obstacle avoidance, however, a generic probabilistic approach to adaptation has not yet been developed. In this paper we propose a generic probabilistic framework for adapting ProMPs. We formulate adaptation as a constrained optimisation problem where we minimise the Kullback-Leibler divergence between the adapted distribution and the distribution of the original primitive and we constrain the probability mass associated with undesired trajectories to be low. We derive several types of constraints that can be added depending on the task, such us joint limiting, various types of obstacle avoidance, via-points, and mutual avoidance, under a common framework. We demonstrate our approach on several adaptation problems on simulated planar robot arms and 7-DOF Franka-Emika robots in single and dual robot arm settings.

* There is a supplementary video accompanying the paper. It can be found at https://youtu.be/ErdP7bA11v8

Via

Access Paper or Ask Questions

Increasing the Generalisation Capacity of Conditional VAEs

Sep 10, 2019

Alexej Klushyn, Nutan Chen, Botond Cseke, Justin Bayer, Patrick van der Smagt

Figure 1 for Increasing the Generalisation Capacity of Conditional VAEs

Figure 2 for Increasing the Generalisation Capacity of Conditional VAEs

Figure 3 for Increasing the Generalisation Capacity of Conditional VAEs

Figure 4 for Increasing the Generalisation Capacity of Conditional VAEs

Abstract:We address the problem of one-to-many mappings in supervised learning, where a single instance has many different solutions of possibly equal cost. The framework of conditional variational autoencoders describes a class of methods to tackle such structured-prediction tasks by means of latent variables. We propose to incentivise informative latent representations for increasing the generalisation capacity of conditional variational autoencoders. To this end, we modify the latent variable model by defining the likelihood as a function of the latent variable only and introduce an expressive multimodal prior to enable the model for capturing semantically meaningful features of the data. To validate our approach, we train our model on the Cornell Robot Grasping dataset, and modified versions of MNIST and Fashion-MNIST obtaining results that show a significantly higher generalisation capability.

Via

Access Paper or Ask Questions

Learning Hierarchical Priors in VAEs

May 23, 2019

Alexej Klushyn, Nutan Chen, Richard Kurle, Botond Cseke, Patrick van der Smagt

Figure 1 for Learning Hierarchical Priors in VAEs

Figure 2 for Learning Hierarchical Priors in VAEs

Figure 3 for Learning Hierarchical Priors in VAEs

Figure 4 for Learning Hierarchical Priors in VAEs

Abstract:We propose to learn a hierarchical prior in the context of variational autoencoders to avoid the over-regularisation resulting from a standard normal prior distribution. To incentivise an informative latent representation of the data by learning a rich hierarchical prior, we formulate the objective function as the Lagrangian of a constrained-optimisation problem and propose an optimisation algorithm inspired by Taming VAEs. We introduce a graph-based interpolation method, which shows that the topology of the learned latent representation corresponds to the topology of the data manifold---and present several examples, where desired properties of latent representation such as smoothness and simple explanatory factors are learned by the prior. Furthermore, we validate our approach on standard datasets, obtaining state-of-the-art test log-likelihoods.

Via

Access Paper or Ask Questions

Efficient Low-Order Approximation of First-Passage Time Distributions

Nov 01, 2017

David Schnoerr, Botond Cseke, Ramon Grima, Guido Sanguinetti

Figure 1 for Efficient Low-Order Approximation of First-Passage Time Distributions

Figure 2 for Efficient Low-Order Approximation of First-Passage Time Distributions

Abstract:We consider the problem of computing first-passage time distributions for reaction processes modelled by master equations. We show that this generally intractable class of problems is equivalent to a sequential Bayesian inference problem for an auxiliary observation process. The solution can be approximated efficiently by solving a closed set of coupled ordinary differential equations (for the low-order moments of the process) whose size scales with the number of species. We apply it to an epidemic model and a trimerisation process, and show good agreement with stochastic simulations.

* Phys. Rev. Lett. 119, 210601 (2017)
* 5 pages, 3 figures

Via

Access Paper or Ask Questions

Expectation propagation for continuous time stochastic processes

Jun 28, 2016

Botond Cseke, David Schnoerr, Manfred Opper, Guido Sanguinetti

Figure 1 for Expectation propagation for continuous time stochastic processes

Figure 2 for Expectation propagation for continuous time stochastic processes

Figure 3 for Expectation propagation for continuous time stochastic processes

Abstract:We consider the inverse problem of reconstructing the posterior measure over the trajec- tories of a diffusion process from discrete time observations and continuous time constraints. We cast the problem in a Bayesian framework and derive approximations to the posterior distributions of single time marginals using variational approximate inference. We then show how the approximation can be extended to a wide class of discrete-state Markov jump pro- cesses by making use of the chemical Langevin equation. Our empirical results show that the proposed method is computationally efficient and provides good approximations for these classes of inverse problems.

Via

Access Paper or Ask Questions

f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization

Jun 02, 2016

Sebastian Nowozin, Botond Cseke, Ryota Tomioka

Figure 1 for f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization

Figure 2 for f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization

Figure 3 for f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization

Figure 4 for f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization

Abstract:Generative neural samplers are probabilistic models that implement sampling using feedforward neural networks: they take a random input vector and produce a sample from a probability distribution defined by the network weights. These models are expressive and allow efficient computation of samples and derivatives, but cannot be used for computing likelihoods or for marginalization. The generative-adversarial training method allows to train such models through the use of an auxiliary discriminative neural network. We show that the generative-adversarial approach is a special case of an existing more general variational divergence estimation approach. We show that any f-divergence can be used for training generative neural samplers. We discuss the benefits of various choices of divergence functions on training complexity and the quality of the obtained generative models.

* 17 pages

Via

Access Paper or Ask Questions

Sparse Approximate Inference for Spatio-Temporal Point Process Models

Jul 06, 2015

Botond Cseke, Andrew Zammit Mangion, Tom Heskes, Guido Sanguinetti

Figure 1 for Sparse Approximate Inference for Spatio-Temporal Point Process Models

Figure 2 for Sparse Approximate Inference for Spatio-Temporal Point Process Models

Figure 3 for Sparse Approximate Inference for Spatio-Temporal Point Process Models

Figure 4 for Sparse Approximate Inference for Spatio-Temporal Point Process Models

Abstract:Spatio-temporal point process models play a central role in the analysis of spatially distributed systems in several disciplines. Yet, scalable inference remains computa- tionally challenging both due to the high resolution modelling generally required and the analytically intractable likelihood function. Here, we exploit the sparsity structure typical of (spatially) discretised log-Gaussian Cox process models by using approximate message-passing algorithms. The proposed algorithms scale well with the state dimension and the length of the temporal horizon with moderate loss in distributional accuracy. They hence provide a flexible and faster alternative to both non-linear filtering-smoothing type algorithms and to approaches that implement the Laplace method or expectation propagation on (block) sparse latent Gaussian models. We infer the parameters of the latent Gaussian model using a structured variational Bayes approach. We demonstrate the proposed framework on simulation studies with both Gaussian and point-process observations and use it to reconstruct the conflict intensity and dynamics in Afghanistan from the WikiLeaks Afghan War Diary.

Via

Access Paper or Ask Questions

Properties of Bethe Free Energies and Message Passing in Gaussian Models

Jan 16, 2014

Botond Cseke, Tom Heskes

Figure 1 for Properties of Bethe Free Energies and Message Passing in Gaussian Models

Figure 2 for Properties of Bethe Free Energies and Message Passing in Gaussian Models

Figure 3 for Properties of Bethe Free Energies and Message Passing in Gaussian Models

Abstract:We address the problem of computing approximate marginals in Gaussian probabilistic models by using mean field and fractional Bethe approximations. We define the Gaussian fractional Bethe free energy in terms of the moment parameters of the approximate marginals, derive a lower and an upper bound on the fractional Bethe free energy and establish a necessary condition for the lower bound to be bounded from below. It turns out that the condition is identical to the pairwise normalizability condition, which is known to be a sufficient condition for the convergence of the message passing algorithm. We show that stable fixed points of the Gaussian message passing algorithm are local minima of the Gaussian Bethe free energy. By a counterexample, we disprove the conjecture stating that the unboundedness of the free energy implies the divergence of the message passing algorithm.

* Journal Of Artificial Intelligence Research, Volume 41, pages 1-24, 2011

Via

Access Paper or Ask Questions