Abstract: Trajectory Optimization (TO) and Reinforcement Learning (RL) are powerful and complementary tools to solve optimal control problems. On the one hand, TO can efficiently compute locally-optimal solutions, but it tends to get stuck in local minima if the problem is not convex. On the other hand, RL is typically less sensitive to non-convexity, but it requires a much higher computational effort. Recently, we have proposed CACTO (Continuous Actor-Critic with Trajectory Optimization), an algorithm that uses TO to guide the exploration of an actor-critic RL algorithm. In turn, the policy encoded by the actor is used to warm-start TO, closing the loop between TO and RL. In this work, we present CACTO-SL, an extension of CACTO that exploits the idea of Sobolev learning. To make the training of the critic network faster and more data efficient, we enrich it with the gradient of the Value function, computed via a backward pass of the Differential Dynamic Programming (DDP) algorithm. Our results show that the new algorithm is more efficient than the original CACTO, reducing the number of TO episodes by a factor of 3 to 10 and, consequently, the computation time. Moreover, we show that CACTO-SL helps TO to find better minima and to produce more consistent results.
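To make the Sobolev-learning idea concrete, the following is a minimal sketch of a critic loss that fits both Value targets and Value-gradient targets. It assumes PyTorch; the function name `sobolev_critic_loss`, the weight `w_grad`, and the tensor layout are illustrative choices, not the authors' implementation, and the gradient targets `dvdx_targets` stand in for the output of the DDP backward pass described in the abstract.

```python
# Sketch of a Sobolev-style critic loss (assumed PyTorch; names are
# illustrative, not the paper's code). The critic is trained to match
# both the Value targets and their gradients w.r.t. the state, with the
# gradient targets assumed to come from a DDP backward pass.
import torch

def sobolev_critic_loss(critic, states, v_targets, dvdx_targets, w_grad=1.0):
    states = states.requires_grad_(True)
    v_pred = critic(states).squeeze(-1)          # predicted Value V(x)
    # Gradient of the predicted Value w.r.t. the input states, kept in
    # the graph so the gradient-matching term is itself differentiable.
    dvdx_pred, = torch.autograd.grad(v_pred.sum(), states, create_graph=True)
    value_loss = torch.nn.functional.mse_loss(v_pred, v_targets)
    grad_loss = torch.nn.functional.mse_loss(dvdx_pred, dvdx_targets)
    return value_loss + w_grad * grad_loss
```

Fitting the gradient alongside the Value gives the critic many more supervision signals per TO episode (one scalar plus one vector per state), which is consistent with the reported 3- to 10-fold reduction in the number of episodes.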
Abstract: Generating accurate and efficient predictions of the motion of the humans present in the scene is key to the development of effective motion planning algorithms for robots moving in spaces shared with people, where wrong planning decisions could create safety hazards or simply make the presence of the robot "socially" unacceptable. Our approach to predicting human motion is based on a particular kind of neural network. Unlike conventional deep neural networks, our network embeds in its structure the popular Social Force Model, a dynamic equation describing the motion in physical terms. This choice allows us to concentrate the learning phase on the aspects that are actually unknown (i.e., the model's parameters) and to keep the structure of the network simple and manageable. As a result, we obtain good prediction accuracy with a small, synthetically generated training set, and the accuracy remains acceptable even when the network is applied in scenarios quite different from those for which it was trained. Finally, the predictions of the network are "explainable", as they can be interpreted in physical terms. Comparative and experimental results demonstrate the effectiveness of the proposed approach.
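As an illustration of how the Social Force Model can be embedded in a network so that only its physical parameters are learned, here is a minimal sketch. It assumes PyTorch; the class name `SocialForceCell`, the parameter values, and the Euler integration step are hypothetical, while the parameters tau (relaxation time), A (repulsion strength), and B (repulsion range) follow the standard Helbing formulation of the model, not necessarily the exact structure used in the paper.

```python
# Sketch of a Social Force Model step with learnable physical parameters
# (assumed PyTorch; illustrative, not the paper's architecture). The
# dynamics are fixed; only tau, A, B are trained.
import torch
import torch.nn as nn

class SocialForceCell(nn.Module):
    def __init__(self):
        super().__init__()
        self.tau = nn.Parameter(torch.tensor(0.5))  # relaxation time [s]
        self.A = nn.Parameter(torch.tensor(2.0))    # repulsion strength
        self.B = nn.Parameter(torch.tensor(0.3))    # repulsion range [m]

    def forward(self, pos, vel, goal_vel, others, dt=0.1):
        # Attractive term: relax the velocity towards the desired one.
        f_goal = (goal_vel - vel) / self.tau
        # Repulsive forces from other agents, exponential in distance.
        diff = pos.unsqueeze(1) - others             # (N, M, 2)
        dist = diff.norm(dim=-1, keepdim=True).clamp_min(1e-6)
        f_rep = (self.A * torch.exp(-dist / self.B) * diff / dist).sum(1)
        # One explicit Euler integration step of the SFM dynamics.
        new_vel = vel + (f_goal + f_rep) * dt
        new_pos = pos + new_vel * dt
        return new_pos, new_vel
```

Because the learned quantities are physical parameters rather than opaque weights, a trained instance remains interpretable: tau, A, and B can be read off directly and sanity-checked against plausible pedestrian behavior, which is the "explainability" the abstract refers to.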