Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Pierre-Loïc Garoche

ENAC

Set-based value operators for non-stationary Markovian environments

Jul 15, 2022

Sarah H. Q. Li, Assalé Adjé, Pierre-Loïc Garoche, Behçet Açıkmeşe

Figure 1 for Set-based value operators for non-stationary Markovian environments

Figure 2 for Set-based value operators for non-stationary Markovian environments

Figure 3 for Set-based value operators for non-stationary Markovian environments

Figure 4 for Set-based value operators for non-stationary Markovian environments

Abstract:This paper analyzes finite state Markov Decision Processes (MDPs) with uncertain parameters in compact sets and re-examines results from robust MDP via set-based fixed point theory. We generalize the Bellman and policy evaluation operators to operators that contract on the space of value functions and denote them as \emph{value operators}. We generalize these value operators to act on the space of value function sets and denote them as \emph{set-based value operators}. We prove that these set-based value operators are contractions in the space of compact value function sets. Leveraging insights from set theory, we generalize the rectangularity condition for the Bellman operator from classic robust MDP literature to a \emph{containment condition} for a generic value operator, which is weaker and can be applied to a larger set of parameter-uncertain MDPs and contractive operators in dynamic programming and reinforcement learning. We prove that both the rectangularity condition and the containment condition sufficiently ensure that the set-based value operator's fixed point set contains its own supremum and infimum elements. For convex and compact sets of uncertain MDP parameters, we show equivalence between the classic robust value function and the supremum of the fixed point set of the set-based Bellman operator. Under dynamically changing MDP parameters in compact sets, we prove a set convergence result for value iteration, which otherwise may not converge to a single value function.

* 14 pages, 11 figures, 1 table

Via

Access Paper or Ask Questions

An Efficient Summation Algorithm for the Accuracy, Convergence and Reproducibility of Parallel Numerical Methods

May 11, 2022

Farah Benmouhoub, Pierre-Loïc Garoche, Matthieu Martel

Abstract:Nowadays, parallel computing is ubiquitous in several application fields, both in engineering and science. The computations rely on the floating-point arithmetic specified by the IEEE754 Standard. In this context, an elementary brick of computation, used everywhere, is the sum of a sequence of numbers. This sum is subject to many numerical errors in floating-point arithmetic. To alleviate this issue, we have introduced a new parallel algorithm for summing a sequence of floating-point numbers. This algorithm which scales up easily with the number of processors, adds numbers of the same exponents first. In this article, our main contribution is an extensive analysis of its efficiency with respect to several properties: accuracy, convergence and reproducibility. In order to show the usefulness of our algorithm, we have chosen a set of representative numerical methods which are Simpson, Jacobi, LU factorization and the Iterated power method.

* Numerical Software Verification, Jul 2021, Los Angeles, United States. pp.165-181

Via

Access Paper or Ask Questions

Verification and Validation of Convex Optimization Algorithms for Model Predictive Control

May 26, 2020

Raphaël Cohen, Eric Féron, Pierre-Loïc Garoche

Figure 1 for Verification and Validation of Convex Optimization Algorithms for Model Predictive Control

Figure 2 for Verification and Validation of Convex Optimization Algorithms for Model Predictive Control

Figure 3 for Verification and Validation of Convex Optimization Algorithms for Model Predictive Control

Figure 4 for Verification and Validation of Convex Optimization Algorithms for Model Predictive Control

Abstract:Advanced embedded algorithms are growing in complexity and they are an essential contributor to the growth of autonomy in many areas. However, the promise held by these algorithms cannot be kept without proper attention to the considerably stronger design constraints that arise when the applications of interest, such as aerospace systems, are safety-critical. Formal verification is the process of proving or disproving the ''correctness'' of an algorithm with respect to a certain mathematical description of it by means of a computer. This article discusses the formal verification of the Ellipsoid method, a convex optimization algorithm, and its code implementation as it applies to receding horizon control. Options for encoding code properties and their proofs are detailed. The applicability and limitations of those code properties and proofs are presented as well. Finally, floating-point errors are taken into account in a numerical analysis of the Ellipsoid algorithm. Modifications to the algorithm are presented which can be used to control its numerical stability.

* Journal of Aerospace Information Systems, American Institute of Aeronautics and Astronautics, 2020, 17 (5), pp.257-270

Via

Access Paper or Ask Questions