Abstract: Reinforcement learning is a popular machine learning paradigm that can find near-optimal solutions to complex problems. Most often, these methods involve function approximation using neural networks, with gradient-based updates to optimise the weights for the problem being considered. While this common approach generally works well, other update mechanisms remain largely unexplored in reinforcement learning. One such mechanism is the Extreme Learning Machine (ELM). ELMs were initially proposed to drastically improve the training speed of neural networks and have since seen many applications. Here we attempt to apply extreme learning machines to a reinforcement learning problem in the same manner as gradient-based updates; we call the resulting algorithm the Extreme Q-Learning Machine (EQLM). We compare its performance to that of a typical Q-Network on the cart-pole task, a benchmark reinforcement learning problem, and show that EQLM has similar long-term learning performance to a Q-Network.
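The abstract does not spell out EQLM's update rule, but the core idea of an Extreme Learning Machine is to keep the hidden-layer weights fixed and random and fit only the output weights in closed form. A minimal sketch of how such an update could replace gradient descent in a Q-network is given below; the network sizes, the ridge regularisation, and the batch construction are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

# Illustrative ELM-style Q-function update (not the paper's exact EQLM algorithm).
# Hidden-layer weights are random and fixed; only the output weights `beta`
# are refit, in closed form, against batched Q-learning targets.

rng = np.random.default_rng(0)
n_features, n_hidden, n_actions = 4, 50, 2      # cart-pole-like sizes (assumed)
W = rng.normal(size=(n_features, n_hidden))     # fixed random input weights
b = rng.normal(size=n_hidden)                   # fixed random biases
beta = np.zeros((n_hidden, n_actions))          # trainable output weights
gamma, reg = 0.99, 1e-2                         # discount factor and ridge term

def hidden(s):
    return np.tanh(s @ W + b)                   # random-feature activations

def q_values(s):
    return hidden(s) @ beta

def elm_update(states, actions, rewards, next_states, dones):
    """Refit output weights by ridge regression on Q-learning targets."""
    H = hidden(states)                                   # (batch, n_hidden)
    T = q_values(states).copy()                          # keep non-taken actions unchanged
    target = rewards + gamma * (1 - dones) * q_values(next_states).max(axis=1)
    T[np.arange(len(actions)), actions] = target
    # Closed-form solution: beta = (H^T H + reg * I)^-1 H^T T
    A = H.T @ H + reg * np.eye(n_hidden)
    return np.linalg.solve(A, H.T @ T)

# Example with a random batch of transitions
batch = 32
states = rng.normal(size=(batch, n_features))
next_states = rng.normal(size=(batch, n_features))
actions = rng.integers(0, n_actions, size=batch)
rewards = rng.normal(size=batch)
dones = rng.integers(0, 2, size=batch).astype(float)
beta = elm_update(states, actions, rewards, next_states, dones)
```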
Abstract: This paper presents some ideas to reduce the computational cost of evidence-based robust design optimization. Evidence Theory models both the aleatory and epistemic uncertainties in the design parameters, providing two quantitative measures, Belief and Plausibility, of the credibility of the computed value of the design budgets. The paper proposes techniques to compute an approximation of Belief and Plausibility at a cost that is a fraction of that required for an accurate calculation of the two values. Some simple test cases show how the proposed techniques scale with the dimension of the problem. Finally, a simple example of spacecraft system design is presented.
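For context, the standard Dempster-Shafer computation of Belief and Plausibility sums the basic probability assignment over focal elements that, respectively, entirely or partially satisfy the requirement on the design budget; its cost grows with the number of focal elements, which is exponential in the number of uncertain parameters. The sketch below illustrates this brute-force calculation that the paper's approximations aim to avoid; the BPA structures, the sample budget function, and the sampling-based box optimisation are illustrative assumptions only.

```python
import itertools
import numpy as np

# Brute-force Belief/Plausibility of the event {f(u) <= nu} with interval
# focal elements (standard Evidence Theory; not the paper's approximation).

# Each uncertain parameter has a BPA: a list of (interval, mass) pairs.
bpa = [
    [((0.0, 0.5), 0.6), ((0.5, 1.0), 0.4)],   # parameter u1
    [((0.0, 0.3), 0.7), ((0.3, 1.0), 0.3)],   # parameter u2
]

def f(u):                       # example design budget (e.g. mass or power)
    return u[0] ** 2 + u[1]

def belief_plausibility(f, bpa, nu, n_samples=200, seed=0):
    """Bel and Pl of {f(u) <= nu}, enumerating every focal element."""
    rng = np.random.default_rng(seed)
    bel = pl = 0.0
    # Focal elements are Cartesian products of the per-parameter intervals.
    for combo in itertools.product(*bpa):
        boxes, masses = zip(*combo)
        mass = np.prod(masses)
        lo = np.array([b[0] for b in boxes])
        hi = np.array([b[1] for b in boxes])
        # Approximate min/max of f over the box by sampling; an accurate
        # calculation would run an optimiser per focal element, which is
        # exactly the cost the paper seeks to reduce.
        u = rng.uniform(lo, hi, size=(n_samples, len(boxes)))
        vals = np.array([f(ui) for ui in u])
        if vals.max() <= nu:    # the whole box satisfies the requirement -> Belief
            bel += mass
        if vals.min() <= nu:    # some point in the box satisfies it -> Plausibility
            pl += mass
    return bel, pl

print(belief_plausibility(f, bpa, nu=0.8))
```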
Abstract: In this paper we define a discrete dynamical system that governs the evolution of a population of agents. From the dynamical system, a variant of Differential Evolution is derived. It is then demonstrated that, under some assumptions on the differential mutation strategy and on the local structure of the objective function, the proposed dynamical system has fixed points towards which it converges with probability one as the number of generations tends to infinity. This property is used to derive an algorithm that performs better than standard Differential Evolution on some space trajectory optimization problems. The novel algorithm is then extended with a guided restart procedure that further improves performance by reducing the probability of stagnation in deceptive local minima.
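As a point of reference for the variant discussed above, one generation of standard Differential Evolution (DE/rand/1 mutation with binomial crossover and greedy selection) is sketched below; the control parameters F and CR, the sphere objective, and the population size are illustrative choices, and neither the paper's modified dynamics nor its guided restart procedure is reproduced here.

```python
import numpy as np

def objective(x):
    # simple sphere function used only for illustration
    return np.sum(x ** 2)

def de_step(pop, fitness, F=0.8, CR=0.9, rng=None):
    """One generation of standard DE/rand/1/bin over the whole population."""
    if rng is None:
        rng = np.random.default_rng()
    n_pop, dim = pop.shape
    new_pop, new_fit = pop.copy(), fitness.copy()
    for i in range(n_pop):
        # pick three distinct individuals, all different from i
        choices = [j for j in range(n_pop) if j != i]
        r1, r2, r3 = rng.choice(choices, size=3, replace=False)
        mutant = pop[r1] + F * (pop[r2] - pop[r3])      # differential mutation
        # binomial crossover, forcing at least one mutant component
        mask = rng.random(dim) < CR
        mask[rng.integers(dim)] = True
        trial = np.where(mask, mutant, pop[i])
        f_trial = objective(trial)
        if f_trial <= fitness[i]:                        # greedy selection
            new_pop[i], new_fit[i] = trial, f_trial
    return new_pop, new_fit

rng = np.random.default_rng(1)
pop = rng.uniform(-5, 5, size=(20, 10))
fitness = np.array([objective(x) for x in pop])
for _ in range(100):
    pop, fitness = de_step(pop, fitness, rng=rng)
print(fitness.min())
```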