Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jonathan Rowe

Geiringer Theorems: From Population Genetics to Computational Intelligence, Memory Evolutive Systems and Hebbian Learning

May 11, 2013

Boris Mitavskiy, Elio Tuci, Chris Cannings, Jonathan Rowe, Jun He

Figure 1 for Geiringer Theorems: From Population Genetics to Computational Intelligence, Memory Evolutive Systems and Hebbian Learning

Figure 2 for Geiringer Theorems: From Population Genetics to Computational Intelligence, Memory Evolutive Systems and Hebbian Learning

Figure 3 for Geiringer Theorems: From Population Genetics to Computational Intelligence, Memory Evolutive Systems and Hebbian Learning

Figure 4 for Geiringer Theorems: From Population Genetics to Computational Intelligence, Memory Evolutive Systems and Hebbian Learning

Abstract:The classical Geiringer theorem addresses the limiting frequency of occurrence of various alleles after repeated application of crossover. It has been adopted to the setting of evolutionary algorithms and, a lot more recently, reinforcement learning and Monte-Carlo tree search methodology to cope with a rather challenging question of action evaluation at the chance nodes. The theorem motivates novel dynamic parallel algorithms that are explicitly described in the current paper for the first time. The algorithms involve independent agents traversing a dynamically constructed directed graph that possibly has loops. A rather elegant and profound category-theoretic model of cognition in biological neural networks developed by a well-known French mathematician, professor Andree Ehresmann jointly with a neurosurgeon, Jan Paul Vanbremeersch over the last thirty years provides a hint at the connection between such algorithms and Hebbian learning.

* Natural Computing, Volume 12, Issue 4 , pp 473-484, 2013
* arXiv admin note: text overlap with arXiv:1110.4657

Via

Access Paper or Ask Questions

A Version of Geiringer-like Theorem for Decision Making in the Environments with Randomness and Incomplete Information

Oct 20, 2011

Boris Mitavskiy, Jonathan Rowe, Chris Cannings

Figure 1 for A Version of Geiringer-like Theorem for Decision Making in the Environments with Randomness and Incomplete Information

Figure 2 for A Version of Geiringer-like Theorem for Decision Making in the Environments with Randomness and Incomplete Information

Figure 3 for A Version of Geiringer-like Theorem for Decision Making in the Environments with Randomness and Incomplete Information

Figure 4 for A Version of Geiringer-like Theorem for Decision Making in the Environments with Randomness and Incomplete Information

Abstract:Purpose: In recent years Monte-Carlo sampling methods, such as Monte Carlo tree search, have achieved tremendous success in model free reinforcement learning. A combination of the so called upper confidence bounds policy to preserve the "exploration vs. exploitation" balance to select actions for sample evaluations together with massive computing power to store and to update dynamically a rather large pre-evaluated game tree lead to the development of software that has beaten the top human player in the game of Go on a 9 by 9 board. Much effort in the current research is devoted to widening the range of applicability of the Monte-Carlo sampling methodology to partially observable Markov decision processes with non-immediate payoffs. The main challenge introduced by randomness and incomplete information is to deal with the action evaluation at the chance nodes due to drastic differences in the possible payoffs the same action could lead to. The aim of this article is to establish a version of a theorem that originated from population genetics and has been later adopted in evolutionary computation theory that will lead to novel Monte-Carlo sampling algorithms that provably increase the AI potential. Due to space limitations the actual algorithms themselves will be presented in the sequel papers, however, the current paper provides a solid mathematical foundation for the development of such algorithms and explains why they are so promising.

* 53 pages in size. This work has been recently submitted to the IJICC (International Journal on Intelligent Computing and Cybernetics)

Via

Access Paper or Ask Questions