Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Zhiping Dan

Tournament selection in zeroth-level classifier systems based on average reward reinforcement learning

Apr 26, 2016

Zhaoxiang Zang, Zhao Li, Junying Wang, Zhiping Dan

Figure 1 for Tournament selection in zeroth-level classifier systems based on average reward reinforcement learning

Figure 2 for Tournament selection in zeroth-level classifier systems based on average reward reinforcement learning

Figure 3 for Tournament selection in zeroth-level classifier systems based on average reward reinforcement learning

Abstract:As a genetics-based machine learning technique, zeroth-level classifier system (ZCS) is based on a discounted reward reinforcement learning algorithm, bucket-brigade algorithm, which optimizes the discounted total reward received by an agent but is not suitable for all multi-step problems, especially large-size ones. There are some undiscounted reinforcement learning methods available, such as R-learning, which optimize the average reward per time step. In this paper, R-learning is used as the reinforcement learning employed by ZCS, to replace its discounted reward reinforcement learning approach, and tournament selection is used to replace roulette wheel selection in ZCS. The modification results in classifier systems that can support long action chains, and thus is able to solve large multi-step problems.

* 14 pages, 3 figures

Via

Access Paper or Ask Questions