Solving Non-Markovian Control Tasks With Neuroevolution

Solving Non-Markovian Control Tasks With Neuroevolution (1999)

Faustino J. Gomez and Risto Miikkulainen

The success of evolutionary methods on standard control learning tasks has created a need for new benchmarks. The classic pole balancing problem is no longer difficult enough to serve as a viable yardstick for measuring the learning efficiency of these systems. The double pole case, where two poles connected to the cart must be balanced simultaneously is much more difficult, especially when velocity information is not available. In this article, we demonstrate a neuroevolution system, Enforced Sub-populations (ESP), that is used to evolve a controller for the standard double pole task and a much harder, non-Markovian version. In both cases, our results show that ESP is faster than other neuroevolution methods. In addition, we introduce an incremental method that evolves on a sequence of tasks, and utilizes a local search technique (Delta-Coding) to sustain diversity. This method enables the system to solve even more difficult versions of the task where direct evolution cannot.

View:

PDF, PS

Citation:

In Proceedings of the International Joint Conference on Artificial Intelligence, 1356-1361, San Francisco, CA, 1999. Kaufmann.

Bibtex:

People

Faustino Gomez	Postdoctoral Alumni	tino [at] idsia ch
Risto Miikkulainen	Faculty	risto [at] cs utexas edu

Demos

Double Pole Balancing with ESP

Faustino Gomez

1999

Software/Data

ESP C++	The ESP package contains the source code for the Enforced Sup-Populations system written in C++. ESP is an extension t...	2000

Areas of Interest

Evolutionary Computation Neuroevolution Reinforcement Learning