neural networks research group
areas
people
projects
demos
publications
software/data
Solving Non-Markovian Control Tasks With Neuroevolution (1999)
Faustino J. Gomez
and
Risto Miikkulainen
The success of evolutionary methods on standard control learning tasks has created a need for new benchmarks. The classic pole balancing problem is no longer difficult enough to serve as a viable yardstick for measuring the learning efficiency of these systems. The double pole case, where two poles connected to the cart must be balanced simultaneously is much more difficult, especially when velocity information is not available. In this article, we demonstrate a neuroevolution system, Enforced Sub-populations (ESP), that is used to evolve a controller for the standard double pole task and a much harder, non-Markovian version. In both cases, our results show that ESP is faster than other neuroevolution methods. In addition, we introduce an incremental method that evolves on a sequence of tasks, and utilizes a local search technique (Delta-Coding) to sustain diversity. This method enables the system to solve even more difficult versions of the task where direct evolution cannot.
View:
PDF
,
PS
Citation:
In
Proceedings of the International Joint Conference on Artificial Intelligence
, 1356-1361, San Francisco, CA, 1999. Kaufmann.
Bibtex:
@inproceedings{gomez:ijcai99, title={Solving Non-Markovian Control Tasks With Neuroevolution}, author={Faustino J. Gomez and Risto Miikkulainen}, booktitle={Proceedings of the International Joint Conference on Artificial Intelligence}, address={San Francisco, CA}, publisher={Kaufmann}, pages={1356-1361}, url="http://nn.cs.utexas.edu/?gomez:ijcai99", year={1999} }
People
Faustino Gomez
Postdoctoral Alumni
tino [at] idsia ch
Risto Miikkulainen
Faculty
risto [at] cs utexas edu
Demos
Double Pole Balancing with ESP
Faustino Gomez
1999
Software/Data
ESP C++
The ESP package contains the source code for the Enforced Sup-Populations system written in C++. ESP is an extension t...
2000
Areas of Interest
Evolutionary Computation
Neuroevolution
Reinforcement Learning