Peon Demos (from about year 2000)
This page is based on research in Real-Time Interactive Neuro-Evolution by Adrian Agogino and Kenneth Stanley
Here is a link to a program for viewing .avi movies on unix: XAnim Home Page (LINK EXPIRED)
NEW: Press here to try an interactive Java demo with Peons (LINK EXPIRED)
What is this?
The following demos depict real-time neuroevolution in action.
They are the result of research in the area of real-time
interactive neuro-evolution, which basically means evolving
brains in a live-action scenario, where a user can interact with them.
The movie clips are actual captures of the evolution occuring,
offering an unusually clear glimpse at the process.
If you find this interesting, take a look at
our paper.
Note that the actual animation of the peons is smoother
in the program than in the movie clips. This allows them to
be more compact.
What is going on in the movies?
The demos actually depict a simple game.
There are a bunch of small dots, called peons.
(Note that occasioanlly we see a larger peon, shaped as a square)
There is a larger, red dot, called "the enemy."
There are one or two large yellow circles, called "goldmines."
There is a square in the middle, called "the base."
The peons always leave from the base. If they don't travel to
one of the goldmines in an alotted amount of time, they die.
Also, if the enemy touches a peon, it dies. When a successful
peon makes it to a mine, it is transported back to the base to try
again. Thus, a good peon can dodge the enemy and find a mine in a
short timespan.
Each peon has a neural network brain, which is a simple feed-forward
network. The input to the network is visual information about where
the enemy is, and where the mines are. The output is a movement decision.
However, keep in mind that the brains are initially completely random
in their connections, so peons initially do not know anything about
their environment, or how to interpret their sensors.
As the game progresses, peons are given a fitness rating. In the
movies below, the highest ranking peon at any given time is
represented as a large box, so that they stand out. High ranking peons
are allowed to reproduce to replace lower ranking peons that have
died. When peons reproduce, mutations occur. Hence, we
get evolution.
Here are the peon movies:
Untrained Peons
The population is composed entirely of peons with randomly connected
brains, so we don't expect to see anything intelligent. Not surprisingly,
we see a lot of peons scattering in random directions. This is how an
initial population looks before any adaptation occurs.
Naive Peons
In only a matter of seconds, the peon population has learned to seek
out the nearest mine. However, this is a naive strategy- the enemy
never moved while they found this technique, and therefore they
are not prepared to react appropriately when the enemy finally attacks.
This illustrates the tendency of evolution to produce populations
that are deceptively simplistic, yet still successful.
Peons Devastated
Suddenly, the enemy attacks the naive population from the above clip.
The population has not learned to deal with a moving enemy, and therefore
it is quickly devastated. You can actually see the population dying out
here as its favorite strategy is completely outwitted.
Peons Employ New Strategy
In only 20 or 30 seconds, the peons devise a new strategy which beats the enemy's strategy. (Note that the enemy's strategy is fixed in this example.)
This illustrates how flexible online adaptation is, and how it is possible in
real-time. The strategy employed appears to confuse the enemy, rendering it
harmless.
Note: The following two demos are with a new peon population starting
from
scratch
Orbital Training an Untrained Set
Once again we are observing an untrained population. However, notice
that the mine and the enemy are both "orbiting" the peons' home base.
This is a training scenario we developed to obtain a population that
could not rely on the destination being in any one location, yet still
had to avoid the enemy. This is a complicated prospect for the untrained
peons. Notice near the end of the movie where one aimless peon accidentally
wanders into the passing mine. This peon will now be honored with higher
selectivity in the future. Interestingly, its strategy is most likely
almost useless, since it just knows to wander in a certain direction,
which won't work with an orbiting mine. The situation will tend to select
for peons that are a little sensitive to the location to of the mine,
and eventually strengthen that sensitivity into a following instinct.
Peons after Orbital Training
The peons in this clip have developed a very sophisticated and reactive
neural network to deal with a very quickly changing situation.
Please compare this clip to the previous one to see how far they have
come. Attaining this level of intelligence took a few minutes of training
with exactly the right parameters for fitness selection. (If the parameters
are changed, the peons are not able to attain this level.) Notice the
elegant balance between enemy avoidance and mine following that the
peons display. Please note that the mine stops midway through the clip.
This is part of the confusion the peons deal with- the mine is set to
stop moving at random times and then start up again later.
Contact authors here:
kstanley@cs.utexas.edu
agogino@ece.utexas.edu