Learning Sequential Decision Tasks

Learning Sequential Decision Tasks (1995)

David E. Moriarty and Risto Miikkulainen

This paper presents a new approach called SANE for learning and performing sequential decision tasks. Compared to problem-general heuristics, SANE forms more effective decision strategies because it learns to utilize domain-specific information. SANE evolves neural networks through genetic algorithms and can learn in a wide range of domains with minimal reinforcement. SANE's evolution algorithm, called symbiotic evolution, is more powerful than standard genetic algorithms because diversity pressures are inherent in the evolution. SANE is shown to be effective in two sequential decision tasks. As a value-ordering method in constraint satisfaction search, SANE required only 1/3 of the backtracks of a problem-general heuristic. As a filter for minimax search, SANE formed a network capable of focusing the search away from misinformation, creating stronger play.

View:

PDF, PS

Citation:

Technical Report AI95-229, Department of Computer Sciences, The University of Texas at Austin, 1995.

Bibtex:

People

Risto Miikkulainen	Faculty	risto [at] cs utexas edu
David E. Moriarty	Ph.D. Alumni	moriarty [at] alumni utexas net

Areas of Interest

Evolutionary Computation Neuroevolution Reinforcement Learning Game Playing Applications