Learning Sequential Decision Tasks (1995)
This paper presents a new approach called SANE for learning and performing sequential decision tasks. Compared to problem-general heuristics, SANE forms more effective decision strategies because it learns to utilize domain-specific information. SANE evolves neural networks through genetic algorithms and can learn in a wide range of domains with minimal reinforcement. SANE's evolution algorithm, called symbiotic evolution, is more powerful than standard genetic algorithms because diversity pressures are inherent in the evolution. SANE is shown to be effective in two sequential decision tasks. As a value-ordering method in constraint satisfaction search, SANE required only 1/3 of the backtracks of a problem-general heuristic. As a filter for minimax search, SANE formed a network capable of focusing the search away from misinformation, creating stronger play.
View:
PDF, PS
Citation:
Technical Report AI95-229, Department of Computer Sciences, The University of Texas at Austin, 1995.
Bibtex:

Risto Miikkulainen Faculty risto [at] cs utexas edu
David E. Moriarty Ph.D. Alumni moriarty [at] alumni utexas net