
Prev Next Watch them adapt: We now get a mixture of behaviors based on current and previous reward settings: approach the enemy, but hang back from it a little and avoid crowding one another too much. (The previous rewards are relevant because the behavior they encouraged tends to be retained, so long as more recent training does not conflict with it.) |