NNRG Demos - Teaching an Agent Manually via Evaluative Reinforcement (TAMER)

Teaching an Agent Manually via Evaluative Reinforcement (TAMER) (2009)

Author: W. Bradley Knox and Peter Stone

Videos of a TAMER agent being trained by a human teacher giving positive and negative feedback signals.

W. Bradley Knox		bradknox [at] mit edu
Peter Stone		pstone [at] cs utexas edu

Projects

Teaching an Agent Manually via Evaluative Reinforcement (TAMER)

Since 2008

Publications

Combining Manual Feedback with Subsequent MDP Reward Signals for Reinforcement Learning	W. Bradley Knox and Peter Stone	In Proc. of 9th Int. Conf. on Autonomous Agents and Multiagent Systems (AAMAS 2010), May 2010...	2010

Interactively Shaping Agents via Human Reinforcement: The TAMER Framework	W. Bradley Knox and Peter Stone	In The Fifth International Conference on Knowledge Capture, September 2009.	2009

Design Principles for Creating Human-Shapable Agents	W. Bradley Knox and Ian Fasel and Peter Stone	In AAAI Spring 2009 Symposium on Agents that Learn from Human Teachers, March 2009.	2009

Reinforcement Learning from Human Reward: Discounting in Episodic Tasks	W. Bradley Knox and Peter Stone	In In Proceedings of the 21st IEEE International Symposium on Robot and Human Interactive Communi...	2012

Reinforcement Learning with Human and MDP Reward	W. Bradley Knox and Peter Stone	In Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems (...	2012

Learning from feedback on actions past and intended	W. Bradley Knox and Cynthia Breazeal and Peter Stone	In In Proceedings of 7th ACM/IEEE International Conference on Human-Robot Interaction, Late-Break...	2012

Understanding Human Teaching Modalities in Reinforcement Learning Environments: A Preliminary Report	W. Bradley Knox and Peter Stone	In IJCAI 2011 Workshop on Agents Learning Interactively from Human Teachers (ALIHT), July 201...	2011

How Humans Teach Agents: A New Experimental Perspective	W. Bradley Knox and Brian D. Glass and Bradley C. Love and W. Todd Maddox and Peter Stone	International Journal of Social Robotics, 4:409-421, 2012. Springer Netherlands.	2012

Learning from Human-Generated Reward	W. Bradley Knox	%RefShort%	2012

Learning Non-Myopically from Human-Generated Reward	W. Bradley Knox and Peter Stone	In In Proceedings of the International Conference on Intelligent User Interfaces (IUI), March...	2013

Areas of Interest

Social Agents Transfer Learning Reinforcement Learning