Shivaram Kalyanakrishnan
Shivaram is interested in practical applications of reinforcement learning. In particular, he has used robot soccer as a test domain for much of his research, which has resulted in two Best Student Paper awards at RoboCup. During an internship at the Honda Research Institute, Shivaram applied machine learning to the problem of humanoid fall prediction. His other interests include history, literature, cricket, geography, and theatre.
     [Expand to show all 16][Minimize]
Half Field Offense: An Environment for Multiagent Learning and Ad Hoc Teamwork Matthew Hausknecht and Prannoy Mupparaju and Sandeep Subramanian and Shivaram Kalyanakrishnan and Pe... In AAMAS Adaptive Learning Agents (ALA) Workshop, Singapore, May 2016. 2016

{UT} {A}ustin {V}illa 2011: A Champion Agent in the {R}obo{C}up 3{D} Soccer Simulation Competition Patrick MacAlpine and Daniel Urieli and Samuel Barrett and Shivaram Kalyanakrishnan and Francisco Ba... In Proc. of 11th Int. Conf. on Autonomous Agents and Multiagent Systems (AAMAS'12), June 2012... 2012

PAC Subset Selection in Stochastioc Multi-armed Bandits Shivaram Kalyanakrishnan and Ambuj Tewari and Peter Auer and Peter Stone In In proceedings of the 29th International Conference on Machine Learning (ICML 2012), June-... 2012

On Optimizing Interdependent Skills: A Case Study in Simulated 3D Humanoid Robot Soccer Daniel Urieli and Patrick MacAlpine and Shivaram Kalyanakrishnan and Yinon Bentor and Peter Stone In Proc. of 10th Int. Conf. on Autonomous Agents and Multiagent Systems (AAMAS'11), May 2011. 2011

On Learning with Imperfect Representations Shivaram Kalyanakrishnan and Peter Stone In Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learn... 2011

Characterizing Reinforcement Learning Methods through Parameterized Learning Problems Shivaram Kalyanakrishnan and Peter Stone Machine Learning, 2011. 2011

{UT} {A}ustin {V}illa 2011 3{D} {S}imulation {T}eam Report Patrick MacAlpine and Daniel Urieli and Samuel Barrett and Shivaram Kalyanakrishnan and Francisco Ba... Technical Report, Department of Computer Science, The University of Texas at Austin, December 2011. 2011

Efficient Selection of Multiple Bandit Arms: Theory and Practice Shivaram Kalyanakrishnan and Peter Stone In Proceedings of the 27th International Conference on Machine Learning (ICML 2010), 2010. 2010

An Empirical Analysis of Value Function-Based and Policy Search Reinforcement Learning Shivaram Kalyanakrishnan and Peter Stone In The Eighth International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 7... 2009

Learning Complementary Multiagent Behaviors: A Case Study Shivaram Kalyanakrishnan and Peter Stone In Proceedings of the RoboCup International Symposium 2009, 2009. Springer Verlag. 2009

Three Humanoid Soccer Platforms: Comparison and Synthesis Shivaram Kalyanakrishnan and Todd Hester and Michael Quinlan and Yinon Bentor and Peter Stone In Proceedings of the RoboCup International Symposium 2009, 2009. Springer Verlag. 2009

The UT Austin Villa 3D Simulation Soccer Team 2008 Shivaram Kalyanakrishnan and Yinon Bentor and Peter Stone Technical Report AI09-01, The University of Texas at Austin, Department of Computer Sciences, AI Lab... 2009

Model-based Reinforcement Learning in a Complex Domain Shivaram Kalyanakrishnan and Peter Stone and Yaxin Liu In Ubbo Visser and Fernando Ribeiro and Takeshi Ohashi and Frank Dellaert, editors, RoboCup-2007:... 2008

Half Field Offense in RoboCup Soccer: A Multiagent Reinforcement Learning Case Study Shivaram Kalyanakrishnan and Yaxin Liu and Peter Stone In Gerhard Lakemeyer and Elizabeth Sklar and Domenico Sorenti and Tomoichi Takahashi, editors, Ro... 2007

Batch Reinforcement Learning in a Complex Domain Shivaram Kalyanakrishnan and Peter Stone In The Sixth International Joint Conference on Autonomous Agents and Multiagent Systems, 650... 2007

The UT Austin Villa 3D Simulation Soccer Team 2007 Shivaram Kalyanakrishnan and Peter Stone Technical Report AI-07-348, The University of Texas at Austin, Department of Computer Sciences, AI L... 2007