A Comprehensive Survey of Multiagent Reinforcement Learning
Lucian Buşoniu, Robert Babuška, and Bart De Schutter, 2008
Half Field Offense in RoboCup Soccer: A Multiagent Reinforcement Learning Case Study
Shivaram Kalyanakrishnan, Yaxin Liu, and Peter Stone, 2007
Hierarchical multi-agent reinforcement learning
Mohammad Ghavamzadeh, Sridhar Mahadevan, and Rajbala Makar, 2006
Cooperative Multi-Agent Learning: The State of the Art
Liviu Panait and Sean Luke, 2005
Decentralized Control of Cooperative Systems: Categorization and Complexity Analysis
Claudia V. Goldman and Shlomo Zilberstein, 2004
The Complexity of Decentralized Control of Markov Decision Processes
Daniel S. Bernstein, Robert Givan, Neil Immerman, and Shlomo Zilberstein, 2002
Coordinated Reinforcement Learning
Carlos Guestrin, Michail G. Lagoudakis, and Ronald Parr, 2002
Multiagent Planning with Factored MDPs
Carlos Guestrin, Daphne Koller, and Ronald Parr, 2001
Distributed Value Functions
Jeff Schneider, Weng-Keen Wong, Andrew Moore, and Martin Riedmiller, 1999
Reinforcement Learning in the Multi-Robot Domain
Maja J. Matarić, 1997
Strongly Typed Genetic Programming in Evolving Cooperation Strategies
Thomas Haynes, Roger L. Wainwright, Sandip Sen, and Dale A. Schoenefeld, 1995
Multi-Agent Reinforcement Learning: Independent versus Cooperative Agents
Ming Tan, 1993
On Optimal Cooperation of Knowledge Sources - An Empirical Investigation
M. Benda, V. Jagannathan, and R. Dodhiawala, 1986