Function Approximation |   |   | Partial Observability |   |   | Learning Methods |   |   | Ensembles |   |   |
Stochastic Optimisation |   |   | General RL |   |   | General ML |   |   | Multiagent Learning |   |   |
Comparison/Integration |   |   | Bandits |   |   | Applications |   |   | Robot Soccer |   |   |
Humanoids |   |   | Parameter |   |   | MDP |   |   | Empirical |   |   |
Failure Warning |   |   | Representation |   |   | General AI |   |   | Neural Networks |   |   |
All |   |   |
Learning Complementary Multiagent Behaviors: A Case Study
Shivaram Kalyanakrishnan and Peter Stone, 2010
Details
A Case Study on Improving Defense Behavior in Soccer Simulation 2D: The NeuroHassle Approach
Thomas Gabel, Martin Riedmiller, and Florian Trost, 2009
Details
Simulation-Based Approach to General Game Playing
Hilmar Finnsson and Yngvi Björnsson, 2008
Details
Adaptive Treatment of Epilepsy via Batch-mode Reinforcement Learning
Arthur Guez, Robert D. Vincent, Massimo Avoli, and Joelle Pineau, 2008
Details
Model-Based Reinforcement Learning in a Complex Domain
Shivaram Kalyanakrishnan, Peter Stone, and Yaxin Liu, 2008
Details
Reinforcement learning of motor skills with policy gradients
Jan Peters and Stefan Schaal, 2008
Details
Managing Power Consumption and Performance of Computing Systems Using Reinforcement Learning
Gerald Tesauro, Rajarshi Das, Hoi Chan, Jeffrey O. Kephart, Charles Lefurgy, David W. Levine, and Freeman Rawson, 2008
Details
Self-Optimizing Memory Controllers: A Reinforcement Learning Approach
Engin \.Ipek, Onur Mutlu, José and Martínez, and Rich Caruana, 2008
Details
Learning RoboCup-Keepaway with Kernels
Tobias Jung and Daniel Polani, 2007
Details
Half Field Offense in RoboCup Soccer: A Multiagent Reinforcement Learning Case Study
Shivaram Kalyanakrishnan, Yaxin Liu, and Peter Stone, 2007
Details
Autonomous blimp control using model-free reinforcement learning in a continuous state and action space
Axel Rottmann, Christian Plagemann, Peter Hilgers, and Wolfram Burgard, 2007
Details
Reinforcement Learning of Local Shape in the Game of Go
David Silver, Richard S. Sutton, and Martin Müller, 2007
Details
Learning to Play Using Low-Complexity Rule-Based Policies: Illustrations through Ms. Pac-Man
István Szita and András L\Horincz, 2007
Details
On the use of hybrid reinforcement learning for autonomic resource allocation
Gerald Tesauro, Nicholas K. Jong, Rajarshi Das, and Mohamed N. Bennani, 2007
Details
Reinforcement learning for quasi-passive dynamic walking of an unstable biped robot
Kentarou Hitomi, Tomohiro Shibata, Yutaka Nakamura, and Shin Ishii, 2006
Details
Quadruped Robot Obstacle Negotiation via Reinforcement Learning
Honglak Lee, Yirong Shen, Chih-Han Yu, Gurjeet Singh, and Andrew Y. Ng, 2006
Details
Reinforcement learning for optimized trade execution
Yuriy Nevmyvaka, Yi Feng, and Michael Kearns, 2006
Details
Keepaway Soccer: From Machine Learning Testbed to Benchmark
Peter Stone, Gregory Kuhlmann, Matthew E. Taylor, and Yaxin Liu, 2006
Details
Learning Tetris using the noisy cross-entropy method
István Szita and András L\Horincz, 2006
Details
Reinforcement Learning for RoboCup-Soccer Keepaway
Peter Stone, Richard S. Sutton, and Gregory Kuhlmann, 2005
Details
Machine Learning for Fast Quadrupedal Locomotion
Nate Kohl and Peter Stone, 2004
Details
Reinforcement learning for sensing strategies
Cody Kwok and Dieter Fox, 2004
Details
Autonomous Helicopter Flight via Reinforcement Learning
Andrew Y. Ng, H. Jin Kim, Michael I. Jordan, and Shankar Sastry, 2004
Details
Multi-Agent Patrolling with Reinforcement Learning
Hugo Santana, Geber Ramalho, Vincent Corruble, and Bohdana Ratitch, 2004
Details
Stochastic policy gradient reinforcement learning on a simple 3D biped
Russ Tedrake, Teresa Weirui Zhang, and H. Sebastian Seung, 2004
Details
Adaptive Job Routing and Scheduling
Shimon Whiteson and Peter Stone, 2004
Details
Active Guidance for a Finless Rocket Using Neuroevolution
Faustino J. Gomez and Risto Miikkulainen, 2003
Details
Deep Blue
Murray Campbell, A. Joseph Hoane Jr., and Feng-hsiung Hsu, 2002
Details
Multiagent Planning with Factored MDPs
Carlos Guestrin, Daphne Koller, and Ronald Parr, 2001
Details
Learning to trade via direct reinforcement
John Moody and Matthew Saffell, 2001
Details
Planning treatment of ischemic heart disease with partially observable Markov decision processes
Milos Hauskrecht and Hamish Fraser, 2000
Details
Reinforcement Learning for Control of Self-Similar Call Traffic in Broadband Networks
Jakob Carlström and Ernst Nordström, 1999
Details
Distributed Value Functions
Jeff Schneider, Weng-Keen Wong, Andrew Moore, and Martin Riedmiller, 1999
Details
Symposium on Applications of Reinforcement Learning: Final Report for NSF Grant IIS-9810208
Pat Langley and Mark Pendrith, 1998
Details
Learning to Drive a Bicycle Using Reinforcement Learning and Shaping
Jette Randløv and Preben Alstrøm, 1998
Details
Reinforcement Learning: An Introduction
Richard S. Sutton and Andrew G. Barto, 1998
Details
Reinforcement Learning for Dynamic Channel Allocation in Cellular Telephone Systems
Satinder Singh and Dimitri Bertsekas, 1997
Details
Neuro-Dynamic Programming
Dimitri P. Bertsekas and John N. Tsitsiklis, 1996
Details
Improving Elevator Performance Using Reinforcement Learning
Robert H. Crites and Andrew G. Barto, 1996
Details
A Reinforcement Learning Approach to job-shop Scheduling
Wei Zhang and Thomas G. Dietterich, 1995
Details
Practical Issues in Temporal Difference Learning
Gerald Tesauro, 1992
Details
Further Real Applications of Markov Decision Processes
Douglas J. White, 1988
Details
Real Applications of Markov Decision Processes
Douglas J. White, 1985
Details