CS395T: Reinforcement Learning: Theory and Practice -- Fall 2004: Assignments Page

Assignments for Reinforcement Learning: Theory and Practice

Week 0 (8/26): Class Overview

For each class (after the first), be sure to submit a question or comment about the reading by 10pm on the day before any class that has a new reading assignment due as an email in plain ascii text. I prefer that is be sent in the body of the email, rather than as an attachment. Please use the subject line "class readings for [due date]".

Week 1 (8/31,9/2): Introduction

Chapter 1 of the textbook

Week 2 (9/7,9/9): Evaluative Feedback

Discussion leader: Chendi on Thursday.

Chapter 2 of the textbook

Week 3 (9/14,16): The Reinforcement Learning Problem

Chapter 3 of the textbook

Week 4 (9/21,23): Dynamic Programming

Discussion leader: Susan on Tuesday.

Chapter 4 of the textbook

Week 5 (9/28,9/30): Monte Carlo Methods

Discussion leader: Lily on Tuesday.

Chapter 5 of the textbook

Week 6 (10/5,7): Temporal Difference Learning

Discussion leader: Matt on Tuesday, Mazda on Thursday.

Chapter 6 of the textbook

Week 7 (10/12,14): Eligibility Traces

Discussion leader: Sit on Tuesday.

Chapter 7 of the textbook

Week 8 (10/19,21): Generalization and Function Approximation

Discussion leader: Igor on Tuesday.

Chapter 8 of the textbook

Class project proposal due at 12:30pm on Thursday. Please send an email with subject "Project Proposal" with a proposed topic for your class project. I anticipate projects taking one of two forms.

Practice (preferred): An implemenation of RL in some domain of your choice - ideally one that you are using for research or in some other class. In this case, please describe the domain and your initial plans on how you intend to implement learning. What will the states and actions be? What algorithm(s) do you expect will be most effective?

Theory: A proposal, implementation and testing of an algorithmic modification to an RL algorithm presented in the book. In this case, please describe the modification you propose to investigate and on what type of domain (possibly a toy domain) it is likely to show an improvement over things considered in the book.

Week 9 (10/26,28): Planning and Learning

Discussion leader: Greg on Tuesday.

Chapter 9 of the textbook

Week 10 (11/2,4): Case Studies

Discussion leader: Kurt on Thursday.

Chapters 10 and 11 of the textbook

Week 11 (11/9,11): Abstraction: Options and Hierarchy

Discussion leader: Alex on Tuesday, Jon on Thursday.

Between MDPs and semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning.
Sutton, R.S., Precup, D., Singh, S.
Artificial Intelligence 112:181-211, 1999.
Due Tuesday.

The MAXQ Method for Hierarchical Reinforcement Learning.
Thomas G. Dietterich
Proceedings of the 15th International Conference on Machine Learning, 1998.
Due Thursday.

Week 12 (11/16,18): Helicopter and Robot Control

Discussion leader: Michael on Tuesday.

Autonomous Helicopter Control using Reinforcement Learning Policy Search Methods.
J. Bagnell and J. Schneider
Proceedings of the International Conference on Robotics and Automation 2001, IEEE, May, 2001.
Due Tuesday.

Inverted autonomous helicopter flight via reinforcement learning.
Andrew Y. Ng, Adam Coates, Mark Diel, Varun Ganapathi, Jamie Schulte, Ben Tse, Eric Berger and Eric Liang.
International Symposium on Experimental Robotics, 2004.
Due Tuesday.

Learning from Observation and Practice Using Primitives.
Darrin Bentivegna, Christopher Atkeson, and Gordon Cheng.
AAAI Fall Symposium on Real Life Reinforcement Learning, 2004.
Due Thursday.

Week 13 (11/23): Robot Soccer

Scaling Reinforcement Learning toward RoboCup Soccer.
Peter Stone and Richard S. Sutton.
Proceedings of the Eighteenth International Conference on Machine Learning, pp. 537-544, Morgan Kaufmann, San Francisco, CA, 2001.
Due Tuesday.

Reinforcement Learning for Sensing Strategies.
C. Kwok and D. Fox.
Proceedings of IROS, 2004.
Due Tuesday.

Week 14 (11/30,12/2): Incorporating Advice

Creating Advice-Taking Reinforcement Learners.
R. Maclin & J. Shavlik.
Machine Learning, 22, pp. 251-281, 1996.
Due Tuesday.

Guiding a Reinforcement Learner with Natural Language Advice: Initial Results in RoboCup Soccer.
Gregory Kuhlmann, Peter Stone, Raymond Mooney, and Jude Shavlik.
The AAAI-2004 Workshop on Supervisory Control of Learning and Adaptive Systems, July 2004.
Due Thursday.

[Back to Department Homepage]

Page maintained by Peter Stone
Questions? Send me mail