Off-Policy Temporal Difference Learning with Function Approximation
Doina Precup, Richard S. Sutton, and Sanjoy Dasgupta, 2001
Abstract
(unavailable)
BibTeX Entry
@InProceedings{Precup+SD:2001,
  author = "Precup, Doina and Sutton, Richard S. and Dasgupta, Sanjoy",
  title = "Off-Policy Temporal Difference Learning with Function Approximation",
  booktitle = "Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001)",
  year = "2001",
  ISBN = "1-55860-778-1",
  editor = "Brodley, Carla E. and Danyluk, Andrea Pohoreckyj",
  publisher = "Morgan Kaufmann",
  pages = "417--424",
  url = "http://www.cs.ualberta.ca/~sutton/papers/PSD-01-retypeset.pdf",
  bib2html_rescat = "Function Approximation",
}