Fast gradient-descent methods for temporal-difference learning with linear function approximation
Richard S. Sutton, Hamid Reza Maei, Doina Precup, Shalabh Bhatnagar, David Silver, Csaba Szepesvári, and Eric Wiewiora, 2009
BibTeX Entry
@InProceedings{Sutton+MPBSSW:2009,
  author = "Sutton, Richard S. and Maei, Hamid Reza and Precup, Doina and Bhatnagar, Shalabh and Silver, David and Szepesv{\'a}ri, Csaba and Wiewiora, Eric",
  title = "Fast gradient-descent methods for temporal-difference learning with linear function approximation",
  booktitle = "Proceedings of the Twenty-sixth Annual International Conference on Machine Learning (ICML 2009)",
  year = "2009",
  editor = "Danyluk, Andrea Pohoreckyj and Bottou, L{\'e}on and Littman, Michael L.",
  volume = "382",
  series = "ACM International Conference Proceeding Series",
  publisher = "ACM",
  pages = "993--1000",
  url = "http://www.cs.ualberta.ca/%7Esutton/papers/SMPBSSW-09.pdf",
  bib2html_rescat = "Function Approximation",
}