R-MAX - a general polynomial time algorithm for near-optimal reinforcement learning
R-MAX - a general polynomial time algorithm for near-optimal reinforcement learning
Ronen I. Brafman and Moshe Tennenholtz, 2003
Download
Abstract
(unavailable)
BibTeX Entry
@Article{Brafman+Tennenholtz:2003, author = "Brafman, Ronen I. and Tennenholtz, Moshe", title = "{R-MAX} - a general polynomial time algorithm for near-optimal reinforcement learning", journal = "Journal of Machine Learning Research", year = "2003", volume = "3", pages = "213--231", publisher = "MIT Press", address = "Cambridge, MA, USA", url = "http://jmlr.csail.mit.edu/papers/volume3/brafman02a/brafman02a.pdf", bib2html_rescat = "General RL", }